TEXXR

Chronicles

The story behind the story

Anthropic releases Claude Opus 4.7, saying it is a “notable improvement” on Opus 4.6 in advanced software engineering and comes with a new “xhigh” effort level

Our latest model, Claude Opus 4.7, is now generally available.  —  Opus 4.7 is a notable improvement …

Anthropic

Discussion

  • @alexalbert__ Alex Albert on x
    Some of my favorite things in Opus 4.7: - Very good at async work and following instructions - Effort levels are far more predictable for token control (+ new xhigh level) - No more downscaling of high-res images - Noticeably more taste in UIs, slides, docs
  • @natolambert Nathan Lambert on x
    Opus 4.7 has a new tokenizer. This means it's also a new base model. Glory days of pretraining still very much going. [image]
  • @yuchenj_uw Yuchen Jin on x
    Claude Opus 4.7 is out! Benchmark scores look pretty strong, but clearly much worse than Mythos. It's a nerfed Mythos, they deliberately reduced cyber capabilities during training. [image]
  • @theamolavasare Amol Avasare on x
    Opus 4.7 is out! Live on our API, Claude Code, Cowork, and Claude chat. Thing I'm noticing internally: people are re-scoping what they hand to the model. Work that got chunked into small pieces for 4.6 because it was too ambiguous or too long is now going in as one task.
  • @zephyr_z9 @zephyr_z9 on x
    Need to distill Mythos harder
  • @cognition @cognition on x
    Claude Opus 4.7 is now part of Devin's agent harness! Anthropic has clearly optimized Claude Opus 4.7 for long-horizon autonomy, unlocking a class of deep investigation work we couldn't reliably run before. Claude Opus 4.7 model costs within Devin will be available at
  • @cursor_ai @cursor_ai on x
    Claude Opus 4.7 is now available in Cursor. We've found it to be impressively autonomous and more creative in its reasoning. We're launching it with 50% off for a limited time. Enjoy!
  • @scaling01 @scaling01 on x
    Opus 4.7 comes with much improved reasoning-efficiency over Opus 4.6 basically everything is now moved up one tier low is as good as medium medium as good as high high as good as max [image]
  • @claudeai Claude on x
    In Claude Code, the new /ultrareview command runs a dedicated review session that reads through your changes and flags what a careful reviewer would catch. We've also extended auto mode to Max users, so longer tasks run with fewer interruptions.
  • @claudeai Claude on x
    Introducing Claude Opus 4.7, our most capable Opus model yet. It handles long-running tasks with more rigor, follows instructions more precisely, and verifies its own outputs before reporting back. You can hand off your hardest work with less supervision. [image]
  • @claudeai Claude on x
    On the API, a new xhigh effort level between high and max gives you finer control over reasoning and latency on hard problems. Task budgets (beta) help Claude prioritize work and manage costs across longer runs.
  • @claudeai Claude on x
    Opus 4.7 also has substantially better vision. It can see images at more than three times the resolution and produces higher-quality interfaces, slides, and docs as a result.
  • @scaling01 @scaling01 on x
    big jump in coding capabilities by Claude 4.7 Opus SWE-Bench Pro 64.3% SWE-Bench Verified 87.6% TerminalBench 69.4% but interestingly, I think they kept CyberGym scores artificially low
  • @eliebakouch Elie on x
    opus 4.7 vs 4.6 on every benchmark from the system card [image]
  • @miles_brundage Miles Brundage on x
    Anthropic? The “we do not wish to advance the rate of AI capabilities progress” company? [image]
  • @mikeyk Mike Krieger on x
    Claude Opus 4.7 is out! Handles ambiguous, multi-step work even better than 4.6. Cursor's internal bench cleared 70%, up from 58% on 4.6. Notion saw a 14% lift on their evals with a third of the tool errors 🔨
  • @noahzweben Noah Zweben on x
    Yet another intelligence leap!
  • @scaling01 @scaling01 on x
    Highlights of the Opus 4.7 Launch: - new tokenizer - new xhigh reasoning effort - big improvements on all benchmarks, except long-context and cyber (they keep cyber capabilities artificially low) - notable are gains in real-world tasks - improved vision (incl larger input
  • @theprimeagen @theprimeagen on x
    It's truly space age technology that we can make something worse and then increment a number and re-release it the public.
  • @victortaelin @victortaelin on x
    Anthropic just launched a model that loses in ALL benchmarks to another model... that is also theirs 😭 I'm thankful yet worried - are we entering a world where the big labs just stop publishing their leading AI's? As a Brazilian I can only ask: for the betas, what remains?
  • @scaling01 @scaling01 on x
    for all the people calling Opus 4.7 a mid update lmao [image]
  • @deryatr_ Derya Unutmaz on x
    Claude Opus 4.7 is released! It improves on most benchmarks compared to 4.6. Graduate-level reasoning is now on par with GPT-5.4 & Gemini 3.1 Pro! It's SOTA for agentic coding and computer use! Haven't checked the token cost, but it's likely not cheap. AI progress is exponential!
  • @deedydas Deedy on x
    Opus 4.7 benchmarks colored by ranking. - Strong coding (SWE-Bench) bump - Strong Computer use bump - Strong visual reasoning (CharXiv) bump - Weak Terminal Bench bump - BrowseComp regression Slots in between 4.6 and Mythos. [Chart generated by 4.7] [image]
  • @_arohan_ Rohan Anil on x
    The pace at which Anthropic is shipping Opus variants is a very new thing in the industry.
  • @laki_0x Laki on x
    the new Claude Opus 4.7 has been released has improved in two areas: → writes code (+10%) → understands images and graphs (+13%) otherwise, it's pretty much the same as before but the next model the Mythos is already shown in the table next to it. It's even more powerful
  • @angaisb_ Angel on x
    So the price is the same but it thinks more and now inputs are split into more tokens It's going to be really expensive to use it isn't it [image]
  • @felixrieseberg Felix Rieseberg on x
    Happy model launch day! Opus 4.7 is now available on all products and a significant step up from Opus 4.6. It's better at coding, computer use, finance, and general knowledge work. 🧵 I'll put the 5 things I find most interesting in thread! [image]
  • @tekbog @tekbog on x
    saying hi to claude and immediately running out of tokens
  • @scaling01 @scaling01 on x
    Opus 4.7 much less likely to sudo rm -rf (taking destructive actions in production envs) [image]
  • @mattshumer_ Matt Shumer on x
    As you use Claude Opus 4.7, keep in mind that however good it is, Mythos is still substantially better. The frontier is getting insanely powerful.
  • @rlancemartin Lance Martin on x
    ive had a lot of fun using Opus 4.7 over the past weeks. one impt tip: there's a new effort level (xhigh) that is recommended for most agentic / coding use-cases. https://x.com/... [image]
  • @hooeem @hooeem on x
    this is why opus 4.6 was being dogshit
  • @coderabbitai @coderabbitai on x
    We ran Anthropic Claude Opus 4.7 against our hardest benchmark - complex concurrency bugs that require multi-step reasoning. Almost 20% better than previous generations. Here's what we did to get it production-ready and what you can take from it for your own stack 👇 [video]
  • @scaling01 @scaling01 on x
    Confirmed: Anthropic keeping Cyber capabilities of Opus 4.7 artificially low “during training we experimented with efforts to differentially reduce these capabilities” [image]
  • @testingcatalog @testingcatalog on x
    Anthropic released Claude Opus 4.7 👀 Opus 4.7 is a notable improvement over Opus 4.6 in software engineering and vision tasks. > Opus 4.7 handles complex, long-running tasks with rigor and consistency, pays precise attention to instructions, and devises ways to verify its own [im…
  • @hackingdave Dave Kennedy on x
    Lets hope this model goes back to how amazing Opus 4.6 was a month ago.
  • @notionhq @notionhq on x
    Opus 4.7, Anthropic's most intelligent model, is now in Notion! It's a real step up from Opus 4.6 for multi-step workflows. It uses fewer tokens. About 3× fewer tool errors. And it can troubleshoot weird mid-workflow issues like a real teammate. [image]
  • @thezvi Zvi Mowshowitz on x
    No rest for the wicked, and no rest for anyone else, I suppose.
  • @himanshustwts Himanshu on x
    benchmarks aside, this is the real BIG change in Opus 4.7 and Opus 4.6. [image]
  • @saurav_tweets Saurav on x
    polymarket is right again [image]
  • @hesamation @hesamation on x
    Hey, we got 2 months before it's down to Sonnet 3.5 intelligence. [image]
  • @kimmonismus @kimmonismus on x
    Claude Opus 4.7 is out. the TL;DR Anthropic released Opus 4.7 today. Same pricing as 4.6 ($5/$25 per million tokens), available across API, Bedrock, Vertex AI, and Microsoft Foundry. What changed vs Opus 4.6: Coding (obviously). Biggest gains on the hardest, long-horizon [image]
  • @natolambert Nathan Lambert on x
    The current pace of token-efficient reasoning improvements across minor Claude Opus/GPT model versions is pretty wild. All signs point to this continuing. 4.6 to 4.7 could've been presented as a fairly large model bump in the past with this plot. [image]
  • @scaling01 @scaling01 on x
    Opus 4.7 with a 371.75x speedup over baseline improves over Opus 4.6 in all benchmarks [image]
  • @thekitze @thekitze on x
    unnerfed 4.6*
  • @exm7777 Machina on x
    they did all of this just to flex Mythos' numbers lmao
  • @iterintellectus Vittorio on x
    IT'S HAPPENING [image]
  • @scaling01 @scaling01 on x
    Opus 4.7 System Card https://cdn.sanity.io/... [image]
  • @developedbyed @developedbyed on x
    Welcome back Claude Opus 4.6 [image]
  • @alexpalcuie @alexpalcuie on x
    asked claude opus 4.7 about reliability and it blamed the kv cache before serving a single public token [image]
  • @kimmonismus @kimmonismus on x
    Opus 4.7 Benchmarks out! Very solid upgrade to Opus 4.6! Compared to Opus 4.6: -SWE Bench Pro +11% -SWE Bench Verified +7% -Terminal Bench 2.0 +4% The benchmarks are significantly lower than for Mythos, but that was to be expected. h/t for finding @synthwavedd [image]
  • Rahul Patil on linkedin
    Claude Opus 4.7 is out today.  There are some significant behavioural changes.  It reasons more, reaches for tools less, follows instructions more literally …
  • Marquist Allen on linkedin
    We just launched Claude Opus 4.7, our most capable generally available model, scoring 64.3% on SWE-bench Pro and 69.4% on Terminal-Bench 2.0. …
  • @timkellogg.me Tim Kellogg on bluesky
    New Opus 4.7  —  Across the board, it closes the gap between Opus 4.6 & Mythos by about 50%, in some case almost same as Mythos  —  www.anthropic.com/news/claude- ...  [image]
  • r/technology on reddit
    Claude Opus 4.7 released: Notable improvement in advanced software engineering, with particular gains on the most difficult tasks
  • r/accelerate on reddit
    Introducing Claude Opus 4.7
  • r/GithubCopilot on reddit
    New Opus 4.7 released
  • r/Anthropic on reddit
    Claude Opus 4.7 released
  • r/ClaudeAI on reddit
    Opus 4.7 Released!
  • @arankomatsuzaki Aran Komatsuzaki on x
    Nearly 1/3 of surveyed people in Anthropic now think entry-level engineers and researchers are likely replaced by Mythos within 3 months [image]
  • @intern @intern on x
    thanks for giving us Opus 4.7 with Mythos mogging it ur actually goated for that. it's probably the coolest thing ive seen in my life, you released an AI model to AI users but you put Mythos there so we know it's the mid version. heroic. i love worse models thats fire bro thanks …
  • @_nathancalvin Nathan Calvin on x
    This part of the 4.7 Opus system card is pretty neat and seems potentially worth emulating (Anthropic showed Mythos the private discussions/evidence underlying the system card and asked Mythos if the Opus system card accurately characterized that private evidence) [image]
  • @shaughnessy119 Tommy on x
    Opus 4.7 publicly marks the divide between what's available to you (4.7) vs what's available to them (Mythos) Top private AI models were always closed, but now the top tier is both closed and unavailable
  • @hosseeb Haseeb on x
    Interesting that they are now showing these benchmarks side-by-side with Mythos, to reinforce that you do not have access to the most intelligent model. I always wondered when we'd get here. But we have now for the first time entered the undemocratic era of AI. You are not
  • @daniel_mac8 Dan McAteer on x
    Claude Opus 4.7 is here. Significant benchmark jumps over Opus 4.6. OpenAI's turn now. ‘Spud 🥔’ is baked. Let's eat. I bet Spud 🥔 benchmarks are closer to Mythos Preview than Opus 4.7. [image]
  • @scaling01 @scaling01 on x
    Opus 4.7 (orange) with higher verbalized evaluation awareness than Claude Mythos (green) Opus 4.6 (blue) Sonnet 4.6 (yellow) [image]
  • @scaling01 @scaling01 on x
    Opus 4.7 as robust to prompt injections as Claude Mythos [image]
  • @scaling01 @scaling01 on x
    Claude Opus 4.7 exactly on the Anthropic ECI trend Claude Mythos notably above the trend! [image]
  • @kr0der Anthony Kroeger on x
    Opus 4.7 has arrived 👀 looks like a small improvement over Opus 4.6 in most of the benchmarks, but this image feels like it's advertising Mythos not Opus 4.7 the new /ultrareview in Claude Code looks interesting though, definitely trying that out
  • @andrewcurran_ Andrew Curran on x
    Many sightings of Opus 4.7 in the wild, it's looking like today is the day. If it feels like this is too soon, that's because the last Opus release was only 73 days ago. Let's call it Mythos-induced acceleration. Today was also the rumored release day for Spud, aka GPT-5.5. Not
  • @bogdanionutcir2 Bogdan Ionut Cirstea on x
    seems probably good for safety, especially if most capabilities gains came from pretraining
  • @topmass Matthew on x
    opus 4.7 tokenizer is new and uses more tokens for the same inputs... AND the new default reasoning effort inside of claude code will be high - get ready to tear through your limits! [image]
  • @natolambert Nathan Lambert on x
    There's good discussion around this one ways that it could just be adaptation at midtraining, but base model is the simplest explanation so that's my bet.
  • @myainotez @myainotez on x
    New Opus is out, they mention a new tokenizer too. Maybe we will have breadcrumbs of mythos in this one
  • @bspk_ @bspk_ on x
    New base model!
  • @schiste Christophe Henner on x
    Oh gosh, they removed 4.6 altogether from selectors. So I have to say, this does looks a lot like a downsell disguised in an upsell. A few upgrades, but a new tokenizer eating tokens much faster. Well, we knew the time to stop brut forcing things with Opus had to end. [image]
  • @maximelabonne Maxime Labonne on x
    My bet is that Mythos uses a new tokenizer, and they switched Opus over to it (through midtraining) for distillation
  • @andrew_n_carr Andrew Carr on x
    4.7 has a new tokenizer (in-part) because of the 3x vision scaling improvements
  • @realsigridjin Sigrid Jin on x
    opus 4.7 has a new tokenizer which means a new base model underneath, not just a post-training refresh [image]
  • @eliebakouch Elie on x
    my take: opus 4.7 is a distilled version of mythos
  • @kunchamsathwik @kunchamsathwik on x
    Claude Opus 4.7 launched Thing I noticed: 1. They changed the tokenizer which may map to 35% more tokens. 2. Model by default thinks more. Overall, higher token use and faster rate limit hits. [image]
  • @realsigridjin Sigrid Jin on x
    tldr; @ClaudeDevs opus 4.7 just shipped as expected > the tokenizer changed. same input maps to 1.0 to 1.35x more tokens depending on content type > output tokens also go up at higher effort, the model thinks longer on later turns in agentic loops > new effort level called
  • r/ClaudeAI on reddit
    Introducing Claude Opus 4.7
  • @pekka Pekka Lund on bluesky
    Anthropic has released Claude Opus 4.7 with generally significant improvements over Opus 4.6.  But they say they have purposefully reduced its cyber capabilities during training.  —  www.anthropic.com/news/claude- ...  [image]
  • @wittywebhandle Blaise Ulysse Bernard Collins on bluesky
    This is incredibly funny.  —  The update is one of their vaguest yet and theyre basically doing it just because Mythos is never going to be a broad release & they've hit their ceiling w/ Opus 4.6
  • Jared Snyder on linkedin
    Anthropic's Opus 4.7 went GA today marking notable improvement in the model's offensive security capabilities. …
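
Several posts above combine two claims: pricing is unchanged from Opus 4.6 ($5/$25 per million tokens, per @kimmonismus), while the new tokenizer maps the same input to 1.0 to 1.35x more tokens (per @realsigridjin). The following is a rough sketch of what those two figures would imply for input cost, assuming both are accurate as quoted. The function names are hypothetical, and the calculation ignores output tokens and any change in how long the model reasons.

```python
def input_cost_usd(tokens: float, price_per_million_usd: float = 5.0) -> float:
    """Cost in USD for a given number of input tokens.

    The default price is the $5 per million input tokens quoted in the
    discussion above (an assumption taken from those posts).
    """
    return tokens / 1_000_000 * price_per_million_usd


def cost_range_after_tokenizer_change(
    old_tokens: int,
    low_multiplier: float = 1.0,    # quoted lower bound of the token increase
    high_multiplier: float = 1.35,  # quoted upper bound of the token increase
    price_per_million_usd: float = 5.0,
) -> tuple[float, float]:
    """Return (lowest, highest) input cost for text that used to be old_tokens."""
    low = input_cost_usd(old_tokens * low_multiplier, price_per_million_usd)
    high = input_cost_usd(old_tokens * high_multiplier, price_per_million_usd)
    return low, high


if __name__ == "__main__":
    low, high = cost_range_after_tokenizer_change(1_000_000)
    print(f"${low:.2f} to ${high:.2f}")
```

Under these assumptions, input that cost $5.00 per million tokens with the old tokenizer would cost between roughly $5.00 and $6.75, depending on content type.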