TEXXR

Chronicles

The story behind the story

Anthropic releases Claude Opus 4.7, saying it is a “notable improvement” on Opus 4.6 in advanced software engineering and comes with a new “xhigh” effort level

Our latest model, Claude Opus 4.7, is now generally available.  —  Opus 4.7 is a notable improvement …

Anthropic

Discussion

  • @claudeai Claude on x
    Introducing Claude Opus 4.7, our most capable Opus model yet. It handles long-running tasks with more rigor, follows instructions more precisely, and verifies its own outputs before reporting back. You can hand off your hardest work with less supervision. [image]
  • @miles_brundage Miles Brundage on x
    Anthropic? The “we do not wish to advance the rate of AI capabilities progress” company? [image]
  • @theprimeagen @theprimeagen on x
    It's truly space age technology that we can make something worse and then increment a number and re-release it the public.
  • @tekbog @tekbog on x
    saying hi to claude and immediately running out of tokens
  • @deryatr_ Derya Unutmaz on x
    Claude Opus 4.7 is released! It improves on most benchmarks compared to 4.6. Graduate-level reasoning is now on par with GPT-5.4 & Gemini 3.1 Pro! It's SOTA for agentic coding and computer use! Haven't checked the token cost, but it's likely not cheap. AI progress is exponential!
  • @deedydas Deedy on x
    Opus 4.7 benchmarks colored by ranking. - Strong coding (SWE-Bench) bump - Strong Computer use bump - Strong visual reasoning (CharXiv) bump - Weak Terminal Bench bump - BrowseComp regression Slots in between 4.6 and Mythos. [Chart generated by 4.7] [image]
  • @emollick Ethan Mollick on x
    It basically rarely seems to think on analysis, writing, or research tasks, which means it isn't using tools or web search. Haven't tested everything yet, so not definitive, but I am often getting lower quality answers for that sort of use case that Opus 4.6 Extended Thinking. [i…
  • @eliebakouch Elie on x
    opus 4.7 vs 4.6 on every benchmark from the system card [image]
  • @ashen_one @ashen_one on x
    image recognition on opus 4.7 is really really good i couldn't even read this screenshot myself and opus 4.7 read the entire thing completely [video]
  • @scaling01 @scaling01 on x
    Highlights of the Opus 4.7 Launch: - new tokenizer - new xhigh reasoning effort - big improvements on all benchmarks, except long-context and cyber (they keep cyber capabilities artificially low) - notable are gains in real-world tasks - improved vision (incl larger input
  • @scaling01 @scaling01 on x
    for all the people calling Opus 4.7 a mid update lmao [image]
  • @emollick Ethan Mollick on x
    With max thinking Opus 4.7 is quite impressive, with a real sense of style In two prompts: “implement the Tower of Babel, in 3D, in as sophisticated and visually interesting a way as possible. It should be interactive” and then “make it better.” Play: https://tower-of-babel-17763…
  • @victortaelin @victortaelin on x
    Anthropic just launched a model that loses in ALL benchmarks to another model... that is also theirs 😭 I'm thankful yet worried - are we entering a world where the big labs just stop publishing their leading AI's? As a Brazilian I can only ask: for the betas, what remains?
  • @_arohan_ Rohan Anil on x
    The pace at which Anthropic is shipping Opus variants is a very new thing in the industry.
  • @emollick Ethan Mollick on x
    It is not well-explained, but with the adaptive switch off, I get no thinking. I can set thinking levels in Claude Code, but not in Claude Cowork. AI companies keep seeming to assume that coding/technical work is the only kind of important intellectual work out there (it is not)
  • @mikeyk Mike Krieger on x
    Claude Opus 4.7 is out! Handles ambiguous, multi-step work even better than 4.6. Cursor's internal bench cleared 70%, up from 58% on 4.6. Notion saw a 14% lift on their evals with a third of the tool errors 🔨
  • @mparakhin Mikhail Parakhin on x
    A definite +1 to Ethan. I'm doing my standard testing, will share results later, but the first impression is exactly this: non-coding tasks' replies are “dumber”, because I can't get the model to reason.
  • @noahzweben Noah Zweben on x
    Yet another intelligence leap!
  • @emollick Ethan Mollick on x
    I think the adaptive thinking requirement in Claude Opus 4.7 is bad in the ways that all AI effort routers are bad, but magnified by the fact that there is no manual override like in ChatGPT. It regularly decides that non-math/code stuff is “low effort” & produces worse results. …
  • @laki_0x Laki on x
    the new Claude Opus 4.7 has been released has improved in two areas: → writes code (+10%) → understands images and graphs (+13%) otherwise, it's pretty much the same as before but the next model the Mythos is already shown in the table next to it. It's even more powerful
  • @angaisb_ Angel on x
    So the price is the same but it thinks more and now inputs are split into more tokens It's going to be really expensive to use it isn't it [image]
  • @felixrieseberg Felix Rieseberg on x
    Happy model launch day! Opus 4.7 is now available on all products and a significant step up from Opus 4.6. It's better at coding, computer use, finance, and general knowledge work. 🧵 I'll put the 5 things I find most interesting in thread! [image]
  • @scaling01 @scaling01 on x
    Opus 4.7 much less likely to sudo rm -rf (taking destructive actions in production envs) [image]
  • @mattshumer_ Matt Shumer on x
    As you use Claude Opus 4.7, keep in mind that however good it is, Mythos is still substantially better. The frontier is getting insanely powerful.
  • @alexalbert__ Alex Albert on x
    Some of my favorite things in Opus 4.7: - Very good at async work and following instructions - Effort levels are far more predictable for token control (+ new xhigh level) - No more downscaling of high-res images - Noticeably more taste in UIs, slides, docs
  • @natolambert Nathan Lambert on x
    Opus 4.7 has a new tokenizer. This means it's also a new base model. Glory days of pretraining still very much going. [image]
  • @rlancemartin Lance Martin on x
    ive had a lot of fun using Opus 4.7 over the past weeks. one impt tip: there's a new effort level (xhigh) that is recommended for most agentic / coding use-cases. https://x.com/... [image]
  • @hooeem @hooeem on x
    this is why opus 4.6 was being dogshit
  • @coderabbitai @coderabbitai on x
    We ran Anthropic Claude Opus 4.7 against our hardest benchmark - complex concurrency bugs that require multi-step reasoning. Almost 20% better than previous generations. Here's what we did to get it production-ready and what you can take from it for your own stack 👇 [video]
  • @scaling01 @scaling01 on x
    Confirmed: Anthropic keeping Cyber capabilities of Opus 4.7 artificially low “during training we experimented with efforts to differentially reduce these capabilities” [image]
  • @testingcatalog @testingcatalog on x
    Anthropic released Claude Opus 4.7 👀 Opus 4.7 is a notable improvement over Opus 4.6 in software engineering and vision tasks. > Opus 4.7 handles complex, long-running tasks with rigor and consistency, pays precise attention to instructions, and devises ways to verify its own [im…
  • @hackingdave Dave Kennedy on x
    Lets hope this model goes back to how amazing Opus 4.6 was a month ago.
  • @notionhq @notionhq on x
    Opus 4.7, Anthropic's most intelligent model, is now in Notion! It's a real step up from Opus 4.6 for multi-step workflows. It uses fewer tokens. About 3× fewer tool errors. And it can troubleshoot weird mid-workflow issues like a real teammate. [image]
  • @thezvi Zvi Mowshowitz on x
    No rest for the wicked, and no rest for anyone else, I suppose.
  • @himanshustwts Himanshu on x
    benchmarks aside, this is the real BIG change in Opus 4.7 and Opus 4.6. [image]
  • @yuchenj_uw Yuchen Jin on x
    Claude Opus 4.7 is out! Benchmark scores look pretty strong, but clearly much worse than Mythos. It's a nerfed Mythos, they deliberately reduced cyber capabilities during training. [image]
  • @saurav_tweets Saurav on x
    polymarket is right again [image]
  • @hesamation @hesamation on x
    Hey, we got 2 months before it's down to Sonnet 3.5 intelligence. [image]
  • @kimmonismus @kimmonismus on x
    Claude Opus 4.7 is out. the TL;DR Anthropic released Opus 4.7 today. Same pricing as 4.6 ($5/$25 per million tokens), available across API, Bedrock, Vertex AI, and Microsoft Foundry. What changed vs Opus 4.6: Coding (obviously). Biggest gains on the hardest, long-horizon [image]
  • @natolambert Nathan Lambert on x
    The current pace of token-efficient reasoning improvements across minor Claude Opus/GPT model versions is pretty wild. All signs point to this continuing. 4.6 to 4.7 could've been presented as a fairly large model bump in the past with this plot. [image]
  • @scaling01 @scaling01 on x
    Opus 4.7 with a 371.75x speedup over baseline improves over Opus 4.6 in all benchmarks [image]
  • @theamolavasare Amol Avasare on x
    Opus 4.7 is out! Live on our API, Claude Code, Cowork, and Claude chat. Thing I'm noticing internally: people are re-scoping what they hand to the model. Work that got chunked into small pieces for 4.6 because it was too ambiguous or too long is now going in as one task.
  • @thekitze @thekitze on x
    unnerfed 4.6*
  • @zephyr_z9 @zephyr_z9 on x
    Need to distill Mythos harder
  • @exm7777 Machina on x
    they did all of this just to flex Mythos' numbers lmao
  • @cognition @cognition on x
    Claude Opus 4.7 is now part of Devin's agent harness! Anthropic has clearly optimized Claude Opus 4.7 for long-horizon autonomy, unlocking a class of deep investigation work we couldn't reliably run before. Claude Opus 4.7 model costs within Devin will be available at
  • @iterintellectus Vittorio on x
    IT'S HAPPENING [image]
  • @scaling01 @scaling01 on x
    Opus 4.7 System Card https://cdn.sanity.io/... [image]
  • @cursor_ai @cursor_ai on x
    Claude Opus 4.7 is now available in Cursor. We've found it to be impressively autonomous and more creative in its reasoning. We're launching it with 50% off for a limited time. Enjoy!
  • @developedbyed @developedbyed on x
    Welcome back Claude Opus 4.6 [image]
  • @scaling01 @scaling01 on x
    Opus 4.7 comes with much improved reasoning-efficiency over Opus 4.6 basically everything is now moved up one tier low is as good as medium medium as good as high high as good as max [image]
  • @alexpalcuie @alexpalcuie on x
    asked claude opus 4.7 about reliability and it blamed the kv cache before serving a single public token [image]
  • @claudeai Claude on x
    In Claude Code, the new /ultrareview command runs a dedicated review session that reads through your changes and flags what a careful reviewer would catch. We've also extended auto mode to Max users, so longer tasks run with fewer interruptions.
  • @claudeai Claude on x
    On the API, a new xhigh effort level between high and max gives you finer control over reasoning and latency on hard problems. Task budgets (beta) help Claude prioritize work and manage costs across longer runs.
  • @claudeai Claude on x
    Opus 4.7 also has substantially better vision. It can see images at more than three times the resolution and produces higher-quality interfaces, slides, and docs as a result.
  • @kimmonismus @kimmonismus on x
    Opus 4.7 Benchmarks out! Very solid upgrade to Opus 4.6! Compared to Opus 4.6: -SWE Bench Pro +11% -SWE Bench Verified +7% -Terminal Bench 2.0 +4% The benchmarks are significantly lower than for Mythos, but that was to be expected. h/t for finding @synthwavedd [image]
  • @scaling01 @scaling01 on x
    big jump in coding capabilities by Claude 4.7 Opus SWE-Bench Pro 64.3% SWE-Bench Verified 87.6% TerminalBench 69.4% but interestingly, I think they kept CyberGym scores artificially low
  • Jared Snyder on linkedin
    Anthropic's Opus 4.7 went GA today marking notable improvement in the model's offensive security capabilities. …
  • Rahul Patil on linkedin
    Claude Opus 4.7 is out today.  There are some significant behavioural changes.  It reasons more, reaches for tools less, follows instructions more literally …
  • Marquist Allen on linkedin
    We just launched Claude Opus 4.7, our most capable generally available model, scoring 64.3% on SWE-bench Pro and 69.4% on Terminal-Bench 2.0. …
  • @timkellogg.me Tim Kellogg on bluesky
    New Opus 4.7  —  Across the board, it closes the gap between Opus 4.6 & Mythos by about 50%, in some case almost same as Mythos  —  www.anthropic.com/news/claude- ...  [image]
  • @pekka Pekka Lund on bluesky
    Anthropic has released Claude Opus 4.7 with generally significant improvements over Opus 4.6.  But they say they have purposefully reduced its cyber capabilities during training.  —  www.anthropic.com/news/claude- ...  [image]
  • r/technology on reddit
    Claude Opus 4.7 released: Notable improvement in advanced software engineering, with particular gains on the most difficult tasks
  • r/accelerate on reddit
    Introducing Claude Opus 4.7
  • r/GithubCopilot on reddit
    New Opus 4.7 released
  • r/Anthropic on reddit
    Claude Opus 4.7 released
  • r/ClaudeAI on reddit
    Opus 4.7 Released!
  • @arankomatsuzaki Aran Komatsuzaki on x
    Nearly 1/3 of surveyed people in Anthropic now think entry-level engineers and researchers are likely replaced by Mythos within 3 months [image]
  • @hosseeb Haseeb on x
    Interesting that they are now showing these benchmarks side-by-side with Mythos, to reinforce that you do not have access to the most intelligent model.  I always wondered when we'd get here.  But we have now for the first time entered the undemocratic era of AI.  You are not imp…
  • @_nathancalvin Nathan Calvin on x
    This part of the 4.7 Opus system card is pretty neat and seems potentially worth emulating (Anthropic showed Mythos the private discussions/evidence underlying the system card and asked Mythos if the Opus system card accurately characterized that private evidence) [image]
  • @shaughnessy119 Tommy on x
    Opus 4.7 publicly marks the divide between what's available to you (4.7) vs what's available to them (Mythos) Top private AI models were always closed, but now the top tier is both closed and unavailable
  • @intern @intern on x
    thanks for giving us Opus 4.7 with Mythos mogging it ur actually goated for that. it's probably the coolest thing ive seen in my life, you released an AI model to AI users but you put Mythos there so we know it's the mid version. heroic. i love worse models thats fire bro thanks …
  • @daniel_mac8 Dan McAteer on x
    Claude Opus 4.7 is here. Significant benchmark jumps over Opus 4.6. OpenAI's turn now. ‘Spud 🥔’ is baked. Let's eat. I bet Spud 🥔 benchmarks are closer to Mythos Preview than Opus 4.7. [image]
  • @scaling01 @scaling01 on x
    Opus 4.7 (orange) with higher verbalized evaluation awareness than Claude Mythos (green) Opus 4.6 (blue) Sonnet 4.6 (yellow) [image]
  • @scaling01 @scaling01 on x
    Opus 4.7 as robust to prompt injections as Claude Mythos [image]
  • @scaling01 @scaling01 on x
    Claude Opus 4.7 exactly on the Anthropic ECI trend Claude Mythos notably above the trend! [image]
  • @kr0der Anthony Kroeger on x
    Opus 4.7 has arrived 👀 looks like a small improvement over Opus 4.6 in most of the benchmarks, but this image feels like it's advertising Mythos not Opus 4.7 the new /ultrareview in Claude Code looks interesting though, definitely trying that out
  • @andrewcurran_ Andrew Curran on x
    Many sightings of Opus 4.7 in the wild, it's looking like today is the day. If it feels like this is too soon, that's because the last Opus release was only 73 days ago. Let's call it Mythos-induced acceleration. Today was also the rumored release day for Spud, aka GPT-5.5. Not
  • @wittywebhandle Blaise Ulysse Bernard Collins on bluesky
    This is incredibly funny.  —  The update is one of their vaguest yet and theyre basically doing it just because Mythos is never going to be a broad release & they've hit their ceiling w/ Opus 4.6
  • @realsigridjin Sigrid Jin on x
    opus 4.7 has a new tokenizer which means a new base model underneath, not just a post-training refresh [image]
  • @maximelabonne Maxime Labonne on x
    My bet is that Mythos uses a new tokenizer, and they switched Opus over to it (through midtraining) for distillation
  • @myainotez @myainotez on x
    New Opus is out, they mention a new tokenizer too. Maybe we will have breadcrumbs of mythos in this one
  • @bspk_ @bspk_ on x
    New base model!
  • @natolambert Nathan Lambert on x
    There's good discussion around this one ways that it could just be adaptation at midtraining, but base model is the simplest explanation so that's my bet.
  • @schiste Christophe Henner on x
    Oh gosh, they removed 4.6 altogether from selectors. So I have to say, this does looks a lot like a downsell disguised in an upsell. A few upgrades, but a new tokenizer eating tokens much faster. Well, we knew the time to stop brut forcing things with Opus had to end. [image]
  • @andrew_n_carr Andrew Carr on x
    4.7 has a new tokenizer (in-part) because of the 3x vision scaling improvements
  • @topmass Matthew on x
    opus 4.7 tokenizer is new and uses more tokens for the same inputs... AND the new default reasoning effort inside of claude code will be high - get ready to tear through your limits! [image]
  • @bogdanionutcir2 Bogdan Ionut Cirstea on x
    seems probably good for safety, especially if most capabilities gains came from pretraining
  • @bcherny Boris Cherny on x
    Opus 4.7 uses more thinking tokens, so we've increased rate limits for all subscribers to make up for it. Enjoy!
  • @eliebakouch Elie on x
    my take: opus 4.7 is a distilled version of mythos
  • @kunchamsathwik @kunchamsathwik on x
    Claude Opus 4.7 launched Thing I noticed: 1. They changed the tokenizer which may map to 35% more tokens. 2. Model by default thinks more. Overall, higher token use and faster rate limit hits. [image]
  • @realsigridjin Sigrid Jin on x
    tldr; @ClaudeDevs opus 4.7 just shipped as expected > the tokenizer changed. same input maps to 1.0 to 1.35x more tokens depending on content type > output tokens also go up at higher effort, the model thinks longer on later turns in agentic loops > new effort level called
  • r/ArtificialInteligence on reddit
    Claude Mythos: Finance ministers and top bankers raise serious concerns about AI model.