Anthropic says it found Opus 4.6 “brings more focus to the most challenging parts of a task without being told to” and “thinks more deeply and more carefully”

We're upgrading our smartest model. — The new Claude Opus 4.6 improves on its predecessor's coding skills.

Anthropic 2026-02-05

Discussion

@claudeai Claude on x
Claude Opus 4.6 is available today on https://claude.ai/, the Claude Developer Platform, and all major cloud platforms. And within Cowork, Opus 4.6 can put all these skills to work autonomously on your behalf. Read more: https://www.anthropic.com/...
r/ClaudeAI r on reddit
4.6 released 6min ago!
r/singularity r on reddit
Anthropic releases Claude Opus 4.6 model, same pricing as 4.5
@steveklabnik.com Steve Klabnik on bluesky
Very interested to see how this goes — code.claude.com/docs/en/agen...
r/singularity r on reddit
Claude Opus 4.6
@cameron.stream Cameron on bluesky
Opus 4.6 is out. — Same price — Focus on subagents — 1m token context (beta), 128k output tokens — Improved agentic search — Adaptive thinking — www.anthropic.com/news/claude...
r/google_antigravity r on reddit
Introducing Claude Opus 4.6
@claudeai Claude on x
Introducing Claude Opus 4.6. Our smartest model got an upgrade. Opus 4.6 plans more carefully, sustains agentic tasks for longer, operates reliably in massive codebases, and catches its own mistakes. It's also our first Opus-class model with 1M token context in beta. [video]
@sonofalli Alli on x
anthropic vs openai is like kendrick vs drake but for nerds
@kimmonismus @kimmonismus on x
Holy sh*t! That jump is insane [image]
@claudeai Claude on x
Claude in PowerPoint is now available in research preview for Max, Team, and Enterprise. Claude reads your layouts, fonts, and slide masters to stay on-brand — whether you're building from a template or generating a full deck from a description. [image]
@nathanblawrence Nathan Lawrence on x
To get Claude Opus 4.6 on https://claude.ai/ working in Xcode 26.3 right now, with no extra waiting: 1. Set your Claude Agent model to “Default” in the settings. 2. Put this in ‘~/Library/Developer/Xcode/CodingAssist ant/ ClaudeAgentConfig/settings.json’: { “model”: [image]
@claudeai Claude on x
New on the API: we're giving developers better control over model effort and more flexibility for long-running agents. Adaptive thinking lets Claude calibrate its reasoning depth to each task, and context compaction keeps long-running tasks from hitting limits.
@damianplayer Damian Player on x
holy shit... AGI is here. [video]
@hakamite @hakamite on x
Opus 4.6 just dropped for me. Anyone else got this? [image]
@scaling01 @scaling01 on x
Claude 4.6 Opus outscores GPT-5.2 Pro on BrowseComp [image]
@scaling01 @scaling01 on x
“Opus 4.6 to be significantly stronger than prior models at subtly completing suspicious side tasks in the course of normal workflows without attracting attention, when explicitly prompted to do this.” “We did not see evidence of sandbagging or strategic attempts to tamper [image…
@claudeai Claude on x
On Claude Code, we're introducing agent teams. Spin up multiple agents that coordinate autonomously and work in parallel—best for tasks that can be split up and tackled independently. Agent teams are in research preview: https://code.claude.com/...
@scaling01 @scaling01 on x
Claude 4.6 Opus achieves a 427× speedup on kernel optimization over the baseline using a novel scaffold far exceeding the 300x threshold for 40 human-expert-hours of work [image]
@brandenflasch Branden Flasch on x
1M token context?? I can't wait to use this
@bloombergtv @bloombergtv on x
Anthropic is releasing a new version of its most powerful AI model that's designed to carry out financial research, days after the company's push into legal services upended the stocks of legacy software makers. @shiringhaffary reports https://www.bloomberg.com/... [video]
@verge @verge on x
Anthropic debuts new model with hopes to corner the market beyond coding https://www.theverge.com/...
@levie Aaron Levie on x
Claude Opus 4.6 is now out! At Box, we've been testing the new model with Box AI on complex areas of knowledge work across many key industries like Financial Services, Life Sciences, and Legal. Overall, Opus 4.6 represents a 10% jump over Opus 4.5 on our hardest knowledge work [i…
@saffronhuang Saffron Huang on x
New model just dropped. Opus 4.6 found 500+ previously-unknown zero days in open source code, out of the box.
@bindureddy Bindu Reddy on x
Opus 4.6 WILL TOP ALL THE LEADERBOARDS We will publish it on LiveBench and ChatLLM shortly That said, the early version we had access to was already ON TOP! Claude is on FIRE 🔥🔥
@nichochar Nicholas Charriere on x
Reliability: Of all apps generated in one-shot scenarios during testing, both Opus 4.5 and Sonnet 4.5 produced at least one broken app. Opus 4.6 produced none. It self-verifies its work, ending every sequence with verification steps.
@scaling01 @scaling01 on x
Opus 4.6 slightly more prone to prompt injections than Opus 4.5 [image]
@scaling01 @scaling01 on x
Opus 4.6 is now the best long-context model it's absolutely destroying Gemini 3 Pro and GPT-5.2 [image]
@kimmonismus @kimmonismus on x
1m context, longer task duration and overall significant improvements this is what I was waiting for! [image]
@llmstats @llmstats on x
Anthropic developed Opus 4.6 with computational or “in silico” biology capabilities in mind. As seen in their own graphs, Opus 4.6 performs almost twice as well in tasks related to the analysis of chemical compounds, protein structures, and even phylogenetic tests. The benchmark …
@nichochar Nicholas Charriere on x
We've had the pleasure of working with Opus 4.6 before the release to give Anthropic some feedback. Overall, this is the best model we have seen to date. Key change in behavior: it is becoming more autonomous. More details on our evaluation below 👇
@teknium @teknium on x
They released info on Opus 4.6 already on their models page [image]
@claudeai Claude on x
Opus 4.6 is state-of-the-art on several evaluations including agentic coding, multi-discipline reasoning, knowledge work, and agentic search. We're also shipping new features across Claude in Excel, Claude in PowerPoint, Claude Code, and our API to let Opus 4.6 do even more. [ima…
@cortinico Nicola Corti on x
> It's also our first Opus-class model with 1M token context in beta. 👆 huge
@yuchenj_uw Yuchen Jin on x
“Claude Opus 4.6 ... managing a ~50-person organization across 6 repositories. It handled both product and organizational decisions while synthesizing context across multiple domains, and it knew when to escalate to a human.” Sam wasn't joking about being replaced by an AI CEO. […
@emollick Ethan Mollick on x
Had early access to Opus 4.6, here is the results from same prompt to create the control panel for a spaceship in the distant future, done using the standard Claude interface (not Code). I had to zoom into a few of the subsystems so you can see the details. [video]
@deredleritt3r Prinz on x
Anthropic asked 16 of its researchers regarding the uplift they get from working with Opus 4.6. - Reported uplift ranged from 30% to 700%(!). - Mean uplift was 152%; median uplift was 100%. **Note that this survey was different from similar surveys performed for Sonnet 4.5 [image…
@scaling01 @scaling01 on x
Claude 4.6 Opus provides an estimated productivity uplift of 30% to 700%, with a mean of 152% and median of 100% [image]
@scaling01 @scaling01 on x
Claude Opus 4.6 achieved a 34× speedup on optimizing a CPU-only LLM model training, which is well above the 4× speedup considered to represent 4-8 human-effort hours. [image]
@krishnanrohit Rohit on x
Opus 4.6 is here! Looks good, on the same capabilities curve, and I remain shocked that openai didn't win with Excel and PowerPoint, considering the Microsoft partnership, before Anthropic beat them to it! [image]
@benkomalo Ben Komalo on x
If you like Opus 4.5, then hopefully you'll like this - Opus 4.6 should be better in almost every way, and supports 1M context. Let us know what you think!
@bcherny Boris Cherny on x
I've been using Opus 4.6 for a bit — it is our best model yet. It is more agentic, more intelligent, runs for longer, and is more careful and exhaustive. For Claude Code users, you can also now more precisely tune how much the model thinks. Run /model and arrow left/right to
@teknium @teknium on x
It's even better than a sonnet release its Opus 4.6! [image]
@ericbuess Eric Buess on x
Opus 4.6! - 1 million token context window in beta! - Agent Teams on Claude Code! Spin up multiple agents that coordinate autonomously and work in parallel—best for tasks that can be split up and tackled independently. https://www.anthropic.com/... [video]
@claudeai Claude on x
Claude in Excel now handles long-running and harder tasks with improved performance. It can plan before acting, support richer functionalities like conditional formatting and data validation, and handle multi-step changes in one pass. Read more: https://claude.com/... [video]
@_catwu Cat on x
We launched Opus 4.6 today + some new features to help you do more with Claude Code.
@synthwavedd Leo on x
holy shit lmao [image]
@alexalbert__ Alex Albert on x
Opus 4.6 is here. The jump in autonomy is real. The biggest shift for me personally has been learning to let it run. Give it the context, step away, and come back to something pretty amazing. The way we work alongside models is starting to completely change.
@scaling01 @scaling01 on x
Claude 4.6 Opus achieves a 427× speedup on kernel optimization over the baseline using a novel scaffold far exceeding the 300x threshold for 40 human-expert-hours of work [image]
@kyleturman Kyle Turman on x
Opus 4.6 is a bananas upgrade. The 0.1 increment undersells it. As much as Opus 4.5 changed the way I develop, my workflow now requires much less direction and detailed feedback. Claude just gets it. Very excited for you all to give it a whirl!
@scaling01 @scaling01 on x
Opus 4.6 seems to have hallucinate more [image]
@sammcallister Sam Mcallister on x
I'm Sam and I have an incredible team of agents. Claude Opus 4.6 would not exist without them and they cooked.
@camsoft2000 @camsoft2000 on x
1M token context window means there has never been a better time to install XcodeBuildMCP!
@zephyr_z9 @zephyr_z9 on x
Probably the most important release today Kimi released their agent swarm and PARL a few days ago 2026 is the year of agent swarms Other labs will roll it out quickly
@lydiahallie Lydia Hallie on x
Claude Code now supports agent teams (in research preview) Instead of a single agent working through a task sequentially, a lead agent can delegate to multiple teammates that work in parallel to research, debug, and build while coordinating with each other. Try it out today by [v…
@timkellogg.me Tim Kellogg on bluesky
Opus 4.6 is here! — biggest wins on agentic search, HLE & ARC AGI 2 — claude.com/blog/opus-4-... [image]
@harvey @harvey on x
Now live in Harvey: Claude Opus 4.6. It achieved a 90.2% on our BigLaw Bench, the highest score yet for the Claude family, with 40% perfect results. [image]
@krishnanrohit Rohit on x
Very interesting blog and an insane result, and yet again shows the problems of multi-agent coordination. It remains the case that we have to handcraft roles, artisanal, and painfully figure out how they should work. We need a better way! cc @alexolegimas @AndreyFradkin [image]
@steveklabnik.com Steve Klabnik on bluesky
“Over nearly 2,000 Claude Code sessions and $20,000 in API costs, the agent team produced a 100,000-line [C compiler written in #rustlang] that can build Linux 6.9 on x86, ARM, and RISC-V.” — www.anthropic.com/engineering/ ...
r/programare r on reddit
Building a C compiler with a team of parallel Claudes
r/singularity r on reddit
We tasked Opus 4.6 using agent teams to build a C compiler. Then we (mostly) walked away. Two weeks later, it worked on the Linux kernel.
r/ClaudeCode r on reddit
We tasked Opus 4.6 using agent teams to build a C compiler. Then we (mostly) walked away. Two weeks later, it worked on the Linux kernel.

Chronicles

Anthropic says it found Opus 4.6 “brings more focus to the most challenging parts of a task without being told to” and “thinks more deeply and more carefully”

Related Coverage

Discussion