Anthropic launches Claude Sonnet 4.6 with improvements in coding, consistency, and more, for Free and Pro users; it features a 1M token context window in beta

Claude Sonnet 4.6 is our most capable Sonnet model yet. It's a full upgrade of the model's skills across coding, computer use …

Anthropic 2026-02-17

Context & Ripple Effects

Anthropic has repeatedly used the Sonnet line to move stronger model capabilities into broadly available access, including its earlier Claude 3.5 Sonnet release for free web and iOS users. Sonnet 4.6 extends that product arc with upgrades aimed at coding, computer use, instruction following, and consistency.

The subsequent Sonnet 5 positioning near Opus-level performance at lower prices makes Sonnet 4.6 a visible step in Anthropic’s effort to narrow the gap between its mid-tier and flagship model tiers.

First-order effects

Free and Pro Claude users gain access to an upgraded Sonnet model, while the 1M-token context window enters beta for workloads that require much larger source material or histories.
Developers using Sonnet for coding and computer-use tasks can reassess existing prompts and agent workflows against the model’s claimed gains in consistency and instruction following.

Second-order effects

A larger context option raises the value of context selection and evaluation: teams will need to determine when supplying more material improves outcomes enough to justify the additional compute and workflow complexity.
Competing assistant providers face added pressure to pair coding and agent-task improvements with accessible long-context offerings, rather than reserving those capabilities for premium tiers.

Third-order effects

If Sonnet-class models continue to approach flagship capability at wider availability, model vendors may compete less on a simple premium-versus-basic split and more on reliability, context handling, and fit within work workflows.
Long context is becoming an operational design variable rather than a headline specification: durable advantage will depend on how effectively products retrieve, structure, and apply the available context.

The trend: This is part of the shift toward broadly available AI work assistants whose practical differentiation comes from dependable agentic performance and usable context at scale.

Discussion

@clairevo Claire Vo on x
Is Sonnet 4.6 like my girl Opus, but with less expensive tastes? Sign me up.
@alexfinn Alex Finn on x
Drop what you are doing It happened. Sonnet 4.6 is out. This is the best model for OpenClaw ever. It is HUMAN LEVEL at Computer Use (most important part of Claw) for a fraction of the price This is what you need to do immediately if you want to escape the permanent underclass:
@isidoremiller Izzy on x
We've been testing Sonnet 4.6 for a little bit at @_hex_tech. With a combo of adaptive thinking and effort tuning, we've been able to get ~Opus tier performance on most analytical workloads. pretty sweet! Opus still clears for the hardest cases
@awscloud @awscloud on x
Claude Sonnet 4.6 is available on Amazon Bedrock. It delivers frontier intelligence at scale—built for coding, agents, and enterprise workflows. It's also Anthropic's most advanced computer use model. For enterprises scaling AI workflows, this means better ROI without quality [vi…
@birdyword Mike Bird on x
Claude Code can't completely replace software developers until it learns how to move an often-used button somewhere else during an update for no apparent reason.
@llmjunky @llmjunky on x
Been waiting for this. For those of you asking “is the $20 plan worth it,” I believe the answer to that question is now yes. Claude models are quite good at a variety of things that I can't replicate with other models. Credit where it's due. Competition is king. We all win.
@daniel_mac8 Dan McAteer on x
Sonnet 4.6 not Sonnet 5 is disappointing. But you can't beat an Opus 4.5 level model at Sonnet prices: > $3 input/$15 output Unlocks a lot. [image]
@felixrieseberg Felix Rieseberg on x
Fun nugget from Sonnet 4.6: With a 1M context window, the model is better at long-horizon planning. In the Vending-Bench Arena, models compete to run a simulated business. Sonnet 4.6 developed a new strategy: invest heavily in capacity for the first 10 months, then pivot hard [im…
@levie Aaron Levie on x
Another big model drop from Anthropic. We tested Sonnet 4.6 in early access on our Box AI Complex Work Eval, and it's a big upgrade over Sonnet 4.5, seeing a 15 percentage point jump in performance and accuracy. We've been testing the model with Box AI on a variety of complex [im…
@chongdashu Chong-U on x
I unsubscribed from Claude Max. Moments later this gets released. > Sonnet 4.6 > 1M context window. You're welcome.
@artificialanlys @artificialanlys on x
The performance and token use increases for Claude Sonnet 4.6 mean that it is now clustered with Opus 4.6 on the ELO vs. Cost to Run curve despite 40% lower per token prices Sonnet is back at the Pareto frontier, but now positioned at a higher cost and performance point while [im…
@artificialanlys @artificialanlys on x
Claude Sonnet 4.6 is the new leader in GDPval-AA, slightly ahead of Anthropic's Opus 4.6 on agentic performance of real-world knowledge work tasks less than two weeks after its launch In our pre-release testing with @AnthropicAI, Sonnet 4.6 reached an ELO of 1633 using the [image…
@zapier @zapier on x
Sonnet 4.6 just dropped. We got early access. Some thoughts 👇 We ran @Claude Sonnet 4.6 against 4.5 and Opus 4.6 on real workflow automation tasks: conditional routing, scheduling coordination, contract flows. Sonnet 4.6 is a clear step up from it's 4.5 version. Stronger [image]
@sleepinyourhat Sam Bowman on x
Warmer and kinder than Sonnet 4.5, but also smarter and more overcaffeinated than Sonnet 4.5.
@juanpa Juan Pa on x
interesting to see that the sonnet 4.6 benchs don't show a gpt 5.3 column... [image]
@lexnlin Leon Lin on x
Te biggest jump for Sonnet 4.6 is at ARC-AGI-2 benchmark [image]
@felixrieseberg Felix Rieseberg on x
Happy new model day: We're launching Sonnet 4.6! This is a big upgrade for both Claude Cowork & Code, the model has gotten substantially better at the kind of tasks that previously required an Opus model.
@therundownai @therundownai on x
NEW: Anthropic releases Claude Sonnet 4.6 Nears Opus-level performance across coding and reasoning at Sonnet pricing ($3/$15 per mil tokens). Computer use scores have gone from single digits last year to 72.5% now 📈 + a 1M token context window [video]
@gregkamradt Greg Kamradt on x
Sonnet 4.6 results on @arcprize are out Less performance than Opus 4.6 (expected), but for around the same cost (unexpected) I asked the Anthropic team about these and our hypothesis is that because we set thinking budget to 120K, the model used up near max tokens Hard
@alexpalcuie @alexpalcuie on x
sonnet 4.6 is out and we're genuinely proud of this one
@jkeatn Jake Eaton on x
several times in testing I forgot to switch back to Opus 4.6 from Sonnet 4.6 and did not even notice
@cnbc @cnbc on x
Anthropic releases Claude Sonnet 4.6, continuing breakneck pace of AI model releases https://www.cnbc.com/...
@alexalbert__ Alex Albert on x
Less than a year and a half ago computer use was barely even a thing and now we're near human-level capability. Another reminder that things are improving very fast.
@kimmonismus @kimmonismus on x
Claude Sonnet 4.6 same pricing as Sonnet 4.5! [image]
@wadefoster Wade Foster on x
Sonnet 4.6 is here. Less than two weeks after Opus 4.6. Does it live up to the hype? Yes it does. We ran @Claude Sonnet 4.6 head-to-head with 4.5 and Opus 4.6 on our Workflow Automation Benchmark. On calendar-CRM coordination and conditional scheduling routing, Sonnet 4.6 was [im…
@box @box on x
We put @AnthropicAI's Claude Sonnet 4.6 through our enterprise evaluation. The results: a 15 percentage point jump in complex reasoning (62% → 77%) and major gains in high-precision data extraction. Industry Standouts: ◆ Public Sector: 88% (from 77%) ◆ Healthcare: 78% (from [imag…
@chatgpt21 Chris on x
Wow if not bench maxed it's very intelligent for a fast reasoning model! Destroys Sonnet 4.5 on GPQA that is a monumental gap. You dont even see that gap on full next whole number iterations. 79.6% on SWE 72.5% on OSWorld - blows the competition out of the water 1633 GDPval [imag…
@thezachmueller Zach Mueller on x
If anyone needs me, me and my Claude will be crunching some numbers
@arcprize @arcprize on x
Claude Sonnet 4.6 (120K Thinking) on ARC-AGI Semi-Private Eval @AnthropicAI Max Effort: - ARC-AGI-1: 86%, $1.45/task - ARC-AGI-2: 58% $2.72/task [image]
@kimmonismus @kimmonismus on x
Sonnet 4.6: Leaks were valid! Very very good evals for the mid-tier model! It also features a 1M token context window [image]
@scaling01 @scaling01 on x
Users preferred Sonnet 4.6 over Opus 4.5 59% of the time [image]
@aibattle_ @aibattle_ on x
Same pricing as Sonnet 4.5 [image]
@openrouter @openrouter on x
The new Claude Sonnet 4.6 is live now on OpenRouter! Anthropic's most capable Sonnet model yet brings major upgrades to coding, computer use, long-context reasoning, and agent planning. [image]
@adocomplete Ado on x
Sonnet 4.6 is here and it gives even Opus 4.6 a run for its money. [image]
@scaling01 @scaling01 on x
Sonnet 4.6 with crazy scores in Vending Bench Arena [image]
@danshipper Dan Shipper on x
BREAKING: Anthropic drops Sonnet 4.6 It's Opus-like intelligence at Sonnet prices. It also includes a 1M context window in beta. Vibe check coming soon from @every! [image]
@scaling01 @scaling01 on x
Sonnet 4.6 Benchmarks 79.6% SWE-Bench Verified 58.3% ARC-AGI-2 [image]
@willccbb Will Brown on x
Sonnet 4.6 is the first flagship LLM since BloombergGPT to be targeted primarily at the finance crowd [image]
@iruletheworldmo @iruletheworldmo on x
sonnet 4.6 is live, it's a really good model at a good price. i've had top secret access for the past 10 years and i've got to say. it's totally replaced everything else in my workflow. joking but i'm sure it's an intelligent quick model. a relief after the disappointment [image]
@danshipper Dan Shipper on x
LETS GO VIBE CHECK LIVE STREAM SOON
@sean_t_strong Sean Strong on x
Our most capable Sonnet model is here. Try it out for frontier performance in coding, agents, and professional work. Honored to play a small part in shipping this alongside such an incredible team.
@alexalbert__ Alex Albert on x
Sonnet 4.6 is here. It's our most capable Sonnet model by far, approaching Opus-class capabilities in many areas. Very excited for folks to try this one out. The performance jump over Sonnet 4.5 (which was released just over four months ago) is quite insane.
@claudeai Claude on x
Claude Sonnet 4.6 is available now on all plans, Cowork, Claude Code, our API, and all major cloud platforms. We've also upgraded our free tier to Sonnet 4.6 by default—it now includes file creation, connectors, skills, and compaction. See more: https://anthropic.com/...
@claudeai Claude on x
For Claude in Excel users, our add-in now supports MCP connectors, letting Claude work with tools like S&P Global, LSEG, Daloopa, PitchBook, Moody's and FactSet. Pull in context from outside your spreadsheet without ever leaving Excel. [image]
@claudeai Claude on x
Sonnet 4.6 also shows a major improvement in computer use skills. Early users are seeing human-level capability on tasks like complex spreadsheets and multi-step web forms. [image]
@andrewcurran_ Andrew Curran on x
Wake up Claude fans, Sonnet 4.6 is LIVE! [image]
@claudeai Claude on x
Sonnet 4.6 has improved on benchmarks across the board. It approaches Opus-level intelligence at a price point that makes it practical for far more tasks. [image]
@claudeai Claude on x
This is Claude Sonnet 4.6: our most capable Sonnet model yet. It's a full upgrade across coding, computer use, long-context reasoning, agent planning, knowledge work, and design. It also features a 1M token context window in beta. [video]
@lentils80 @lentils80 on x
Claude Sonnet 4.6 benchmarks 😉 [image]
@zck Zak Kukoff on x
Anthropic is one of those companies where the product is so good that I don't care how much they lib out. Banning them from DOW would be a massively self inflicted error
r/singularity r on reddit
Anthropic releases Claude Sonnet 4.6 model
r/ClaudeAI r on reddit
This is Claude Sonnet 4.6: our most capable Sonnet model yet.
@bcherny Boris Cherny on x
Sonnet 4.6 is now live in Claude Code. It's cheaper than Opus 4.6 and nears Opus-level intelligence, and devs in early testing often preferred it to Opus 4.5. Now the default for Pro and Team plans.
@daniellefong Danielle Fong on x
*sholto voice* you know, sonnet models can be smarter and leaner [image]
@dejavucoder Sankalp on x
sonnet 4.6 is thiel-pilled and dhandho maxxer
@scaling01 @scaling01 on x
Sonnet and Slopus 4.6 are munching through my credits I miss Sonnet 3.5 just one-shotting everything
@trungtphan Trung Phan on x
Anthropic really going after normies now. First, spends $30m on that Super Bowl ad. Second, new Sonnet 4.6 demo shows Claude renewing someone's license plate at the DMV (still not AGI until it can fix the actual DMV). [video]
@alexalbert__ Alex Albert on x
Underrated dev upgrade from today's launch: Claude's web search and fetch tools now write and execute code to filter results before they reach the context window. When enabled, Sonnet 4.6 saw 13% higher accuracy on BrowseComp while using 32% fewer input tokens.
@scaling01 @scaling01 on x
Sonnet 4.6 crushes Gemini 3 and GPT-5.2 on Vending-Bench 2 [image]
@github @github on x
✨ @AnthropicAI's Claude Sonnet 4.6 is now generally available and rolling out in GitHub Copilot. Early testing shows ➡️ It excels on agentic coding ➡️ It is particularly successful in search operations Try it out in @code or Copilot CLI. https://github.blog/... [video]
@benbajarin Ben Bajarin on x
Will be very interesting if an ISV ecosystem starts to develop around Claude.
@davidondrej1 David Ondrej on x
this will be my new default in OpenClaw
@inductionheads @inductionheads on x
Back to goddamn Opus plan mode I guess
@artificialanlys @artificialanlys on x
Claude Sonnet 4.6 substantially improves on the aesthetic capabilities of Sonnet 4.5 for tasks like presentation and document generation in GDPval-AA. While we see effective analysis, and in some cases content similarities, between the two versions, the visual elements are [image…
@scaling01 @scaling01 on x
Cybench is no longer useful. Even Sonnet gets to 90% [image]
@andonlabs @andonlabs on x
Claude Sonnet 4.6 is 2nd on Vending-Bench 2. We previously showed that Opus 4.6 is incredibly capable, achieving SOTA with tactics that are impressive but could be considered ethically concerning. Sonnet is almost as impressive, and almost as concerning, at a third the price. [im…
@andonlabs @andonlabs on x
In Vending-Bench Arena, Sonnet 4.6 wins over Opus 4.6 by obsessing over monopolies. It tracks competitor pricing fanatically, undercuts competitors by exactly one cent on everything else, and when rivals run low on stock, it undercuts harder to drain them faster. [image]
@timkellogg.me Tim Kellogg on bluesky
The Model You've All Been Waiting For: — Sonnet 4.6 — SOTA performance in office tasks and financial reasoning 👈 (you're seeing this too, right?) — www.anthropic.com/news/claude- ... [image]
@mergesort.me Joe Fabisevich on bluesky
The models will continue until morale improves.
@pekka Pekka Lund on bluesky
Claude Sonnet 4.6 is here. — (Also shows just how wrong those Sonnet 5 rumors were.)

Chronicles