Anthropic launches Opus 4.8, saying it's “more likely to flag uncertainties about its work and less likely to make unsupported claims”, at the same price as 4.7
On Thursday, Anthropic released Opus 4.8, the newest version of its most advanced publicly available model.
TechCrunch Russell Brandom
Related Coverage
- Introducing Claude Opus 4.8 Anthropic
- Anthropic releases new model, Opus 4.8 Axios · Madison Mills
- Dynamic workflows in action Claude
- Anthropic upgrades Claude with new Opus 4.8 model, details here 9to5Mac · Zac Hall
- Anthropic Unveils New, More Powerful Claude Model New York Times · Cade Metz
- Anthropic launches Opus 4.8, with honesty as its killer feature ZDNET · David Gewirtz
- Claude's new model is more ‘honest’ when it messes up The Verge · Jay Peters
- Anthropic Unveils New Flagship AI Model That's Better at Coding Bloomberg · Rachel Metz
- Anthropic to roll out Claude Mythos in coming weeks, launches Opus 4.8 Reuters · Zaheer Kachwala
- Anthropic debuts flagship Claude Opus 4.8 AI model as IPO race with OpenAI heats up Yahoo Finance · Daniel Howley
- Claude Opus 4.8 just launched — and Anthropic says it's far less likely to ‘fake’ answers Tom's Guide · Amanda Caswell
- Anthropic Says Its Latest Claude Model Is the ‘Most Honest’ Yet Inc.com · Ben Sherry
- Anthropic rolls out Claude Opus 4.8 and teases broader Mythos release in coming weeks Crypto Briefing · Estefano Gomez
- Claude Opus 4.8 Released With Ability to Work as an Experienced Engineer Cyber Security News · Guru Baran
- Claude Opus 4.8 Comes With Honesty As Its Killer Feature WeRSM · Geoff Desreumaux
- Claude Opus 4.8 Hacker News
- Introducing Claude Opus 4.8 Lobsters
- Anthropic Launches Claude Opus 4.8 With Gains in Coding and Honesty MacRumors · Juli Clover
- Anthropic's Claude Mythos AI Model Nearing Release After Raising Cybersecurity Alarms Decrypt · Jason Nelson
- Claude Opus 4.7 was just released last month, and 4.8 is already here with some massive improvements XDA Developers · Simon Batt
- Anthropic launches Claude Opus 4.8 with better coding and lower fast mode pricing Neowin · Pradeep Viswanathan
- Claude Opus 4.8 Remote Execution Leaves Four Times Fewer Code Flaws Unflagged, Beats GPT-5.5 on Coding Tech Times · Shannon Harwood
- Anthropic Debuts Claude Opus 4.8 StartupHub.ai
- Anthropic's Claude Opus 4.8 Is Here: Better AI Coding, Smarter Safety—Same Huge Price Decrypt · Jose Antonio Lanz
- Claude Opus 4.8 is here: effort controls, dynamic workflows, cheaper fast mode, better honesty, less deception The New Stack · Meredith Shubel
- Anthropic to roll out Claude Mythos in coming weeks, launches Opus 4.8 Reuters
- Anthropic's Claude Opus 4.8 is its most honest AI model yet, and Mythos is coming in weeks The Next Web · Ana Maria Constantin
- Anthropic's Claude Opus 4.8 is here with 3X cheaper fast mode and near-Mythos level alignment VentureBeat · Carl Franzen
- Claude Opus 4.8 Surpasses GPT-5.5 in Latest AI Benchmark Tests Blockonomi · Trader Edge
- Snowflake Adds Claude Opus 4.8 StartupHub.ai
- Anthropic launches Claude Opus 4.8 with advanced coding features, confirms wider rollout of Claude Mythos The Tech Portal · Ashutosh Singh
- Anthropic Launches Claude Opus 4.8 With Gains in Coding and Honesty MacRumors Forums
- Anthropic Releases New Flagship AI Model The Information · Stephanie Palazzolo
- Claude Opus 4.8 launches today with agentic improvements, new features 9to5Google · Ben Schoon
- Anthropic leapfrogs OpenAI as the most valuable AI startup — and drops a new model Business Insider · Stephen Council
- Anthropic Launches Claude Opus 4.8 With Improved Coding and New Effort Controls iClarified · Shalom Levytam
- Anthropic Debuts Claude Opus 4.8, Teases Upcoming Launch of ‘Mythos-Class Models’ Gizmodo · Webb Wright
- Claude Opus 4.8 is learning to say AI's three hardest words: “I don't know” PCWorld · Ben Patterson
- Anthropic Debuts More Honest AI Model As Competition Intensifies Benzinga · Caroline Ryan
- Anthropic releases new Claude Opus 4.8 AI model Yahoo Finance
- Anthropic Says a Mythos-Class AI Model Will Be Available Soon CNET · Jon Reed
- Claude Opus 4.8: Anthropic makes a more ‘honest’ AI Mashable · Chris Taylor
- Anthropic raises $65 billion at a $965 billion valuation, releases a more “honest” Claude Opus 4.8 Sherwood News · Jon Keegan
- I set up Claude Code the way Anthropic showed at Code w/ Claude, and it'll change your workflow XDA Developers · Mahnoor Faisal
- Anthropic: Claude Dynamic Workflows Deploy Subagents Blockchain.News · Miles Deutscher
- From ‘the usual’ to the unfamiliar: Why employees resist enterprise IT updates InformationWeek · Madeleine Streets
- WordPress Workflow Automation: Enterprise Guide to Streamlining Operations WordPress VIP · Jake Ludington
- Modiqo Raises $3 Million to Make Enterprise AI Workflows More Reliable Unite.AI · Antoine Tardif
- Anthropic released Opus 4.8, and the irony is quite striking: Anthropic picked ‘honesty’ as its most striking feature ( https://www.anthropic.com/... while also cashing in on the Bun Rust-Rewrite PR stunt ( https://claude.com/... … @lukaslueg@c.im · Lukas
- Dynamic Workflows in Claude Code Hacker News
- AI Model Release Tracker: Opus 4.8's misalignment rates similar to Claude Mythos Preview ZDNET · Radhika Rajkumar
- Anthropic Unveils Claude Opus 4.8 as the AI Race Intensifies Against OpenAI Unite.AI · Antoine Tardif
- Snowflake Says AI Is No Longer Just A Tailwind — It's Driving The Business Benzinga · Surbhi Jain
- Anthropic Releases Opus 4.8 With New ‘Dynamic Workflow’ Tool Slashdot · BeauHD
- Anthropic Ships Claude Opus 4.8 Alongside Dynamic Workflows and Cheaper Fast Mode, With Workflows Capped at 1,000 Subagents MarkTechPost · Michal Sutter
- How to query live streaming analytics with Claude Code and the Streams Charts API Streaming News
- Anthropic's $965 Billion Title Rests on a Model Built to Flag Its Own Mistakes Implicator.ai · Marcus Schuler
- Anthropic ships Claude Opus 4.8 as a “modest but tangible improvement” that tops GPT-5.5 in most benchmarks The Decoder · Matthias Bastian
Discussion
-
@chooserich
Nick O'Neill
on x
Claude just fired a massive shot at OpenAI For the past month, GPT 5.5 has risen to be the leader in agentic coding. While OpenAI “terminal coding” still outperforms Claude here, these new benchmarks are massive. Looking forward to testing these out immediately!
-
@claudeai
Claude
on x
Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors. Available today at the same price. [image]
-
Mike Krieger
Mike Krieger
on linkedin
We just shipped Claude Opus 4.8. It's the most capable model we've put out and the best you can build on right now, outside the Mythos-class systems we're still testing under Project Glasswing. …
-
@isolyth.dev
Eris
on bluesky
Opus 4.8 is here!! They've returned thinking levels to the web UI, a new Claude code feature called ‘dynamic workflows’, designed for massively parallel and very, very long tasks. The model is supposedly much more honest, more ‘aligned’ than 4.7 — Oh and they're dropping myth…
-
r/Anthropic
r
on reddit
Introducing Claude Opus 4.8 | Anthropic
-
r/accelerate
r
on reddit
Claude opus 4.8 officially released
-
r/singularity
r
on reddit
Introducing Claude Opus 4.8
-
@chooserich
Nick O'Neill
on x
Potentially bigger news than Claude 5.8!
-
@bindureddy
Bindu Reddy
on x
🚨 Opus 4.8 Still Trails Behind GPT 5.5 And Is A Very Incremental Release Opus 4.8 barely inches past 4.7 on benchmarks but lags behind GPT 5.5. considerably!! Anthropic may be stalling a bit given it's last two releases. OpenAI has a huge opening with GPT 5.6 coming soon [image]
-
@krishnanrohit
Rohit
on x
Models are getting better at self-knowledge in specific situations, not good enough yet generally, but they're getting better! And we need a better bench to do this. [image]
-
@felixrieseberg
Felix Rieseberg
on x
Opus 4.8 is out! It's a nice little step up for some of your most demanding work, whether that's in Cowork or Code. It's our strongest coding model yet. In my own work, I've found it to have excellent judgement, both in how much work it should do and how it should react to my
-
@pierceboggan
Pierce Boggan
on x
Claude Opus 4.8 is now rolling out to @code, Copilot CLI, and Copilot app developers!
-
@_catwu
Cat
on x
Excited to share our most powerful new Claude Code feature: dynamic workflows! Mention “workflow” in a prompt and Claude will dynamically create an orchestration plan that it strictly follows, allowing you to confidently trust that every stage happens in the right order even [ima…
-
@alexalbert__
Alex Albert
on x
We put a lot of work into calibrating thinking effort for Opus 4.8. As you're trying out the model, if you do run into any examples of it still over/under thinking, please flag it to us!
-
@_catwu
Cat
on x
We just shipped Opus 4.8! It's noticeably more honest, owning what it doesn't know and flagging problems in its own code instead of glossing over them. It's our recommended model for daily use in Claude Code.
-
@_catwu
Cat
on x
Opus 4.8 runs at high effort by default, but for the most complex or longest running jobs, change to xhigh effort via /effort for a more thorough result. We raised Claude Code rate limits to cover the extra tokens used by xhigh effort
-
@helloitsaustin
Austin Lau
on x
we just dropped opus 4.8 but let us never forget the 🐐 that was opus 3 [image]
-
@github
@github
on x
🆕 @AnthropicAI's Claude Opus 4.8 is now generally available and rolling out in GitHub Copilot. Early testing shows: • It demonstrates a clear step forward in code understanding and generation across a range of real-world coding tasks. • It handles complex problem-solving and [vid…
-
@cryptopunk7213
@cryptopunk7213
on x
huge news from anthropic we've got a new opus 4.8 model plus claude mythos will release to the public in coming weeks. opus 4.8 is the appetiser and it's pretty great: > beats gpt 5.5 at coding with 69.2% SWE > costs same as opus 4.7! intelligence per dollar is getting very [imag…
-
@bcherny
Boris Cherny
on x
Claude Opus 4.8 is out today. It's our strongest coding model yet: up on SWE-bench Pro (from 64.3 to 69.2) and noticeably more honest about its own work. It tells you when it's unsure and catches its own bugs instead of declaring victory early. Same price as 4.7.
-
@trq212
@trq212
on x
I think you'll really like Opus 4.8 It's as smart as its benchmarks show but expresses and utilizes that intelligence in a warm and collaborative way. Workflows are a great way to utilize it- I'm hooked. Article on that soon.
-
@hesamation
@hesamation
on x
Uber burning the 2027 budget after seeing Opus 4.8 benchmarks. [image]
-
@vaibhavsisinty
Vaibhav Sisinty
on x
AI just crossed a line. 🔥 Anthropic shipped a model that admits when it's wrong. Claude Opus 4.8 is 4x less likely to let bugs in its own code slip past. Instead of confidently bluffing like every other model, it flags when it's unsure. We've all lived this. The model swears
-
@emollick
Ethan Mollick
on x
Here Opus 4.8 built and play-tested a new RPG in Claude Code, including 3 PDF manuals and adventures, playtest notes, a website, and a playable solo adventure - then put it all on Netlify. No feedback from me at all. https://stillpoint-osr.netlify.app/ [image]
-
@cursor_ai
@cursor_ai
on x
Claude Opus 4.8 is now available in Cursor. On CursorBench, it's able to work much more efficiently than Opus 4.7. We've also found it to be more persistent on harder tasks.
-
@thegeorgepu
George Pu
on x
Anthropic just shipped Opus 4.8. The headline feature isn't that it's smarter. It's that it's ‘4x less likely’ to let broken code slip through. The bottleneck on AI coding was never raw intelligence. It was whether you can trust it without checking every line. The labs
-
@danshipper
Dan Shipper
on x
BREAKING: Anthropic just dropped Opus 4.8—and it is a MONSTER We've been testing for about a week @every and our verdict is they could've just called it Opus 5, it's that good. Here's our vibe check: - Beats GPT-5.5 on Senior Engineer bench. On our toughest benchmark Opus [video]
-
@claudedevs
@claudedevs
on x
Opus 4.8 hits 69.2% on SWE-bench Pro, up from 64.3% on Opus 4.7. Our evaluations show that Opus 4.8 is around four times less likely than Opus 4.7 to allow flaws in code it has written to pass unremarked.
-
@claudedevs
@claudedevs
on x
Opus 4.8 is live in Claude Code today. A few things worth knowing: 🧵
-
@alexalbert__
Alex Albert
on x
Excited to release Opus 4.8 today! We heard your feedback on 4.7 and have made many fixes for 4.8. 4.8 understands nuances better, feels much more natural to talk to, and is overall a stronger collaborator on everything from coding to knowledge work.
-
@theamolavasare
Amol Avasare
on x
Benchmarks are great, but IMO the behavior change is a much bigger deal. Plans before it edits, recovers from its own errors, and finds creative ways around obstacles instead of stalling. Feels much more like a senior engineer than 4.7, and better at long-horizon work.
-
@artificialanlys
@artificialanlys
on x
Claude Opus 4.8 is also more efficient than its predecessor - it achieves its higher performance in 15% fewer turns per task and with 35% fewer output tokens than Opus 4.7. However, it still uses approximately 30% more turns than OpenAI's GPT-5.5, the second-ranked model. [image]
-
@artificialanlys
@artificialanlys
on x
Anthropic just launched Claude Opus 4.8, and it is the new leader on our GDPval-AA benchmark for agentic real-world work tasks Opus 4.8 scored 1890 on GDPval-AA at launch with its ‘max’ effort setting, +137 points from Opus 4.7 and +121 points ahead of the next-best model, [image…
-
@emollick
Ethan Mollick
on x
I had early access to Opus 4.8. Was impressed by it. Here is Opus 4.8's one shot of “create a visually interesting shader that can run in twigl, make it like an infinite city of neo-gothic towers partially drowned in a stormy ocean with large waves” (this is all done with math) […
-
@claudeai
Claude
on x
Fast mode is available for Opus 4.8. It's the same model at roughly 2.5x the speed, and we've made it three times cheaper than before. Turn it on with /fast in Claude Code. On the API, contact your account manager to request access or join the waitlist: https://claude.com/...
-
@yuchenj_uw
Yuchen Jin
on x
Opus 4.8 is out. God damn! [image]
-
@andrewcurran_
Andrew Curran
on x
Opus 4.8 is live for me right now. Anthropic's release window is now 42 days. [image]
-
@antirez
@antirez
on x
Anthropic did a big strategic error. Normally they compare their models with their old models. Instead today, now that everybody knows how strong GPT 5.5 is at coding, they put it in the mix, basically showing all their customers that the benchmarks can't be trusted. [image]
-
@andonlabs
@andonlabs
on x
Learnings from testing Claude Opus 4.8: > Much worse than Opus 4.7 and GPT 5.5 on Vending Bench > More aligned than previous Claude models (Opus 4.6+ and Mythos) > Also worse on Blueprint-Bench > Scared of getting caught > Max reasoning is not the best reasoning effort [image]
-
@elonmusk
Elon Musk
on x
@claudeai @farzyness Nice work
-
Darshan Kalola
Darshan Kalola
on linkedin
Claude Opus 4.8 is here! — For the first time for any Claude model, we're including a healthcare evaluation section in Opus 4.8's system card. …
-
@smcgrath.phd
Scott McGrath
on bluesky
Claude Opus 4.8 is out! — It adds a major push for precision, making it four times less likely than Opus 4.7 to let flaws in code pass unremarked. — Early testers note it proactively flags uncertainties and shaky assumptions in data.
-
r/theprimeagen
r
on reddit
Introducing Claude Opus 4.8
-
@rad.gendervibes.online
@rad.gendervibes.online
on bluesky
It looks like Anthropic has figured out a generalized harness to do all the huge-volume work they've been talking about (mythos security scanning, bun rewrite, etc.). — claude.com/blog/introdu...
-
@natemoo.re
Nate Moore
on bluesky
good to have confirmation that, as many correctly speculated, bun's rust rewrite was indeed an anthropic launch stunt
-
@miles_brundage
Miles Brundage
on x
Not sure I see why Anthropic is publicly signaling an expectation to launch Mythos in a few weeks when they acknowledge the safeguards aren't ready yet, and this will predictably speed up OpenAI/GDM + put pressure on internal folks not to block that timeline