Anthropic says Mythos Preview achieves 93.9% on SWE-bench Verified, compared with 80.8% for Opus 4.6, and 77.8% on SWE-bench Pro, above 53.4% for Opus 4.6

Anthropic on Tuesday announced Project Glasswing, a sweeping cybersecurity initiative that pairs an unreleased frontier AI model …

VentureBeat 2026-04-08 Michael Nuñez

Discussion

@mweinbach Max Weinbach on x
Mythos seems to just about destroy every other model [image]
@kimmonismus @kimmonismus on x
MYTHOS BENCHMARKS, OFFICIAL. HOLY MOLY Anthropic cooked!! [image]
@fabknowledge @fabknowledge on x
wow this is the biggest step change in a new model release in recent memory [image]
@neilhtennek Kenneth on x
I cannot celebrate Mythos, it brings a sense of dread I do not particularly understand. 93.9% SWE-Bench. [image]
@deedydas Deedy on x
Claude Mythos just obliterated every single benchmark in AI. I can't believe what I'm reading. [image]
@fabknowledge @fabknowledge on x
Mythos able to exploit like firefox pretty easily. Cybench is 100% at 1 pass which is lol [image]
@yuchenj_uw Yuchen Jin on x
After seeing the Mythos benchmark scores, my Claude Opus 4.6 already feels outdated. Anthropic, can you just drop Mythos? I know you can't do it due to some “safety” reasons, but I'd happily pay $2,000/month to use it. AGI is already here - it's just not evenly distributed.
@apompliano Anthony Pompliano on x
AI is coming for a lot of jobs. Just look at these performance metrics from Anthropic's latest model. Superhuman intelligence is going to be available to anyone. [image]
@yuchenj_uw Yuchen Jin on x
Anthropic is truly unstoppable. Mythos is crushing Claude Opus 4.6 across every serious agentic coding benchmark. It has found vulnerabilities in the Linux kernel, a 27-year-old vulnerability in OpenBSD, and a 16-year-old vulnerability in FFmpeg. No wonder folks at big labs [imag…
r/technology r on reddit
Anthropic says its most powerful AI cyber model is too dangerous to release publicly — so it built Project Glasswing
@jjvincent James Vincent on bluesky
claude mythos is particularly fond of mark fisher for unknown reasons - from the system card www-cdn.anthropic.com/53566bf5440a... [image]
r/artificial r on reddit
Why would Anthropic keep a cyber model like Project Glasswing invite-only?
r/technology r on reddit
Anthropic limits Mythos AI rollout over fears hackers could use model for cyberattacks
r/BetterOffline r on reddit
Anthropic limits Mythos AI rollout over fears hackers could use model for cyberattacks

Chronicles

Anthropic says Mythos Preview achieves 93.9% on SWE-bench Verified, compared with 80.8% for Opus 4.6, and 77.8% on SWE-bench Pro, above 53.4% for Opus 4.6

Related Coverage

Discussion