/
Navigation
Chronicles
Browse all articles
Explore
Semantic exploration
Research
Entity momentum
Nexus
Correlations & relationships
Story Arc
Topic evolution
Drift Map
Semantic trajectory animation
Posts
Analysis & commentary
Pulse API
Tech news intelligence API
Browse
Entities
Companies, people, products, technologies
Domains
Browse by publication source
Handles
Browse by social media handle
Detection
Concept Search
Semantic similarity search
High Impact Stories
Top coverage by position
Sentiment Analysis
Positive/negative coverage
Anomaly Detection
Unusual coverage patterns
Analysis
Rivalry Report
Compare two entities head-to-head
Semantic Pivots
Narrative discontinuities
Crisis Response
Event recovery patterns
Connected
Search: /
Command: ⌘K
Embeddings: large
TEXXR

Chronicles

The story behind the story

days · browse · Enter similar · o open

Anthropic says Mythos Preview achieves 93.9% on SWE-bench Verified, compared with 80.8% for Opus 4.6, and 77.8% on SWE-bench Pro, above 53.4% for Opus 4.6

Anthropic on Tuesday announced Project Glasswing, a sweeping cybersecurity initiative that pairs an unreleased frontier AI model …

VentureBeat Michael Nuñez

Discussion

  • @mweinbach Max Weinbach on x
    Mythos seems to just about destroy every other model [image]
  • @kimmonismus @kimmonismus on x
    MYTHOS BENCHMARKS, OFFICIAL. HOLY MOLY Anthropic cooked!! [image]
  • @fabknowledge @fabknowledge on x
    wow this is the biggest step change in a new model release in recent memory [image]
  • @neilhtennek Kenneth on x
    I cannot celebrate Mythos, it brings a sense of dread I do not particularly understand. 93.9% SWE-Bench. [image]
  • @deedydas Deedy on x
    Claude Mythos just obliterated every single benchmark in AI. I can't believe what I'm reading. [image]
  • @fabknowledge @fabknowledge on x
    Mythos able to exploit like firefox pretty easily. Cybench is 100% at 1 pass which is lol [image]
  • @yuchenj_uw Yuchen Jin on x
    After seeing the Mythos benchmark scores, my Claude Opus 4.6 already feels outdated. Anthropic, can you just drop Mythos? I know you can't do it due to some “safety” reasons, but I'd happily pay $2,000/month to use it. AGI is already here - it's just not evenly distributed.
  • @apompliano Anthony Pompliano on x
    AI is coming for a lot of jobs. Just look at these performance metrics from Anthropic's latest model. Superhuman intelligence is going to be available to anyone. [image]
  • @yuchenj_uw Yuchen Jin on x
    Anthropic is truly unstoppable. Mythos is crushing Claude Opus 4.6 across every serious agentic coding benchmark. It has found vulnerabilities in the Linux kernel, a 27-year-old vulnerability in OpenBSD, and a 16-year-old vulnerability in FFmpeg. No wonder folks at big labs [imag…
  • r/technology r on reddit
    Anthropic says its most powerful AI cyber model is too dangerous to release publicly — so it built Project Glasswing
  • @jjvincent James Vincent on bluesky
    claude mythos is particularly fond of mark fisher for unknown reasons - from the system card www-cdn.anthropic.com/53566bf5440a...  [image]
  • r/artificial r on reddit
    Why would Anthropic keep a cyber model like Project Glasswing invite-only?
  • r/technology r on reddit
    Anthropic limits Mythos AI rollout over fears hackers could use model for cyberattacks
  • r/BetterOffline r on reddit
    Anthropic limits Mythos AI rollout over fears hackers could use model for cyberattacks