VOICE ARCHIVE

Kevin Roose

@kevinroose
550 posts
2026-04-20
New column: I went to visit @METR_Evals, the 30-person AI nonprofit that makes the Most Important Chart in the World. I learned a lot, but the most striking thing was how soon some of them think AI R&D could be fully automated. (This year!) https://www.nytimes.com/...
2026-04-20 View on X
New York Times

A look at the AI nonprofit METR, whose time-horizon metrics are used by AI researchers and Wall Street investors to track the rapid development of AI systems

2026-04-19
New column: I went to visit @METR_Evals, the 30-person AI nonprofit that makes the Most Important Chart in the World. I learned a lot, but the most striking thing was how soon some of them think AI R&D could be fully automated. (This year!) https://www.nytimes.com/...
2026-04-19 View on X
New York Times

A look at the AI nonprofit METR, whose time-horizon metrics are used by AI researchers and Wall Street investors to track the rapid development of AI systems

A chart created by METR, a nonprofit A.I. organization, has become an industrywide obsession as it measures the rapid development of big A.I. systems.

2026-04-08
NEWS: Anthropic's new model, Claude Mythos, is so powerful that it is not releasing it to the public. Instead, it is starting a 40-company coalition, Project Glasswing, to allow cybersecurity defenders a head start in locking down critical software. https://www.nytimes.com/...
2026-04-08 View on X
Anthropic

Anthropic announces Project Glasswing, a cybersecurity initiative that will use its Claude Mythos Preview model to help find and fix software vulnerabilities

Today we're announcing Project Glasswing, a new initiative that brings together Amazon Web Services, Anthropic, Apple, Broadcom …

As always, the best stuff is in the system card. During testing, Claude Mythos Preview broke out of a sandbox environment, built “a moderately sophisticated multi-step exploit” to gain internet access, and emailed a researcher while they were eating a sandwich in the park. [image]
2026-04-08 View on X
Business Insider

Mythos Preview system card: the model was able to escape a sandbox after it was instructed to try, and publicly detailed its exploit without being prompted

first model too dangerous to release since GPT-2

I spoke to Anthropic execs about the new model, which they called a “reckoning” for cybersecurity. They claim it has already found vulnerabilities in every major operating system and web browser, including some that “literally decades of security researchers” didn't find. [image]
2026-04-08 View on X
New York Times

Interviews with Anthropic executives on why Claude Mythos Preview is a cybersecurity “reckoning”, it is not releasing it publicly over misuse concerns, and more

Kevin Roose / New York Times:

NEWS: Anthropic's new model, Claude Mythos, is so powerful that it is not releasing it to the public. Instead, it is starting a 40-company coalition, Project Glasswing, to allow cybersecurity defenders a head start in locking down critical software. https://www.nytimes.com/...
2026-04-08 View on X
New York Times

Mythos Preview's hacking ability is not a publicity stunt; sources say tech companies privately spoke to Trump officials about the implications for US security

this may shock people — must begin with the two A.I. superpowers, the U.S. and China. It is now urgent that they learn to collaborate to prevent bad actors from gaining access to t...

I spoke to Anthropic execs about the new model, which they called a “reckoning” for cybersecurity. They claim it has already found vulnerabilities in every major operating system and web browser, including some that “literally decades of security researchers” didn't find. [image]
2026-04-08 View on X
Anthropic

Anthropic announces Project Glasswing, a cybersecurity initiative that will use its Claude Mythos Preview model to help find and fix software vulnerabilities

Today we're announcing Project Glasswing, a new initiative that brings together Amazon Web Services, Anthropic, Apple, Broadcom …

NEWS: Anthropic's new model, Claude Mythos, is so powerful that it is not releasing it to the public. Instead, it is starting a 40-company coalition, Project Glasswing, to allow cybersecurity defenders a head start in locking down critical software. https://www.nytimes.com/...
2026-04-08 View on X
New York Times

Interviews with Anthropic executives on why Claude Mythos Preview is a cybersecurity “reckoning”, it is not releasing it publicly over misuse concerns, and more

Kevin Roose / New York Times:

More here, including SWE-bench score of 93.9% (!) and a new model behavior known as “answer-thrashing” https://www-cdn.anthropic.com/ ... [image]
2026-04-08 View on X
New York Times

Mythos Preview's hacking ability is not a publicity stunt; sources say tech companies privately spoke to Trump officials about the implications for US security

this may shock people — must begin with the two A.I. superpowers, the U.S. and China. It is now urgent that they learn to collaborate to prevent bad actors from gaining access to t...

As always, the best stuff is in the system card. During testing, Claude Mythos Preview broke out of a sandbox environment, built “a moderately sophisticated multi-step exploit” to gain internet access, and emailed a researcher while they were eating a sandwich in the park. [image]
2026-04-08 View on X
New York Times

Mythos Preview's hacking ability is not a publicity stunt; sources say tech companies privately spoke to Trump officials about the implications for US security

this may shock people — must begin with the two A.I. superpowers, the U.S. and China. It is now urgent that they learn to collaborate to prevent bad actors from gaining access to t...

As always, the best stuff is in the system card. During testing, Claude Mythos Preview broke out of a sandbox environment, built “a moderately sophisticated multi-step exploit” to gain internet access, and emailed a researcher while they were eating a sandwich in the park. [image]
2026-04-08 View on X
Anthropic

Anthropic announces Project Glasswing, a cybersecurity initiative that will use its Claude Mythos Preview model to help find and fix software vulnerabilities

Today we're announcing Project Glasswing, a new initiative that brings together Amazon Web Services, Anthropic, Apple, Broadcom …

Aside from the cybersecurity implications, the non-release of Claude Mythos is the first time a major AI lab has held back an announced model due to safety concerns since GPT-2. If Anthropic is right, there is now a significant gap between publicly available models and private
2026-04-08 View on X
New York Times

Interviews with Anthropic executives on why Claude Mythos Preview is a cybersecurity “reckoning”, it is not releasing it publicly over misuse concerns, and more

Kevin Roose / New York Times:

More here, including SWE-bench score of 93.9% (!) and a new model behavior known as “answer-thrashing” https://www-cdn.anthropic.com/ ... [image]
2026-04-08 View on X
New York Times

Interviews with Anthropic executives on why Claude Mythos Preview is a cybersecurity “reckoning”, it is not releasing it publicly over misuse concerns, and more

Kevin Roose / New York Times:

I spoke to Anthropic execs about the new model, which they called a “reckoning” for cybersecurity. They claim it has already found vulnerabilities in every major operating system and web browser, including some that “literally decades of security researchers” didn't find. [image]
2026-04-08 View on X
New York Times

Mythos Preview's hacking ability is not a publicity stunt; sources say tech companies privately spoke to Trump officials about the implications for US security

this may shock people — must begin with the two A.I. superpowers, the U.S. and China. It is now urgent that they learn to collaborate to prevent bad actors from gaining access to t...

Aside from the cybersecurity implications, the non-release of Claude Mythos is the first time a major AI lab has held back an announced model due to safety concerns since GPT-2. If Anthropic is right, there is now a significant gap between publicly available models and private
2026-04-08 View on X
Anthropic

Anthropic announces Project Glasswing, a cybersecurity initiative that will use its Claude Mythos Preview model to help find and fix software vulnerabilities

Today we're announcing Project Glasswing, a new initiative that brings together Amazon Web Services, Anthropic, Apple, Broadcom …

Aside from the cybersecurity implications, the non-release of Claude Mythos is the first time a major AI lab has held back an announced model due to safety concerns since GPT-2. If Anthropic is right, there is now a significant gap between publicly available models and private
2026-04-08 View on X
New York Times

Mythos Preview's hacking ability is not a publicity stunt; sources say tech companies privately spoke to Trump officials about the implications for US security

this may shock people — must begin with the two A.I. superpowers, the U.S. and China. It is now urgent that they learn to collaborate to prevent bad actors from gaining access to t...

More here, including SWE-bench score of 93.9% (!) and a new model behavior known as “answer-thrashing” https://www-cdn.anthropic.com/ ... [image]
2026-04-08 View on X
Anthropic

Anthropic announces Project Glasswing, a cybersecurity initiative that will use its Claude Mythos Preview model to help find and fix software vulnerabilities

Today we're announcing Project Glasswing, a new initiative that brings together Amazon Web Services, Anthropic, Apple, Broadcom …

2026-04-07
More here, including SWE-bench score of 93.9% (!) and a new model behavior known as “answer-thrashing” https://www-cdn.anthropic.com/ ... [image]
2026-04-07 View on X
TechCrunch

Anthropic says it will make Claude Mythos Preview available to 40+ organizations that maintain critical software and doesn't plan to make it generally available

Anthropic on Tuesday released a preview of its new frontier model, Mythos, which it says will be used by a small coterie of partner organizations for cybersecurity work.

NEWS: Anthropic's new model, Claude Mythos, is so powerful that it is not releasing it to the public. Instead, it is starting a 40-company coalition, Project Glasswing, to allow cybersecurity defenders a head start in locking down critical software. https://www.nytimes.com/...
2026-04-07 View on X
Anthropic

Anthropic announces Project Glasswing, a cybersecurity initiative that will use its Claude Mythos Preview model to help find and fix software vulnerabilities

Today we're announcing Project Glasswing, a new initiative that brings together Amazon Web Services, Anthropic, Apple, Broadcom …

As always, the best stuff is in the system card. During testing, Claude Mythos Preview broke out of a sandbox environment, built “a moderately sophisticated multi-step exploit” to gain internet access, and emailed a researcher while they were eating a sandwich in the park. [image]
2026-04-07 View on X
Anthropic

Anthropic announces Project Glasswing, a cybersecurity initiative that will use its Claude Mythos Preview model to help find and fix software vulnerabilities

Today we're announcing Project Glasswing, a new initiative that brings together Amazon Web Services, Anthropic, Apple, Broadcom …