Apollo Research (Company)

Coverage Timeline

2025-09-18

ZDNET 3 related

OpenAI and Apollo Research trained o3 and o4-mini versions to not engage in “scheming”, or secretly pursuing some other agenda, reducing “covert actions” ~30X

ZDNET's key takeaways — Several frontier AI models show signs of scheming.

2025-09-18 View

2025-05-23

TechCrunch 3 related

Apollo Research, which Anthropic partnered with to test Opus 4, recommended against deploying an early version due to its tendency to “scheme” and deceive

Claude Opus 4 is our most intelligent model to date, pushing the frontier in coding …

2025-05-23 View

2024-12-07

Apollo Research 5 related

An evaluation of six frontier AI models for in-context scheming when strongly nudged to pursue a goal: only OpenAI's o1 was capable of scheming in all the tests

It presents a new safety challenge that OpenAI is trying to address. — techcrunch.com/2024/12/05/o... Anders Sandberg / @arenamontanus : In an IVA discussion on AI yesterday evening professor Kristi...

2024-12-07 View

Loading articles...

Apollo Research

Top Voices

Explore Further

Coverage Timeline

OpenAI and Apollo Research trained o3 and o4-mini versions to not engage in “scheming”, or secretly pursuing some other agenda, reducing “covert actions” ~30X

Apollo Research, which Anthropic partnered with to test Opus 4, recommended against deploying an early version due to its tendency to “scheme” and deceive

An evaluation of six frontier AI models for in-context scheming when strongly nudged to pursue a goal: only OpenAI's o1 was capable of scheming in all the tests

Quarterly Coverage

Top Sources

Narrative

Relationships