2025-10-03
Really recommend checking out the APEX launch—this is an incredibly dense set of tasks curated and approved by the world's leading experts. Models have improved by a shocking amount in the last few years, but there's so much more to come. The economic impact will be staggering.
Mercor
Mercor launches the AI Productivity Index (APEX), which evaluates AI models' ability to perform “economically valuable knowledge work”; GPT-5 leads at 64.2%
still not production-ready Nikita Ostrovsky / Time : AI Is Learning to Do the Jobs of Doctors, Lawyers, and Consultants arXiv.org : The AI Productivity Index (APEX) Agnee Ghosh / B...