Q&A with Andrej Karpathy on AGI still being a decade away, why reinforcement learning is terrible, superintelligence, his AI education startup Eureka, and more
AGI is still a decade away (via) Extremely high signal 2 hour 25 minute (! … X: Ashpreet Bedi / @ashpreetbedi : This is exactly why we recommend keeping it simple and focusing on clarity and reliabili...
GPT-5 hands-on: it exudes competence but doesn't feel like a dramatic leap ahead of other LLMs, and the pricing is aggressively competitive with other providers
And It Changes Everything Tyler Cowen / Marginal Revolution : GPT-5, a short and enthusiastic review GPT-5 : GPT-5 — Our hands-on review of OpenAI's newest model based on weeks of testing — The Ve...
Anthropic tested Claude's ability to manage a physical “storefront” to mixed results, as the AI struggled with pricing strategy and inventory management
Anthropic had sonnet-3.7 run a shop in their SF headquarters. It was tasked with running s profitable business — Their eye popping experiment is worth the read — Was it successful? No, it was to...
OpenAI rolls out a ChatGPT memory feature that references past chats for answers, starting with ChatGPT Pro and Plus subscribers, but not in the UK and the EEA
and That Changes Everything Rimjhim Singh / Business Standard : OpenAI rolls out ChatGPT memory update, now for pro subscribers only OpenAI : Memory FAQ — Learn more about managing your memories in ...
A look at GPT-4.5's claimed performance, including on coding benchmarks, where it matches or outperforms GPT-4o but falls short of OpenAI's Deep Research
Bigger, smarter, and more powerful Samantha Kelly / CNET : OpenAI Says Its New ChatGPT 4.5 Has Better Emotional Intelligence Cristina Criddle / Financial Times : OpenAI reveals GPT-4.5 amid flurry of ...
OpenAI unveils o3 and o3-mini, trained to “think” before responding via what OpenAI calls a “private chain of thought”, and plans to launch them in early 2025
12 Days of OpenAI: Day 12 Naomi Li Gan / Tech in Asia : OpenAI unveils AI model for advanced reasoning Bojan Stojkovski / Interesting Engineering : OpenAI unveils o3 reasoning AI model to tackle compl...
o3, trained on the ARC-AGI-1 Public Training set, scored 87.5% on ARC Prize's Semi-Private Evaluation in a high-compute configuration; GPT-4o scored 5% in 2024
This is “The AI Economy,” a weekly LinkedIn-first newsletter … Sharon Goldman / Fortune : Sam Altman says OpenAI's new o3 ‘reasoning’ models begin the ‘next phase’ of AI. Is this AGI? Pradeep Viswanat...
Google releases Gemini 2.0 Flash Thinking, an experimental “reasoning” model that “explicitly shows its thoughts” and can use them to strengthen its reasoning
Quick: what sort of prompts should you run against GPT-4o vs Gemini 1.5 Flash vs o1 vs o1-pro vs gemini-2.0-flash-thinking-exp? X: Jeff Dean / @jeffdean : Introducing Gemini 2.0 Flash Thinking, an exp...
OpenAI's o1 models aren't a straightforward upgrade to GPT-4o, as they introduce some major cost and performance trade-offs in exchange for improved “reasoning”
delving into OpenAI's new ‘o1’ model PYMNTS.com : OpenAI's ‘Strawberry’ Model Sparks Fresh Discussions on AI Capabilities M.G. Siegler / Spyglass : OpenAI Reasons ‘o1’ is a Better Name than ‘Strawberr...
OpenAI releases o1, the first of its rumored reasoning-focused Strawberry models, in preview, alongside a smaller o1-mini, for ChatGPT Plus and Team subscribers
Advancing cost-efficient reasoning. — Contributions Sabrina Ortiz / ZDNET : OpenAI trained its new o1 AI models to think before they speak - how to access them Ethan Mollick / One Useful Thing : Som...