Some LLM takeaways for 2025: reasoning became a signature feature, coding agents proved useful, subscriptions hit $200/month, and Chinese open-weight models impressed
It's that time. It's been a hell of a year. At the start we barely had reasoning models. X: Simon Willison / @simonw : Here's my enormous round-up of everything we learned about LLMs in 2025 - th...
Anthropic announces Claude Skills, a tool with folders of instructions, scripts, and resources that Claude can load to improve how it performs specific tasks
here's how they could supercharge your workflow Robert Brown / Implicator.ai : Claude taps Microsoft 365 as the connector war heats up X: Olivia Moore / @omooretweets : Like many of Anthropic's produc...
Google releases Gemini 2.0 Flash Thinking, an experimental “reasoning” model that “explicitly shows its thoughts” and can use them to strengthen its reasoning
Quick: what sort of prompts should you run against GPT-4o vs Gemini 1.5 Flash vs o1 vs o1-pro vs gemini-2.0-flash-thinking-exp? X: Jeff Dean / @jeffdean : Introducing Gemini 2.0 Flash Thinking, an exp...