2025-12-20
This hits because thinking about why models behave that way starts to matter once they're running in systems, not just passing benchmarks.
karpathy
2025 LLM Year in Review: shift toward RLVR, Claude Code emerged as the first convincing example of an LLM agent, Nano Banana was paradigm shifting, and more
Andrej Karpathy / karpathy :