2025-12-04
I don't see why this has to be true. Isn't it at least possible that RL is a necessary step to build the scaffolding for learning on the job? Sure, humans don't rehearse every software task, but we also don't pop out of the womb fully baked either! [image]
Dwarkesh Podcast
Thoughts on AI progress and why AI labs' actions hint at a worldview in which AI models will continue to fare poorly at generalization and on-the-job learning
Why I'm moderately bearish in the short term, and explosively bullish in the long term — What are we scaling? X: @sriramk , @_simonsmith , @dwarkesh_sp , @emollick , @dwarkesh_sp...