arikagan_ · TEXXR

2025-12-04

I don't see why this has to be true. Isn't it at least possible that RL is a necessary step to build the scaffolding for learning on the job? Sure, humans don't rehearse every software task, but we also don't pop out of the womb fully baked either! [image]

2025-12-04 View on X

Dwarkesh Podcast

Thoughts on AI progress and why AI labs' actions hint at a worldview in which AI models will continue to fare poorly at generalization and on-the-job learning

Why I'm moderately bearish in the short term, and explosively bullish in the long term — What are we scaling? X: @sriramk , @_simonsmith , @dwarkesh_sp , @emollick , @dwarkesh_sp...

View original