Calibration

Search documents
A Taxonomy for Next-gen Reasoning โ Nathan Lambert, Allen Institute (AI2) & Interconnects.ai
AI Engineerยท 2025-07-19 21:15
[Music] I really came to this thinking about trying to reflect on six months into this like reinforcement learning with verifiable rewards post 01 post deepseeek and I think that a lot of this stuff is somewhat boring because everybody has a reasoning model Um, we all know the basics of you can scale RL at training time and the numbers will go up and that's deeply correlated with being able to then do this inference time scaling. Um, but really in AI right now everybody there's a lot of people are up to spe ...