How METR measures Long Tasks and Experienced Open Source Dev Productivity - Joel Becker, METR
AI Engineer·2026-01-19 14:00

Here's the very simple argument. If you look at some notion of compute over time, you know, this could be R&D spending on compute, this could be experimental compute, it could be training compute, whatever some particular lab is using, it goes like this. No surprise. If you have another chart of log time horizon, let's say this METR measure from the figure that many of you would have seen on Twitter, over time it looks like, you know, let's ...
