LessWrong (Curated & Popular)

"AIs can now often do massive easy-to-verify SWE tasks and I’ve updated towards shorter timelines" by ryan_greenblatt

Apr 6, 2026
A bold update on much shorter AI timelines and a bigger chance of full AI R&D automation within a few years. Stronger performance on massive, easy-to-verify software engineering tasks is highlighted. Tests, scaffolding, and cheap verification are explained as drivers. Practical experiments show AIs completing months of SWE work and speeding certain research workflows.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Test Suites Unlock Massive Iterative Gains

  • Easy-and-cheap-to-verify non-ideation SWE tasks (ESNI) scale because AIs can iteratively optimize against test suites and benchmarks.
  • Building test suites lets AIs run huge numbers of cheap iterations, spotting and fixing errors until performance improves substantially.
INSIGHT

ESNI Tasks Appear To Be In A Super-Exponential Regime

  • ESNI tasks are entering a super-exponential regime for 50% reliability time horizons because iterative error recovery compounds returns.
  • Ryan argues that once generality and error recovery exist, each doubling of allowed time becomes easier, accelerating progress.
INSIGHT

Checkability Doubles As Company Optimization And AI Self-Improvement

  • Checkability helps two ways: firms can optimize models on metrics and AIs can self-improve at runtime by iterating on cheap evaluations.
  • This explains a big gap between ESNI tasks and less-checkable benchmarks like MET A's suite.
Get the Snipd Podcast app to discover more snips from this episode
Get the app