5 points | by matt_d 12 hours ago
1 comments
I think a lot of SOTA models are already going this way. Long autonomous tasks in claude code / codex already do this to stay on track and avoid multiplying errors over many steps.
I think a lot of SOTA models are already going this way. Long autonomous tasks in claude code / codex already do this to stay on track and avoid multiplying errors over many steps.