Measuring AI Ability to Complete Long Tasks | Not Hacker News!