Thinking through how pretraining vs. RL learn | Not Hacker News!