#chain-of-thought
- OpenAI Discloses Accidental Chain-of-Thought Grading in RL Training Across Six Models OpenAI research
- The Deterministic Horizon: Information-Theoretic Proof That Extended CoT Fails and Tool Use Is Necessary research
- Quantized Reasoning Models Think They Need to Think Longer, but They Do Not Meta research