chain-of-thought — AI Digest

9 мая OpenAI Discloses Accidental Chain-of-Thought Grading in RL Training Across Six Models OpenAI research
6 июн The Deterministic Horizon: Information-Theoretic Proof That Extended CoT Fails and Tool Use Is Necessary research
25 июн Quantized Reasoning Models Think They Need to Think Longer, but They Do Not Meta research