reinforcement-learning — AI Digest

14 июн MaxProof: MiniMax Model Exceeds IMO and USAMO Gold-Medal Thresholds on Formal Math MiniMax research
16 июн DreamX-World 1.0: General-Purpose Interactive World Model with 6DoF Camera Control AMAP-ML (Alibaba Maps AI Lab) research
16 июн FastContext: Specialized Exploration Subagent Cuts Coding Agent Token Usage by 60% Microsoft / Shanghai Jiao Tong University research