#research
- Google DeepMind Invests $75M in A24, Forms First AI Research Partnership with a Film Studio Google DeepMind industry
- RoPE Provably Fails at Long Contexts: Locality Bias and Token Consistency Both Break research
- MiniMax Sparse Attention: 28× Compute Reduction at 1M-Token Context with No Quality Loss MiniMax research
- MaxProof: MiniMax Model Exceeds IMO and USAMO Gold-Medal Thresholds on Formal Math MiniMax research
- Google DeepMind Publishes AlphaEvolve One-Year Impact Report Google DeepMind research
- Crafter: Multi-Agent Harness for Editable Scientific Figure Generation Scores +16pt Over Baselines (103 HF Upvotes) Tsinghua University research
- GrepSeek: Training Search Agents for Direct Corpus Interaction via Shell Commands (93 HF Upvotes) University of Massachusetts Amherst research
- EvoArena: LLM Agents Score Only 40% on Dynamic Evolving Environments MIT / NUS / Salesforce research
- WeaveBench: Computer-Use Agents Fail at Hybrid GUI+CLI Tasks — 41% Pass Rate Microsoft Research research
- InterleaveThinker: RL Planner+Critic Pipeline for Interleaved Text-and-Image Generation CUHK Multimedia Lab research
- OpenAI Launches Economic Research Exchange for AI Impact Studies OpenAI industry
- BetaPRM: Uncertainty-Aware Process Rewards Cut Reasoning Token Use by 33% research
- Google DeepMind and Partners Launch $10M Multi-Agent AI Safety Research Fund Google DeepMind industry
- Anthropic Publishes First Public Record: 52,000-Person Survey on US AI Attitudes Anthropic research
- Google DeepMind Takes Minority Stake in CCP Games for Multi-Agent Research in EVE Online Google DeepMind industry