#long-context
- DeepSeek V4: official open-source release with Day-0 adaptation for Huawei Ascend (DeepSeek, models-llm)
- xAI completes Grok 4.3 API rollout with 1M context, native video, and ~40% price cut (xAI, models-llm)
- MiniMax Releases M3: Open-Weight Frontier Model with 1M-Token Context and MSA Architecture (MiniMax, models-llm)
- NVIDIA Nemotron 3 Ultra: Open 550B MoE Model Now Available for Agentic Workloads (NVIDIA, models-llm)
- MiniMax M3 Open Weights Released: 1M Context, MoE, Frontier Coding (MiniMax, models-llm)
- Zhipu AI Open-Sources GLM-5.2 Under MIT License with 1M Token Context (Zhipu AI, models-llm)
- Zhipu AI Releases GLM-5.2 Open Weights: 753B MoE with 1M-Token Context under MIT License (Zhipu AI / Z.ai, models-llm)
- SU-01: Gold-Medal-Level Olympiad Reasoning via Curriculum SFT and Two-Stage RL (SU-01 Team, research)
- RoPE Provably Fails at Long Contexts: Locality Bias and Token Consistency Both Break (research)
- Moonshot AI Releases Kimi K2.7-Code: 1T-Parameter Open-Weight Coding Model with Vision (Moonshot AI, models-llm)
- MiniMax Sparse Attention: 28× Compute Reduction at 1M-Token Context with No Quality Loss (MiniMax, research)
- MemLens: Benchmark for Multimodal Long-Term Memory in Vision-Language Models (NVIDIA, research)
- Echo-Infinity: Real-Time Infinite Video Generation via Learnable Memory Query (research)
- GitHub Copilot Gets 1M Token Context Window and Configurable Reasoning Levels (GitHub / Microsoft, tools)
- vLLM Adds Day-0 Support for MiniMax M3 Open Weights with 1M-Context Sparse Attention (MiniMax, tools)
- Do Language Models Need Sleep? Offline Recurrence as Memory Consolidation for Improved Inference (Google / CMU, research)
- Zhipu AI Releases GLM-5.2: 744B MoE with 1M-Token Context and Coding-First Design (Zhipu AI, models-llm)
- SubtleMemory: Benchmark Reveals Agents Systematically Fail Fine-Grained Relational Memory (research)
- SearchSwarm: Delegation Intelligence for LLM Agents in Long-Horizon Deep Research (research)