MiniMax Releases M3: Open-Weight Frontier Model with 1M-Token Context and MSA Architecture
MiniMax
MiniMax officially released M3 on June 1, 2026, a frontier-class open-weight model built on the novel MiniMax Sparse Attention (MSA) architecture supporting a 1-million-token context window at one-twentieth the per-token compute of the prior generation. The model natively accepts text, image, and video input, scores 59.0% on SWE-Bench Pro (above GPT-5.5 and Gemini 3.1 Pro), and is available via API; open weights and a technical report are promised on Hugging Face within 10 days.
Why it matters
First Chinese open-weight model to combine frontier-level agentic coding, a genuine 1M-token context window, and native multimodality in a single architecture — directly challenging top closed-source models at 5–10% of the cost.
Importance: 4/5
Frontier open-weight model from Chinese lab with SOTA SWE-Bench Pro claims and 1M context; confirmed by VentureBeat, SCMP, and MarkTechPost.