MiniMax Releases M3: Open-Weight Frontier Model with 1M-Token Context and MSA Architecture

MiniMax


MiniMax officially released M3 on June 1, 2026. It is a frontier-class open-weight model built on the novel MiniMax Sparse Attention (MSA) architecture, which supports a 1-million-token context window at one-twentieth of the prior generation's per-token compute. The model natively accepts text, image, and video input and scores 59.0% on SWE-Bench Pro (above GPT-5.5 and Gemini 3.1 Pro). It is available via API; open weights and a technical report are promised on Hugging Face within 10 days.

Why it matters

M3 is the first Chinese open-weight model to combine frontier-level agentic coding, a genuine 1M-token context window, and native multimodality in a single architecture, directly challenging top closed-source models at 5–10% of their cost.

Importance: 4/5

A frontier open-weight model from a Chinese lab with claimed state-of-the-art SWE-Bench Pro results and a 1M-token context window. The release is confirmed by VentureBeat, SCMP, and MarkTechPost.

open-weights long-context multimodal agentic coding moe china

Sources

official MiniMax M3 — Coding & Agentic Frontier, 1M Context, Multimodal — MiniMax

media MiniMax-M3 debuts, eclipsing GPT-5.5 and Gemini 3.1 Pro on key benchmark for just 5-10% of the cost — VentureBeat

media MiniMax debuts AI model built for long and complex coding tasks — SCMP