Ollama v0.30.9: Cohere2Moe Support, Coding Agent Single-Token Output Bug Fixed
Ollama v0.30.9 (June 15) adds Cohere2Moe architecture support, fixes the LFM2 parser for cases where thinking was not emitted, and resolves a bug where coding agents invoked via Ollama output only a single token. It also adds an explicit error when a single message exceeds the context window.
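To illustrate the kind of condition the new explicit error covers, here is a minimal client-side sketch that rejects a single message estimated to be larger than the context window. It is not Ollama's implementation: the function names, the 4-characters-per-token heuristic, and the error text are all assumptions made for illustration, and real token counts depend on the model's tokenizer.

```python
import math


def estimate_tokens(text: str, chars_per_token: float = 4.0) -> int:
    """Rough token estimate (assumed heuristic; real tokenizers vary by model)."""
    return math.ceil(len(text) / chars_per_token)


def check_message_fits(message: str, context_window: int) -> None:
    """Raise ValueError if one message alone is estimated to exceed the window."""
    needed = estimate_tokens(message)
    if needed > context_window:
        # Hypothetical error text, not the message Ollama emits.
        raise ValueError(
            f"message needs ~{needed} tokens but the context window is {context_window}"
        )
```

A caller could run `check_message_fits(prompt, context_window)` before sending a request, so an oversized message fails with a clear error on the client side.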
Why it matters
The single-token output bug directly blocked users running Claude Code and similar coding agents locally via Ollama, so this fix restores local-first developer setups.
Importance: 2/5
Notable bug fix unblocking local coding-agent workflows via Ollama
Sources
official
Ollama v0.30.9 release