Ollama v0.30.9: Cohere2Moe Support, Coding Agent Single-Token Output Bug Fixed

Tools official 1 src. ~1 min

Ollama v0.30.9 (June 15) adds Cohere2Moe architecture support, fixes the LFM2 parser for cases where thinking was not emitted, and resolves a bug where coding agents invoked via Ollama output only a single token. Also adds an explicit error when a single message exceeds the context window.

Why it matters

The single-token output bug directly blocked users running Claude Code and similar coding agents locally via Ollama — this fix unblocks local-first developer setups.

Importance: 2/5

Notable bug fix unblocking local coding-agent workflows via Ollama

Sources