Mistral Releases OCR 4: State-of-the-Art Document Intelligence with On-Premises Deployment

Mistral AI

Models / LLM official + media 3 src. ~1 min

Mistral released OCR 4, a document intelligence model covering 170 languages that returns structured output including bounding boxes, typed-block classification (titles, tables, equations, signatures), and inline confidence scores. It tops OlmOCRBench at 85.20 with 72% average win rate in human preference studies, and deploys as a single container for on-premises use. Pricing is $4 per 1,000 pages via API, available on Mistral API, Amazon SageMaker, and Microsoft Foundry.

Why it matters

Combining best-in-class extraction quality with a self-hostable, single-container deployment addresses a major enterprise blocker — routing sensitive documents through third-party cloud APIs — positioning Mistral strongly in the enterprise document processing market.

Importance: 3/5

SOTA document AI with on-premises option; targets a clear enterprise gap between cloud-only competitors

Sources