DeepSeek launches image recognition mode in a limited gray-release test

DeepSeek

Models / LLM | media only | 4 src. | ~1 min

DeepSeek has opened a new Image Recognition Mode to a portion of its web and app users, the company's first consumer-facing multimodal image-understanding feature. The mode joins Quick Mode and Expert Mode. For now it supports only image understanding (viewing, reading, analysis), not image generation. Multimodal team lead Chen Xiaokang hinted at the launch with an image of a blue whale with its eye open.

Why it matters

This is DeepSeek's first step from a text-only model to a multimodal product, and an important signal coming just days after the V4 release.

Importance: 3/5

Notable expansion: DeepSeek's first multimodal product, covered by at least 3 independent primary media sources.

Sources