DeepSeek launches image recognition mode in a gray-scale test
DeepSeek
DeepSeek opened a new Image Recognition Mode to a portion of web and app users — the company's first consumer multimodal image understanding. The mode joined Quick Mode and Expert Mode; for now, only understanding is supported (viewing, reading, analysis), not generation. Multimodal team lead Chen Xiaokang hinted at the launch with an image of a blue whale with an open eye.
Why it matters
DeepSeek's first step from a purely text model to a multimodal product — an important signal following the V4 release a few days earlier.
Importance: 3/5
Notable expansion: DeepSeek's first multimodal product + ≥3 independent primary-media.