#multimodal
- GLM-5V-Turbo: a natively multimodal foundation model for agents Z.ai research
- DeepSeek launches image recognition mode in a gray-release (limited rollout) test DeepSeek models-llm
- Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond HKUST/NUS/Oxford/NTU research