speech — AI Digest

13 May Thinking Machines Lab Unveils TML-Interaction-Small: 276B MoE Real-Time Multimodal Model Thinking Machines Lab models-llm
15 May EVA-Bench: End-to-End Framework for Evaluating Voice Agents ServiceNow AI research
10 Jun Gemini 3.5 Live Translate: Real-Time Speech-to-Speech in 70+ Languages Google DeepMind audio
3 May MiniCPM-o 4.5: Real-Time Full-Duplex Omni-Modal AI on Edge Devices OpenBMB / Tsinghua University research
6 Jun Audio Interaction Model: Unified Streaming Framework Combining Offline and Real-Time Audio Instruction Following research
4 May xAI Grok Voice Mode Coming to Apple CarPlay, App Build Reveals xAI tools