#speech
- Thinking Machines Lab Unveils TML-Interaction-Small: 276B MoE Real-Time Multimodal Model Thinking Machines Lab models-llm
- EVA-Bench: End-to-End Framework for Evaluating Voice Agents ServiceNow AI research
- Gemini 3.5 Live Translate: Real-Time Speech-to-Speech in 70+ Languages Google DeepMind audio
- MiniCPM-o 4.5: Real-Time Full-Duplex Omni-Modal AI on Edge Devices OpenBMB / Tsinghua University research
- Audio Interaction Model: Unified Streaming Framework Combining Offline and Real-Time Audio Instruction Following research
- xAI Grok Voice Mode Coming to Apple CarPlay, App Build Reveals xAI tools