Daily digest
15 items · ~15 min · Week 2026-W24
Must-read (4)
Gemini 3.5 Live Translate: Real-Time Speech-to-Speech in 70+ Languages
Google DeepMind · Google launched Gemini 3.5 Live Translate on June 9, 2026: a continuous speech-to-speech translation model covering 70+ languages that preserves the speaker's intonation, pacing, and pitch. Unlike turn-by-turn systems, it generates translated speech without turn boundaries, supporting 2,000+ language-pair combinations. Available immediately via the Gemini Live API and Google AI Studio for developers, in Google Translate on Android and iOS, and in private preview for Google Meet enterprise customers. All output audio is watermarked via SynthID.
Claude Fable 5 and Claude Mythos 5: Anthropic's Most Capable Model Goes Public
Anthropic · Anthropic released Claude Fable 5 on June 9, 2026, the first Mythos-class model made publicly available. It uses the same underlying architecture as Claude Mythos 5 but ships with three classifier-based safeguards (cybersecurity, biology/chemistry, distillation prevention) that fall back to Claude Opus 4.8 in restricted domains. Priced at $10/M input and $50/M output tokens, with 128k output token support. Free for Pro/Max/Team/Enterprise subscribers through June 22. Mythos 5 (unrestricted) remains gated to vetted cybersecurity researchers via Project Glasswing. Anthropic cited a 50-million-line codebase migration as a flagship real-world benchmark.
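For a rough sense of what the listed rates mean per request, here is a minimal Python sketch. Only the two prices come from the item above; the function name, the example token counts, and reading "128k" as 128,000 tokens are assumptions made for illustration.

```python
# Prices quoted in the item above (USD per million tokens).
INPUT_USD_PER_MTOK = 10.0
OUTPUT_USD_PER_MTOK = 50.0


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimated cost of one request at the listed per-million-token rates."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


# Hypothetical request: 200k tokens in, a maximum-length 128k-token response out
# (assumes "128k" means 128,000 tokens).
print(f"${estimate_cost_usd(200_000, 128_000):.2f}")  # prints "$8.40"
```

At these rates the output side dominates: the 128k-token response accounts for $6.40 of the $8.40 total.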
MiniMax M3 Open Weights Released: 1M Context, MoE, Frontier Coding
MiniMax · MiniMax released the open weights of M3 on HuggingFace on June 10, 2026, fulfilling the promise made at the June 1 API launch. M3 uses MiniMax Sparse Attention (MSA) to deliver 1M-token context at 1/20th the per-token compute of the prior generation, achieving 9× faster prefill and 15× faster decoding. It scores 59.0% on SWE-Bench Pro (surpassing GPT-5.5 and Gemini 3.1 Pro) and supports image and video inputs natively. API pricing: $0.60/$2.40 per million tokens input/output.
DRPO: Rethinking Divergence Regularization in LLM Reinforcement Learning
Tencent Hunyuan · DRPO (Divergence Regularized Policy Optimization, arXiv:2606.09821) replaces the hard gradient-masking used in PPO/DPPO with a smooth advantage-weighted quadratic regularizer. Instead of discarding updates when a token crosses trust-region boundaries, DRPO applies bounded, continuous gradient weights that both attenuate harmful divergences and supply corrective signals. The authors validate it across multiple model scales, architectures, and precision settings, reporting improved stability and efficiency over existing LLM RL training methods.
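To make the contrast between hard masking and smooth weighting concrete, here is a toy Python sketch. The first function is the gradient weight of the standard PPO clipped surrogate. The second is NOT DRPO's formula (the digest does not give it): it is an illustrative weight, chosen here, derived from a quadratic penalty `A*r - |A|/(2*eps) * (r-1)^2` and clamped to [-1, 1]. The function names and `eps=0.2` are assumptions.

```python
def hard_mask_weight(ratio: float, advantage: float, eps: float = 0.2) -> float:
    """Gradient weight of the PPO clipped surrogate with respect to the ratio.

    The weight is 1.0 until the ratio leaves the clip range in the direction
    the advantage pushes it, then 0.0: the token's update is discarded.
    """
    if advantage > 0 and ratio > 1 + eps:
        return 0.0
    if advantage < 0 and ratio < 1 - eps:
        return 0.0
    return 1.0


def smooth_weight(ratio: float, advantage: float, eps: float = 0.2) -> float:
    """Illustrative smooth alternative (an assumption, not the paper's method).

    d/dr [A*r - |A|/(2*eps) * (r-1)^2] = A * (1 - sign(A) * (r-1) / eps),
    so the weight on A shrinks continuously, reaches zero at the trust-region
    boundary, and turns negative beyond it (a corrective pull back).
    Clamping keeps the weight bounded.
    """
    sign = 1.0 if advantage >= 0 else -1.0
    w = 1.0 - sign * (ratio - 1.0) / eps
    return max(-1.0, min(1.0, w))


for r in (1.0, 1.1, 1.3, 2.0):
    print(r, hard_mask_weight(r, 1.0), round(smooth_weight(r, 1.0), 3))
```

In the toy, a token at ratio 1.3 with positive advantage gets no gradient under hard masking, whereas the smooth weight is negative and pushes the ratio back toward the trust region.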
Worth knowing (2)
Cohere North Mini Code: 30B Apache-2.0 MoE Coding Model for Agentic Workflows
Cohere · Cohere released North Mini Code 1.0 on June 9, 2026 under Apache 2.0. The model has 30B total parameters with only 3B active (MoE with 128 experts, 8 activated per token), using interleaved sliding-window and full self-attention. It targets agentic software engineering workflows, scoring 33.4 on Cohere's coding index. Available on HuggingFace in BF16 and FP8, integrated into OpenCode, and accessible via the Cohere API.
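The "128 experts, 8 activated per token" figure describes top-k expert routing. A minimal sketch of that general technique in plain Python follows; the expert counts come from the item above, while the random router scores and the choice to renormalize gate weights with a softmax over only the selected experts are assumptions, not details of Cohere's implementation.

```python
import math
import random

NUM_EXPERTS = 128  # total experts, from the item above
TOP_K = 8          # experts activated per token, from the item above


def route(router_logits: list[float], k: int = TOP_K) -> list[tuple[int, float]]:
    """Return (expert_index, gate_weight) pairs for the k highest-scoring experts.

    Gate weights are a softmax over the selected logits only, which is one
    common convention; real models differ on this detail.
    """
    order = sorted(range(len(router_logits)), key=lambda i: router_logits[i], reverse=True)
    top = order[:k]
    peak = max(router_logits[i] for i in top)  # subtract the max for numerical stability
    exps = [math.exp(router_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


# Hypothetical router scores for a single token.
random.seed(0)
logits = [random.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
chosen = route(logits)
print(len(chosen), "of", NUM_EXPERTS, "experts run for this token")
```

Only the selected experts' feed-forward blocks execute for the token, which is why the active parameter count (3B) is much smaller than the total (30B).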
Flow-DPPO: Principled RL Alignment for Flow Matching Image and Video Models
Tencent Hunyuan · Flow-DPPO (arXiv:2606.11025) argues that ratio-clipping PPO variants (Flow-GRPO, CPS) are structurally ill-suited for flow matching models because noisy per-step policy ratios produce inconsistent trust-region enforcement across trajectory positions. Flow-DPPO replaces ratio clipping with a divergence-based proximal constraint and leverages the Gaussian structure of per-step flow policies to compute exact KL divergences efficiently. The paper reports higher reward, better KL efficiency, reduced catastrophic forgetting, and stable multi-epoch training on image and video generation tasks.
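The reason Gaussian structure makes exact KL cheap is that the divergence between two Gaussians has a closed form, so no sampling or ratio estimation is needed. A minimal sketch for isotropic Gaussians follows; that the paper's per-step policies take exactly this isotropic form is an assumption here, and the function name is illustrative.

```python
import math


def gaussian_kl(mu_p: list[float], sigma_p: float,
                mu_q: list[float], sigma_q: float) -> float:
    """KL( N(mu_p, sigma_p^2 I) || N(mu_q, sigma_q^2 I) ) in closed form.

    For d dimensions:
      d*log(sigma_q/sigma_p) + (d*sigma_p^2 + ||mu_p - mu_q||^2) / (2*sigma_q^2) - d/2
    """
    d = len(mu_p)
    sq_dist = sum((a - b) ** 2 for a, b in zip(mu_p, mu_q))
    return (d * math.log(sigma_q / sigma_p)
            + (d * sigma_p ** 2 + sq_dist) / (2 * sigma_q ** 2)
            - d / 2)


# With equal standard deviations the expression reduces to
# ||mu_p - mu_q||^2 / (2 * sigma^2): here 1 / (2 * 0.25) = 2.0.
print(gaussian_kl([0.0, 0.0], 0.5, [1.0, 0.0], 0.5))  # prints 2.0
```

When old and new policies share the same noise scale at a step, the KL depends only on the squared distance between the two predicted means, which is what makes a per-step divergence constraint inexpensive to evaluate.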
For reference (9)
OpenAI Launches Economic Research Exchange for AI Impact Studies
OpenAI · OpenAI launched the OpenAI Economic Research Exchange on June 8, 2026, a program inviting external researchers to conduct privacy-protected studies on AI's effects on workers, firms, and the economy. Applications open through July 5, 2026, with selected researchers notified July 31. Participants get structured access to usage data under defined governance rules.
SearchSwarm: Delegation Intelligence for LLM Agents in Long-Horizon Deep Research
SearchSwarm (arXiv:2606.09730) introduces a multi-agent framework where a main LLM decomposes long research tasks and dispatches subtasks to specialized subagents that return only summarized results, so the main agent's context window is not filled with intermediate search traces. Training data is synthesized via a harness guiding high-quality decomposition. SearchSwarm-30B-A3B achieves 68.1 on BrowseComp and 73.3 on BrowseComp-ZH, the best results among comparable-scale open models. Weights, training data, and harness are being released as open source.
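The delegation pattern described above can be illustrated with a toy Python sketch. Nothing here is SearchSwarm's actual code or API: `run_subagent` and `main_agent` are hypothetical stand-ins, and the "search" is simulated with placeholder strings instead of model or tool calls.

```python
def run_subagent(subtask: str) -> dict:
    """Hypothetical subagent: does a long piece of work, keeps the full trace
    to itself, and exposes a short summary for the caller."""
    trace = [f"step {i}: searched for '{subtask}'" for i in range(50)]
    summary = f"Finding for '{subtask}' (condensed from {len(trace)} search calls)"
    return {"trace": trace, "summary": summary}


def main_agent(task: str, subtasks: list[str]) -> list[str]:
    """Hypothetical main agent: dispatches subtasks and keeps only summaries."""
    context = [f"TASK: {task}"]
    for sub in subtasks:
        result = run_subagent(sub)
        context.append(result["summary"])  # the 50-line trace never enters the main context
    return context


ctx = main_agent("compare three vendors", ["vendor A", "vendor B", "vendor C"])
print(len(ctx), "lines in the main context")  # prints "4 lines in the main context"
```

The main context grows by one line per subtask regardless of how much work each subagent did, which is the property that lets long-horizon tasks fit in a fixed window.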
SCAIL-2: End-to-End Character Animation via In-Context Conditioning
Tsinghua University · SCAIL-2 (arXiv:2606.10804) eliminates intermediate representations (pose skeletons, background masks) in controlled character animation by directly concatenating driving videos into the generation sequence. Key components: MotionPair-60K (new synthetic dataset), in-context mask conditioning, mode-specific RoPE for soft guidance, and Bias-Aware DPO to reduce synthetic artifacts. Achieves SOTA across multiple controlled animation tasks.
ABot-Earth 0.5: Generative 3D Urban World Model from Satellite Imagery
Alibaba AMAP CV Lab · ABot-Earth 0.5 (arXiv:2606.09967) synthesizes seamless 3D urban environments from geospatially referenced satellite imagery using 3D Gaussian Splatting with hierarchical level-of-detail for real-time web visualization. Generates realistic geometry and textures at under 10 minutes per square kilometer. Targets the simulation-to-reality gap for embodied AI applications such as UAV navigation.
Yandex Launches Drops: First AI Wearable Earbuds with Alice AI
Yandex · Yandex began sales of Yandex Drops on June 9, 2026. It is the company's first wearable AI device: wireless earbuds with an on-device chip for local wake-word detection and an always-on Alice AI. Priced at 8,990 rubles. The 'My Memory' feature converts voice notes into structured reminders and lists. Available exclusively via Alice AI chat through June 16, then in retail across Russia, Kazakhstan, and Belarus.
Claude Code v2.1.170: Claude Fable 5 Support Added
Anthropic · Claude Code v2.1.170 (June 9, 2026) adds support for the newly released Claude Fable 5 model. The preceding v2.1.169 (June 8) introduced a --safe-mode flag and /cd command; v2.1.166 (June 6) added fallbackModel configuration supporting up to three alternative models for resilience under API overload; v2.1.163 (June 4) introduced version requirement policies (requiredMinimumVersion/requiredMaximumVersion) and a /plugin list command.
OpenAI Codex CLI v0.139.0: Web Search in Code Mode and MCP Schema Fixes
OpenAI · Codex CLI v0.139.0 (June 9, 2026) allows code mode to call standalone web search directly and receive plaintext results. It also improves MCP tool schema preservation for complex tool inputs and the codex doctor diagnostic command. A pre-release v0.140.0-alpha.2 followed on June 10. Earlier v0.137.0 (June 4) added F13-F24 keybindings, monthly credit limit display for enterprise, and multi-agent v2 improvements.
OpenCode v1.17.0: fff File Search, Cohere North, and Session Recovery
SST · OpenCode v1.17.0 (June 10, 2026) adds faster file search via fff (Rust/SIMD-accelerated fuzzy finder), Cohere North model integration, Claude Fable 5 reasoning support, MCP tool improvements (abort signals, correct pagination), Java Maven workspace resolution, session recovery from provider context-overflow errors, WSL-backed Desktop on Windows, and improved sessions and servers UI.
OpenClaw 2026.6.5 Stable: MCP Tool Validation and Parallel Web Search
OpenClaw 2026.6.5 stable (June 9, 2026) follows several beta releases (beta.2–beta.6) over June 7–9. Key changes: new YYYY.M.PATCH versioning scheme, improved handling of AI model reasoning content, MCP tool result validation, Anthropic session recovery enhancements, and parallel web-search provider integration.
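One practical consequence of a YYYY.M.PATCH scheme is that versions should be compared numerically, not as strings, because the month is not zero-padded. A minimal sketch follows; that M is the release month (1 to 12) is an assumption, `parse_version` is a hypothetical helper (not part of OpenClaw), and "2026.10.1" is a made-up version used only for the comparison. Beta suffixes are deliberately not handled, since their exact format is not given above.

```python
def parse_version(version: str) -> tuple[int, int, int]:
    """Split a YYYY.M.PATCH string into a numerically comparable tuple.

    Raises ValueError on anything that is not three integer fields or
    whose month field falls outside 1..12 (an assumed constraint).
    """
    year, month, patch = (int(part) for part in version.split("."))
    if not 1 <= month <= 12:
        raise ValueError(f"month out of range in {version!r}")
    return year, month, patch


print(parse_version("2026.6.5"))                                # prints (2026, 6, 5)
print(parse_version("2026.6.5") < parse_version("2026.10.1"))   # prints True
print("2026.6.5" < "2026.10.1")                                 # prints False
```

The last two lines show why tuple comparison matters: as plain strings, "2026.6.5" sorts after "2026.10.1".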