Daily digest
12 items · ~12 min · Week 2026-W25
Must-read (1)
Moebius: 0.2B Lightweight Image Inpainting Framework Matches 11.9B FLUX Model
Huazhong University of Science and TechnologyMoebius introduces a 0.22B parameter image inpainting model that matches or surpasses FLUX.1-Fill-Dev (11.9B parameters) through a Local-λ Mix Interaction block that summarizes spatial context and global semantic priors into fixed-size linear matrices. Adaptive multi-granularity latent-space distillation delivers a 15× inference speedup.
Worth knowing (6)
ElevenLabs Music v2 API Goes Live with Genre-Switching and Inpainting
ElevenLabsElevenLabs opened its Music v2 model via the public API in mid-June 2026. The model supports section-by-section song construction, mid-track genre switching (e.g., opera to heavy metal in one piece), and inpainting of individual song segments. API pricing dropped up to 50% versus Music v1. Commercial licensing is included.
DeepSeek Closes $7.4 Billion Series A at $55 Billion Valuation, Led by Tencent and CATL
DeepSeekDeepSeek closed its first-ever external funding round on June 16, 2026, raising ~51 billion yuan ($7.4B) at a post-money valuation of roughly $55 billion. Tencent ($1.5B) and CATL ($740M) led external investors, while founder Liang Wenfeng personally committed $3B. The deal carries an unusual governance structure: commercial investors received no voting rights and a five-year lockup, while the state-backed National AI Industry Investment Fund received direct equity with exclusive voting rights.
How Transparent is DiffusionGemma? Interpretability Study Closes the Gap to Autoregressive Models
Google DeepMindThis paper investigates whether DiffusionGemma — a masked discrete-diffusion LM that reasons in continuous latent space — is harder to interpret than autoregressive models. By mapping intermediate denoising states through an interpretable token bottleneck, the authors reduce the apparent transparency gap from 28.6× to just 1.1× relative to Gemma 4, and identify diffusion-specific phenomena such as non-chronological reasoning and token smearing. Co-authored by Neel Nanda and Rohin Shah.
Mistral Rebrands Le Chat to Vibe: Unified Work and Code AI Agent
MistralMistral rebranded its Le Chat product to Vibe in June 2026, unifying work and coding capabilities under a single agent and a single license. Vibe includes Work Mode (a long-range task agent that picks its own tools and streams progress) and Code Mode (for remote coding and pull request creation), a new VS Code extension, and CLI updates for project-wide automation. All existing Le Chat conversations, settings, and plans carry over automatically.
OpenAI Codex Adds Record and Replay for Reusable Workflow Skills
OpenAIOpenAI shipped Record & Replay for Codex on June 18, 2026 (app version 26.616), allowing users to demonstrate a repetitive workflow once on macOS and have Codex convert it into a reusable SKILL.md file that accepts variable inputs. Unlike traditional RPA, the feature captures intent rather than pixel-exact coordinates, making it resilient to UI changes. Available to ChatGPT Plus, Pro, Business, Enterprise, and Edu subscribers outside the EU, UK, and Switzerland.
Runway Launches Studio: Integrated AI Video Editing Suite
RunwayOn June 18, 2026, Runway shipped Studio, a unified interface allowing users to trim, stitch, reorder, and export final videos without leaving the platform. The feature closes the loop between AI generation and post-production editing in one workspace.
For reference (5)
FAPO: Fully Autonomous Prompt Optimization of Multi-Step LLM Pipelines
Cisco Foundation AIFAPO evaluates multi-step LLM pipeline outputs, attributes failures to the specific step that caused them, proposes targeted prompt variants, validates them with an independent agent, and iterates until accuracy improves or budget is exhausted. It outperformed GEPA (state-of-the-art optimizer) in 15 of 18 model-benchmark pairs, with mean gains of +14.1 percentage points and +33.8 on tasks requiring structural prompt changes. Open-sourced under Apache 2.0.
Playful Agentic Robot Learning: Self-Directed Play Yields Transferable Robot Skills
UC BerkeleyRobotics Agent Teams (RATs) acquire skills through self-directed play before any downstream task is specified. During play, the agent generates novel exploratory tasks, writes and executes robot-code policies, diagnoses failures, retries with step-level feedback, and distills successes into a reusable code library. Play-learned skills improved held-out downstream performance by 20.6 and 17.0 percentage points over baselines on LIBERO-PRO and MolmoSpaces, and transferred to other Code-as-Policy agents without fine-tuning.
ChatGPT Adds Pronunciation Help in 60+ Languages and World Cup Hub
OpenAIOpenAI rolled out several ChatGPT improvements on June 18–19, 2026: audio and text pronunciation guidance for words in over 60 languages, a dedicated FIFA World Cup 2026 conversational experience covering schedules, predictions, and player storylines, more granular connected-app permission controls, improved chat organization with sidebar pinning and one-click sharing, faster iOS photo uploads, and per-message model selection on Android for paid users.
OpenCode v1.17.9 Released with GLM-5.2 Support and MCP Fixes
SSTOpenCode v1.17.9, released on June 21, 2026, adds high and max thinking variants for GLM-5.2 models, fixes Devstral model detection with varying provider ID casing, passes custom headers to Copilot model requests, and fixes OpenAI-compatible providers rejecting MCP tool schemas. Cloudflare AI Gateway API key passing and session timeline flicker are also fixed, and agent step limits now force a final text response rather than failing mid-run.
Claude Code v2.1.185 Improves API Stream-Stall Messaging
AnthropicVersion 2.1.185 (June 20, 2026) changes the stream-stall indicator from "No response from API · Retrying in …" to "Waiting for API response · will retry in …" and extends the threshold before the hint appears from 10 seconds to 20 seconds.