Daily digest
12 items · ~12 min · Week 2026-W19
Must-read (1)
Anthropic and OpenAI Both Launch Enterprise AI Deployment Ventures on the Same Day
Anthropic / OpenAI · On May 4, Anthropic officially launched a $1.5B enterprise AI services joint venture backed by Blackstone, Hellman & Friedman, Goldman Sachs, Apollo, General Atlantic, GIC, Leonard Green, and Sequoia, with each anchor contributing ~$300M. The standalone firm will embed Anthropic engineers directly inside mid-market client companies to redesign workflows around Claude agents. Hours later, OpenAI finalized 'The Deployment Company,' a $10B vehicle backed by TPG, Brookfield, Advent, Bain Capital, and 15 other investors, with OpenAI guaranteeing PE backers a 17.5% annual return over five years. Both ventures follow Palantir's forward-deployed-engineer model and target private equity portfolio companies in healthcare, manufacturing, financial services, and logistics.
Worth knowing (5)
ElevenLabs Launches ElevenMusic: AI Music Creation, Remixing, and Streaming in One Platform
ElevenLabs · ElevenLabs launched ElevenMusic on April 29–30, combining AI music generation, remixing, and streaming. Users can create songs from lyrics, melodies, or mood prompts, or remix existing tracks by genre and tempo. Approximately 4,000 human artists are available for streaming or remixing at launch, with royalties tied to listener engagement. Free tier: 7 songs/day; Pro: $9.99/month for up to 500 tracks. Available as an iOS app and as an integrated feature within ElevenLabs.
Suno Eyes $5B Series D Valuation as AI Music Platform Reaches 100 Million Users
Suno · Axios reported on May 4 that Suno is in talks to close a Series D at a valuation exceeding $5 billion, more than double the $2.45B valuation of its Series C in November 2025. The company has reached 100 million users and generates over 7 million songs per day. Separately, Suno announced plans to reimagine its acquired concert-discovery platform Songkick using AI, hiring a general manager to integrate live-event data into the platform.
Midjourney V8.1 Goes GA with 3× Faster HD Mode and Restored Image Prompts
Midjourney · Midjourney released V8.1 on April 30, available on both midjourney.com and Discord. HD mode is 3× faster and 3× cheaper; standard resolution is 50% faster and 25% cheaper. V8.1 restores image prompts and image weights, adds a Prompt Shortener, updates Describe for longer results, and produces native 2K resolution. Moodboards and Style References are now significantly more stable versus V8.0 Alpha.
AI2 Open-Sources MolmoAct2: Robotics VLA That Claims to Beat GPT-5 on Embodied Reasoning
AI2 · The Allen Institute for AI released MolmoAct2, an open-source robotic control system built around MolmoER, a vision-language model trained on 3.3M samples for spatial reasoning. The release includes three new datasets (among them the largest open bimanual dataset to date, with 720 hours of teleoperated trajectories), an open-source action tokenizer (OpenFAST), and MolmoThink, an adaptive reasoning mechanism that re-predicts depth tokens only for changed scene regions to reduce latency. Full model weights, training code, and datasets are released publicly.
UniVidX: One Diffusion Backbone for RGB, Intrinsic Maps, and RGBA Video Generation
UniVidX proposes a single framework that handles multiple video generation tasks (RGB synthesis, intrinsic map generation, and RGBA layer decomposition) without separate models. Three components enable this: Stochastic Condition Masking (SCM) randomly partitions modalities into conditions and targets during training; Decoupled Gated LoRA (DGL) applies per-modality adaptations; Cross-Modal Self-Attention (CMSA) shares information across modalities. The system achieves competitive performance while training on fewer than 1,000 videos.
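The random partitioning that SCM is described as doing can be illustrated with a short Python sketch. The modality names, the uniform choice of split size, and the rule that at least one modality always remains a target are assumptions made for illustration; the summary above does not state how UniVidX actually samples the split.

```python
import random

# Hypothetical modality names; the digest does not list the exact set.
MODALITIES = ["rgb", "albedo", "normal", "alpha"]

def stochastic_condition_mask(modalities, rng):
    """Randomly split modalities into (conditions, targets).

    Assumption: at least one modality is always left as a target so
    every training step has something to predict.
    """
    shuffled = list(modalities)
    rng.shuffle(shuffled)
    # Number of conditioning modalities: 0 .. len-1, so targets is never empty.
    n_cond = rng.randint(0, len(shuffled) - 1)
    conditions = sorted(shuffled[:n_cond])
    targets = sorted(shuffled[n_cond:])
    return conditions, targets

rng = random.Random(0)
for step in range(3):
    cond, tgt = stochastic_condition_mask(MODALITIES, rng)
    print(f"step {step}: conditions={cond} targets={tgt}")
```

Because the split changes every step, one backbone sees each modality both as an input and as an output over the course of training, which is what would let a single model cover several tasks.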
For reference (6)
Odysseus: Training VLMs for 100+ Turn Interactive Decision-Making via RL
Princeton University · Odysseus trains vision-language models to play Super Mario Land for 100+ consecutive decision turns using a PPO variant with a lightweight turn-level critic. Pretrained VLMs provide strong action priors that significantly improve sample efficiency versus classic deep RL from scratch. The framework achieves at least 3× the average game progress of frontier models while preserving general-domain VLM capabilities.
Meta Publishes Preparedness Report for Code World Model Before Open-Weight Release
Meta · Meta's safety team published a pre-release evaluation of Code World Model (CWM), a code generation model, assessing potential catastrophic risk across identified threat domains and evaluating misalignment tendencies. The report concludes CWM does not pose additional frontier risks beyond those present in the current AI ecosystem and clears it for open-weight release.
Claude Code v2.1.128: MCP Tool Count Visibility, ZIP Plugin Archives, and Bug Fixes
Anthropic · Claude Code v2.1.128 (May 4) adds MCP server tool-count display in /mcp, flagging servers that report 0 tools. Other changes: --plugin-dir now accepts .zip archives; --channels works with console (API key) auth; the /model picker collapses duplicate Opus 4.7 entries; subprocesses no longer inherit OTEL_* environment variables; workspace is now a reserved MCP server name; and reconnecting MCP servers no longer flood the conversation with full tool-name lists. Bug fixes cover focus mode dimming, a desktop notification glitch on /exit in Kitty, drag-and-drop image upload hangs, a large-input crash (>10 MB via stdin), and parallel shell tool call failures.
OpenClaw 2026.5.3: File Transfer Plugin and Cross-Platform Messaging Reliability
OpenClaw released version 2026.5.3 on May 4. The headline addition is a bundled file-transfer plugin enabling binary file operations on paired nodes with a default-deny per-node path policy. Additional changes: lazy-loading of discovery, cron, and metadata functions for faster startup; enhanced channel reliability across Discord, Telegram, Matrix, and Slack. Beta builds on the same day added a /steer command for mid-turn steering and gateway configuration that fails closed on invalid settings. OpenClaw is an open-source autonomous AI agent using messaging platforms as its UI (~247K GitHub stars as of March 2026).
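A default-deny per-node path policy of the kind described above can be sketched in a few lines of Python. This is an illustration of the general idea only: the policy table, node names, and the is_allowed helper are hypothetical and are not taken from OpenClaw's code or configuration format.

```python
from pathlib import PurePosixPath

# Hypothetical policy table: node name -> directory roots it may touch.
POLICY = {
    "laptop": ["/home/user/shared"],
    "server": ["/srv/exports", "/tmp/agent"],
}

def is_allowed(node, path, policy=POLICY):
    """Return True only if `path` sits under a root explicitly allowed for `node`."""
    target = PurePosixPath(path)
    # Reject relative paths and any ".." component instead of resolving them.
    if not target.is_absolute() or ".." in target.parts:
        return False
    # An unknown node has no roots, so the loop never runs and access is denied.
    for root in policy.get(node, []):
        root_path = PurePosixPath(root)
        if target == root_path or root_path in target.parents:
            return True
    return False

print(is_allowed("server", "/srv/exports/report.bin"))
print(is_allowed("server", "/etc/passwd"))
```

The key property is that nothing is permitted unless a rule names it: a missing node, a path outside every listed root, or a traversal attempt all fall through to the final return False.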
OpenCode v1.14.34–35: PTY Websocket Auth Tickets and Diff Rendering Fix
SST · OpenCode released two versions on May 4–5. v1.14.34 adds PTY connection tickets for authenticated terminal websockets, v2 session failure events for client-side error detection, and improved Bash/PowerShell/cmd session handling; also fixes structured error responses from the HTTP effect server, Azure Anthropic model resolution, and web UI proxy asset loading. v1.14.35 fixes diff patch boundaries so session diffs render correctly when file contents include 'diff --git' text.
Cursor Adds Granular Model Controls, Spend Limits, and Usage Analytics for Teams
Cursor · Cursor's May 4 changelog introduces granular model and provider allow/blocklists: teams can restrict models by speed or context-window size. Soft spend limits send alerts at 50%, 80%, and 100% of budget thresholds. Usage analytics are now filterable by user and product surface (chat, completions, inline edit, etc.).
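The soft spend limit behaviour (alert at 50%, 80%, and 100% of budget, without blocking usage) can be sketched as a threshold-crossing check. The function name and the alert-once-per-crossing rule are assumptions for illustration; the changelog summary does not describe how Cursor implements it.

```python
# Thresholds taken from the changelog summary: 50%, 80%, 100% of budget.
ALERT_THRESHOLDS = (0.5, 0.8, 1.0)

def newly_crossed(previous_spend, current_spend, budget):
    """Return the thresholds crossed when spend moves from previous to current.

    Assumption: each threshold alerts once, at the moment spend first
    reaches it. Being "soft", the limit only reports and never blocks.
    """
    if budget <= 0:
        raise ValueError("budget must be positive")
    return [
        t for t in ALERT_THRESHOLDS
        if previous_spend < t * budget <= current_spend
    ]

# A jump from $40 to $85 on a $100 budget crosses the 50% and 80% marks.
print(newly_crossed(40, 85, 100))
```

Comparing against the previous spend rather than only the current total is what keeps an alert from firing again on every later usage update.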