Daily digest
13 items · ~13 min · Week 2026-W20
Must-read (1)
Google Announces Gemini Intelligence for Android with Cross-App Automation
Google · At the Android Show: I/O Edition on May 12, 2026, Google announced Gemini Intelligence, a suite of AI features that includes multi-step task automation across apps, intelligent autofill, a speech-to-text tool called Rambler, and a natural-language widget builder. Gemini in Chrome will allow users to summarize and query web content. Features roll out to Samsung Galaxy and Pixel devices in summer 2026, with broader Android availability later in the year.
Worth knowing (6)
Anthropic in Talks to Raise $30 Billion at $900 Billion Valuation
Anthropic · Bloomberg reported on May 12, 2026 that Anthropic is in early talks to raise at least $30 billion at a valuation exceeding $900 billion, which would make it more valuable than OpenAI. The round could close as early as the end of May and is intended to fund computing infrastructure to meet growing demand for Claude. Anthropic is also reportedly considering an IPO as early as October 2026.
DeepSeek Seeks $7.35 Billion in First-Ever External Funding at $50B Valuation
DeepSeek · DeepSeek is in advanced talks to raise up to CNY 50 billion (~$7.35B) in its first-ever external funding round, which would value the previously self-funded Hangzhou lab at approximately $50–51.5 billion. China's state-backed National AI Industry Investment Fund is negotiating to lead the round, with Tencent and Alibaba also reportedly in discussions to participate.
Qwen-Image-2.0: Unified Image Generation and Editing at 2K Resolution, Top-1 on AI Arena
Alibaba · Qwen-Image-2.0 is a unified image generation and editing model combining Qwen3-VL as a condition encoder with a Multimodal Diffusion Transformer. It supports prompts up to 1,000 tokens, generates images at native 2K resolution, and achieves a top-1 ranking on AI Arena for both text-to-image and image editing, while reducing the parameter count from 20B to 7B versus its predecessor.
Google DeepMind Reimagines the Mouse Pointer with AI-Powered Gemini Integration
Google DeepMind · Google DeepMind published a research blog on May 12, 2026 detailing its AI-enabled pointer powered by Gemini, designed to understand both what users point at and why it matters contextually. The technology is being integrated into Chrome and a new device called Googlebook, with experimental demos available in Google AI Studio for image editing and map navigation. The system follows four interaction principles: maintain flow, show-and-tell, embrace natural shorthand, and turn pixels into actionable entities.
OpenAI Launches Daybreak AI Cybersecurity Initiative with GPT-5.5 Models
OpenAI · On May 12, 2026, OpenAI launched Daybreak, an AI-powered cybersecurity initiative combining GPT-5.5 and Codex Security to help organizations detect, validate, and patch vulnerabilities before exploitation. The platform offers three tiers: standard GPT-5.5, a Trusted Access for Cyber variant for authorized defensive work, and GPT-5.5-Cyber for red teaming. Founding partners include Akamai, Cisco, Cloudflare, CrowdStrike, Fortinet, Oracle, Palo Alto Networks, and Zscaler.
Google's Gemini 'Omni' Video Model Surfaces in Early Demos Ahead of I/O 2026
Google DeepMind · On May 11, 2026, a new 'Omni' video model appeared inside the Gemini app UI, with early demo clips from Gemini Pro users showcasing strong editing capabilities: watermark removal, in-chat object swaps, and scene rewrites. Its description reads 'remix your videos, edit directly in chat', and a single generation reportedly consumed 86% of one user's daily AI Pro quota, suggesting heavy compute requirements.
For reference (6)
NanoResearch: Co-Evolving Skills, Memory, and Policy for Personalized AI Research Automation
Shanghai AI Lab · NanoResearch is a multi-agent framework for personalized AI-driven research automation that co-evolves three components: a skill bank of reusable procedural knowledge, a memory module retaining user- and project-specific history, and a label-free policy learning mechanism that internalizes user preferences through free-form feedback. The system achieves a 100% end-to-end pipeline success rate in Round 1, outperforming all baselines.
TMAS: Scaling Test-Time Compute via Multi-Agent Synergy with Hierarchical Memory
TMAS scales test-time compute through structured multi-agent coordination, employing two hierarchical memory systems — an experience bank for reliable intermediate results and a guidelines bank for explored strategies — alongside a hybrid reward reinforcement learning scheme. The approach prevents redundant computation across parallel reasoning trajectories and achieves superior scaling on challenging reasoning benchmarks.
vLLM v0.21.0rc1: Python 3.14, CUDA 13.0, and Transformers v5 Compatibility
vLLM published release candidate v0.21.0rc1 on May 12, 2026, bringing PyTorch 2.11, Python 3.14 support, CUDA 13.0 as the new default, and compatibility with Transformers v5. This follows v0.20.2 (May 10), which was yanked due to a tensor parallelism bug.
Alibaba Upgrades Qwen AI Glasses S1 with Proactive AI and Spatial 3D Display
Alibaba · Alibaba pushed a significant software update to its Qwen AI Glasses S1, adding proactive AI that surfaces contextual reminders based on weather, location, and calendar data without user prompting, plus a spatial 3D display system. The update deepens integration with Chinese super-apps for ride-hailing, food delivery, and trip planning; the hardware remains China-only at ¥3,799 (~$537).
Yandex Maps Adds AI-Generated Landmark-Based Voice Guidance Across Russia
Yandex · Yandex announced on May 12, 2026 that its Maps and Navigator apps now deliver AI-generated voice prompts referencing recognizable urban landmarks, for example 'Turn right at the store' or 'Keep left at the monument in 200 meters.' The system covers over 10,000 landmarks across Russia, with AI determining the optimal placement of landmark cues along each route.
llama.cpp Adds gpt-oss-20b Support in May 12 Build
A llama.cpp release on May 12, 2026 added support for running OpenAI's gpt-oss-20b model locally, along with prebuilt binaries for macOS (Apple Silicon and Intel), Linux (Vulkan, ROCm, OpenVINO, SYCL backends), Android, and Windows with CUDA 12.4.