Daily digest
8 items · ~8 min · Week 2026-W27
Worth knowing (2)
DeepSeek V4 Stable Release Set for Mid-July with First Time-of-Day API Pricing
DeepSeekDeepSeek announced on June 30, 2026 that the official stable release of DeepSeek V4 is scheduled for mid-July, graduating from the April 24 preview. The announcement introduces the lab's first time-of-day API pricing: rates double during peak hours (9 AM–12 PM and 2–6 PM daily). V4 Pro carries 1.6T total / 49B active parameters and a 1M-token context window; V4 Flash has 284B total / 13B active parameters with the same context.
Claude Opus 4.8 and Haiku 4.5 Now Generally Available in Microsoft Azure Foundry
AnthropicAnthropic launched Claude Opus 4.8 and Claude Haiku 4.5 as generally available in Microsoft Azure Foundry on June 29, 2026. The GA release supports Azure-native deployment with existing Microsoft identity, billing, and governance controls, plus an optional US data zone for data residency. Models are live in East US and West Europe regions, running on NVIDIA GB300 Blackwell Ultra GPUs. Azure becomes the only cloud offering both Claude and GPT frontier models on one platform.
For reference (6)
Formalizing Latent Thoughts: Axiomatic Framework for Evaluating LLM Reasoning Representations
University of British ColumbiaIntroduces an axiomatic framework for evaluating latent thought representations in LLMs independent of downstream benchmark scores. Defines four axioms — Causality, Minimality, Separability, and Stability — with quantitative measures. Testing across 23 reasoning tasks on open-weight models reveals no model satisfies all four axioms simultaneously, and representations encode minimal information beyond what is already in input embeddings.
PhysisForcing: Physics-Reinforced World Models Improve Robot Manipulation Success by 50%
Peking University / NVIDIAPhysisForcing applies hierarchical physics supervision to video-generation-based world models for robot training: pixel-level trajectory alignment using reference point trajectories, and semantic-level relational alignment from a frozen video encoder. Improves closed-loop manipulation success from 16.0% to 24.0% and achieves 3.7–22.3% gains over baselines. Model-agnostic approach demonstrated on Cosmos3-Nano.
Claude Code v2.1.196: Org Default Models and MCP Security Fix for Untrusted Repos
AnthropicClaude Code v2.1.196 shipped June 29, 2026. Key additions: org admins can set default models in the console (shown as 'Org default' in /model); security fix prevents MCP servers from spawning via .mcp.json in repos that carry a committed .claude/settings.json (supply-chain attack vector); background agents auto-resume when the daemon restarts; streaming idle watchdog enabled by default; /code-review token usage cut ~25%; terminal rendering work reduced ~37% per frame.
Cursor v3.9 iOS Public Beta: Launch and Steer Cloud Agents Remotely from Phone
CursorCursor version 3.9 shipped June 29, 2026, introducing an iOS public beta available to all paid subscribers. The mobile app lets users launch cloud agents via voice and slash commands, direct agents running on a desktop via Remote Control, track agent progress on the iOS lock screen via Live Activities with push notifications, and review diffs, logs, and screenshots as Artifacts in the app.
Yandex Launches Developer Platform for Building AI Agents Inside Alice AI
YandexYandex unveiled a developer platform for building, testing, and deploying AI agents inside Alice AI on June 29, 2026. The platform enables agents to understand natural language, plan multi-step action chains, and adapt to user context. Initial agents from Yandex Taxi and Yandex Lavka are live in limited testing; agents from Yandex Delivery and Yandex Market are next. External partners will get access by end of 2026.
OpenClaw v2026.6.11-beta.2: Slack Relay Mode and Codex Partial Delta Fix
OpenClawOpenClaw v2026.6.11-beta.2 released June 28, 2026 (308 merged PRs). Key additions: Slack relay mode for channel-proxied agent interaction; native Mattermost /oc_queue command support; Codex partial delta handling for long-context stability; externalized official plugins for faster per-plugin security updates; Android settings improvements.