Daily digest
9 items · ~9 min · Week 2026-W25
Worth knowing (5)
Kimi K2.7-Code HighSpeed: 6× Throughput for Production Coding Agent Pipelines
Moonshot AIOn June 15, 2026, Moonshot AI announced a HighSpeed variant of Kimi K2.7-Code, rolling out to Kimi Code Beta and Kimi Business users. The HighSpeed mode delivers approximately 180 tokens/second on median-length coding inputs and up to 260 tokens/second on shorter tasks — roughly six times faster than the standard release. The base K2.7-Code (1 trillion-parameter MoE, 32B active, 256K context) shipped on June 12, reporting +21.8% on Kimi Code Bench v2 and approximately 30% fewer reasoning tokens over K2.6.
DreamX-World 1.0: General-Purpose Interactive World Model with 6DoF Camera Control
AMAP-ML (Alibaba Maps AI Lab)DreamX-World is a general-purpose interactive world model that generates diverse, high-fidelity worlds from text or image prompts and allows users or agents to explore them via WASD-style 6DoF camera control. Trained on a mix of Unreal Engine data, gameplay footage, and real-world video, it supports 720P generation up to 7.5 seconds per clip and long-horizon rollouts up to one minute. Two variants are released under Apache 2.0: DreamX-World-5B-Cam (bidirectional, 5s) and DreamX-World-5B (autoregressive, long-horizon).
FastContext: Specialized Exploration Subagent Cuts Coding Agent Token Usage by 60%
Microsoft / Shanghai Jiao Tong UniversityFastContext decouples repository exploration from task-solving in LLM-based coding agents by introducing a dedicated exploration subagent (4B–30B parameters) that issues parallel read/glob/grep tool calls and returns compact file-path and line-range citations to the main solver. Training uses supervised fine-tuning followed by task-grounded reinforcement learning. Integrated into Mini-SWE-Agent, FastContext improves resolution rates by up to 5.5 percentage points on SWE-bench Multilingual, SWE-bench Pro, and SWE-QA, while cutting main-agent token usage by up to 60%.
NVIDIA SkillSpector: Open-Source Security Scanner for AI Agent Skills
NVIDIANVIDIA released SkillSpector (June 13, 2026), an open-source security scanner purpose-built for AI agent skills. It checks 64 vulnerability patterns across 16 categories, covering conventional software risks and agent-specific risks such as prompt injection, insecure data handling, and logic flaws. The tool is grounded in OWASP LLM guidance and MITRE ATLAS. An accompanying Snyk audit of 3,984 skills found that 26.1% contain vulnerabilities and 5.2% show likely malicious intent, including 1,467 malicious payloads such as trojans, cryptominers, and credential harvesters. The repository is available at github.com/NVIDIA/SkillSpector.
Claude Code 2.1.178: Parameterized Permission Rules and Nested Skills
AnthropicClaude Code version 2.1.178 (June 15, 2026) adds Tool(param:value) syntax for permission rules, enabling fine-grained matching on tool input parameters with wildcard support — for example, Agent(model:opus) can block Opus subagents specifically. Nested .claude/skills directories now load automatically when working in those directories, with name-clash resolution via <dir>:<name> namespacing. Auto mode now runs a classifier check before spawning subagents to prevent blocked actions from being delegated. Multiple bug fixes address OOM crashes from stale file-descriptor env vars, OAuth account mismatches in Chrome, subagent transcript handling, compaction fallback model, and VSCode CJK IME dismissal.
For reference (4)
OpenAI Launches Partner Network with $150M Investment
OpenAIOpenAI officially introduced the OpenAI Partner Network on June 14–15, 2026, a formal global partner program backed by $150 million targeting consulting firms, systems integrators, and technology specialists. The program has three tiers — Select, Advanced, and Elite — and aims to certify 300,000 consultants by end of 2026. Founding partners include Accenture, BCG, Bain, PwC, and McKinsey's QuantumBlack. OpenAI framed the initiative around the idea that the bottleneck for enterprise AI value is no longer model capability but implementation and workflow redesign.
Memory is Reconstructed, Not Retrieved: Graph Memory Improves LLM Agent Recall by 23%
National University of SingaporeMRAgent replaces the standard retrieve-then-reason memory paradigm with active reconstruction: agent memory is stored as a Cue-Tag-Content graph where associative tags act as semantic bridges. During inference the agent iteratively explores and prunes retrieval paths guided by intermediate reasoning evidence, avoiding combinatorial explosion. Evaluated on LoCoMo and LongMemEval benchmarks, MRAgent achieves up to 23% improvement over strong retrieval baselines.
OpenAI Codex CLI 0.140.0: Token Usage Tracking, Claude Code Import, and Amazon Bedrock Auth
OpenAICodex CLI 0.140.0 (June 15, 2026) ships token activity dashboards via /usage views (daily, weekly, cumulative), session deletion with confirmation guards via codex delete and /delete commands, and an /import command that reads Claude Code project configurations. Amazon Bedrock API authentication is now supported with encrypted local credential storage. A unified @ mentions menu replaces scattered context-injection entry points. The release also fixes corrupted SQLite auto-recovery, /review crashes, MCP server reliability issues, and plugin installation bugs.
OpenClaw v2026.6.8-beta.2: GLM-5.2 and Claude Haiku 4.5 Support, Rich Telegram Formatting
OpenClawOpenClaw v2026.6.8-beta.2 (June 16, 2026) adds support for GLM-5.2 and Claude Haiku 4.5 models and normalizes provider-qualified model IDs across OpenRouter and Google Vertex. Telegram delivery now supports structured rich text including tables, lists, and expandable blockquotes while preserving intentional line breaks. WhatsApp gains configured ACP bindings. Agent and gateway recovery is improved across DM sends, media completions, auto-reply handling, session restart aborts, and subagent operations. UI additions include collapsible workspace files, improved WebChat backscroll stability, and iOS gateway reconnection fixes.