Playful Agentic Robot Learning: Self-Directed Play Yields Transferable Robot Skills
UC Berkeley
Robotics Agent Teams (RATs) acquire skills through self-directed play before any downstream task is specified. During play, the agent generates novel exploratory tasks, writes and executes robot-code policies, diagnoses failures, retries with step-level feedback, and distills successes into a reusable code library. Play-learned skills improved held-out downstream performance by 20.6 and 17.0 percentage points over baselines on LIBERO-PRO and MolmoSpaces, and transferred to other Code-as-Policy agents without fine-tuning.
Why it matters
Demonstrates that unstructured pre-task play with code-based policies yields skills that generalize to unseen tasks and third-party agents — a step toward robots that self-improve before deployment. Received 42 upvotes on HuggingFace Daily Papers.
Importance: 2/5
42 HF Daily upvotes; play-acquired skills transfer to third-party agents without fine-tuning — a generalizable finding for embodied AI pre-deployment improvement.