#distillation
- Moebius: 0.2B Lightweight Image Inpainting Framework Matches 11.9B FLUX Model (Huazhong University of Science and Technology; research)
- Causal Forcing++: 2-Step Distillation Enables Real-Time Interactive Video Generation (Tsinghua University; research)
- SDAR: Self-Distilled Agentic Reinforcement Learning for Multi-Turn Agents (Zhejiang University / Meituan; research)
- ThoughtFold: Introspective Preference Learning Cuts Reasoning Tokens by 56% Without Accuracy Loss (research)
- AnyFlow: Any-Step Video Diffusion with On-Policy Flow Map Distillation (MIT / NVIDIA; research)
- TrOPD: Trust-Region On-Policy Distillation Stabilizes LLM Training When Teacher-Student Gap Is Large (Samsung Research; research)
- DanceOPD: On-Policy Generative Field Distillation for Unified Image Generation (ByteDance Seed; research)
- Anthropic Accuses Alibaba of Largest Known Claude Distillation Attack: 28.8M Conversations (Anthropic; industry)
- On the Geometry of On-Policy Distillation: A Training Paradigm Distinct from SFT and RLVR (Hong Kong University of Science and Technology; research)
- Weak Critics Make Strong Learners: On-Policy Critique Distillation for Scalable Oversight (Rutgers University; research)
- ZPPO: Teacher-in-Prompts Knowledge Distillation Outperforms Gradient Methods for Small Reasoners (NVIDIA; research)