#diffusion
- Pixal3D: Pixel-Aligned Image-to-3D Generation Accepted at SIGGRAPH 2026 (Tencent ARC Lab, research)
- UniVidX: One Diffusion Backbone for RGB, Intrinsic Maps, and RGBA Video Generation (research)
- Mean Mode Screaming: Training Pathology Fix Enables 1000-Layer Diffusion Transformers (research)
- Flow-OPD: On-Policy Distillation Pushes GenEval +29 Points on Stable Diffusion 3.5 (research)
- Qwen-Image-2.0: Unified Image Generation and Editing at 2K Resolution, Top-1 on AI Arena (Alibaba, research)
- Asymmetric Flow Models: SOTA 1.57 FID on ImageNet via Rank-Asymmetric Velocity Parameterization (Stanford University, research)
- Orthrus: 7.8x Inference Speedup for Qwen3 via Autoregressive-Diffusion KV Sharing (research)
- Causal Forcing++: 2-Step Distillation Enables Real-Time Interactive Video Generation (Tsinghua University, research)
- SANA-WM: Minute-Scale 720p World Modeling on a Single GPU (NVIDIA, research)
- Flow-DPPO: Principled RL Alignment for Flow Matching Image and Video Models (Tencent Hunyuan, research)
- AnyFlow: Any-Step Video Diffusion with On-Policy Flow Map Distillation (MIT / NVIDIA, research)
- Cola DLM: Continuous Latent Diffusion Language Model with Competitive Scaling (research)
- SCAIL-2: End-to-End Character Animation via In-Context Conditioning (Tsinghua University, research)
- Diffusion-Proof: Formal Theorem Proving via Diffusion Language Models (research)
- DreamReasoner-8B: Block-Size Curriculum for Diffusion Reasoning Models (research)