Forcing-KV: Hybrid KV Cache Compression for Efficient Autoregressive Video Diffusion Models Paper • 2605.09681 • Published 6 days ago • 7
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published 2 days ago • 53
Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video Paper • 2605.15182 • Published 2 days ago • 34
Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation Paper • 2605.15141 • Published 2 days ago • 76
Flow-OPD: On-Policy Distillation for Flow Matching Models Paper • 2605.08063 • Published 8 days ago • 93
LTX-2.3 Collection LTX-2.3 base models, quantized models and accompanying LoRAs and IC-LoRAs • 10 items • Updated 5 days ago • 49
Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory Paper • 2604.08995 • Published Apr 10 • 50
Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision Paper • 2604.04934 • Published Apr 6 • 46
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding Paper • 2604.05015 • Published Apr 6 • 235
Article Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines +2 YiYiXu, OzzyGT, dn6, sayakpaul • Mar 5 • 51
JavisDiT-v1.0 Collection Unified Modeling and Optimization for Joint Audio-Video Generation • 2 items • Updated Feb 26 • 1
JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation Paper • 2602.19163 • Published Feb 22 • 14
SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model Paper • 2602.21818 • Published Feb 25 • 55
DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation Paper • 2602.12160 • Published Feb 12 • 38