Collections
Collections including paper arxiv:2512.16093

- ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding
  Paper • 2512.13586 • Published • 88
- TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows
  Paper • 2512.05150 • Published • 74
- TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
  Paper • 2512.16093 • Published • 91

- Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield
  Paper • 2511.22677 • Published • 29
- DiP: Taming Diffusion Models in Pixel Space
  Paper • 2511.18822 • Published • 28
- What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards
  Paper • 2512.00425 • Published • 50
- Learning Eigenstructures of Unstructured Data Manifolds
  Paper • 2512.01103 • Published • 5

- Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone
  Paper • 2512.22615 • Published • 43
- Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
  Paper • 2512.20557 • Published • 49
- TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
  Paper • 2512.16093 • Published • 91

- TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
  Paper • 2512.16093 • Published • 91
- Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
  Paper • 2511.22699 • Published • 223
- DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
  Paper • 2512.16676 • Published • 201
- Sharp Monocular View Synthesis in Less Than a Second
  Paper • 2512.10685 • Published • 25

- ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement Learning
  Paper • 2512.02835 • Published • 9
- Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image
  Paper • 2512.05044 • Published • 16
- Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning
  Paper • 2512.05591 • Published • 16
- SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling
  Paper • 2512.05343 • Published • 24

- Seedance 1.0: Exploring the Boundaries of Video Generation Models
  Paper • 2506.09113 • Published • 105
- Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
  Paper • 2506.08009 • Published • 30
- Seeing Voices: Generating A-Roll Video from Audio with Mirage
  Paper • 2506.08279 • Published • 27
- PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement
  Paper • 2506.07848 • Published • 4