Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence Paper • 2510.20579 • Published Oct 23 • 55
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper • 2509.26507 • Published Sep 30 • 535
Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model Paper • 2508.13009 • Published Aug 18 • 25
LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos Paper • 2508.14041 • Published Aug 19 • 59
SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass Paper • 2508.15769 • Published Aug 21 • 19
Realistic Evaluation of Model Merging for Compositional Generalization Paper • 2409.18314 • Published Sep 26, 2024
SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation Paper • 2411.04989 • Published Nov 7, 2024 • 15
Pippo: High-Resolution Multi-View Humans from a Single Image Paper • 2502.07785 • Published Feb 11 • 10