LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling Paper • 2511.20785 • Published 12 days ago • 148
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe Paper • 2511.16334 • Published 17 days ago • 91
Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark Paper • 2511.13853 • Published 20 days ago • 34
Depth Anything 3: Recovering the Visual Space from Any Views Paper • 2511.10647 • Published 24 days ago • 92
MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation Paper • 2511.09611 • Published 25 days ago • 68
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published 23 days ago • 158
MiroThinker-v0.1 Collection High performance in deep research and tool use. • 7 items • Updated Sep 8 • 35
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization Paper • 2507.14683 • Published Jul 19 • 134
Disentangling Writer and Character Styles for Handwriting Generation Paper • 2303.14736 • Published Mar 26, 2023 • 3
HiPA: Enabling One-Step Text-to-Image Diffusion Models via High-Frequency-Promoting Adaptation Paper • 2311.18158 • Published Nov 30, 2023