oguzhanercan 's Collections MultiModal Reasoning
updated
Perception-Aware Policy Optimization for Multimodal Reasoning
Paper
• 2507.06448
• Published
• 48
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based
Reinforcement Learning
Paper
• 2507.05920
• Published
• 12
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility,
Reasoning, and Efficiency
Paper
• 2508.18265
• Published
• 214
Latent Chain-of-Thought for Visual Reasoning
Paper
• 2510.23925
• Published
• 10
Thinking with Video: Video Generation as a Promising Multimodal
Reasoning Paradigm
Paper
• 2511.04570
• Published
• 241
V-Thinker: Interactive Thinking with Images
Paper
• 2511.04460
• Published
• 97
NVIDIA Nemotron Nano V2 VL
Paper
• 2511.03929
• Published
• 30
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
Paper
• 2512.02014
• Published
• 73
Latent Implicit Visual Reasoning
Paper
• 2512.21218
• Published
• 68
Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding
Paper
• 2512.17532
• Published
• 67