RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation • Paper • 2509.15212 • Published Sep 18 • 21
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding • Paper • 2501.13106 • Published Jan 22 • 90
Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding • Paper • 2311.16922 • Published Nov 28, 2023 • 1