Yunzhuo Hao

luckychao

·

hychaochao

AI & ML interests

NLP

Recent Activity

upvoted 2 papers 23 days ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published 23 days ago • 132

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Paper • 2511.13704 • Published 23 days ago • 42

upvoted a paper 29 days ago

VideoSSR: Video Self-Supervised Reinforcement Learning

Paper • 2511.06281 • Published Nov 9 • 24

upvoted 2 papers about 1 month ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6 • 208

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published Oct 30 • 81

upvoted a paper about 2 months ago

Spotlight on Token Perception for Multimodal Reinforcement Learning

Paper • 2510.09285 • Published Oct 10 • 36

upvoted 2 papers 3 months ago

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation

Paper • 2509.14760 • Published Sep 18 • 53

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8 • 79

upvoted a paper 4 months ago

Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

Paper • 2508.09834 • Published Aug 13 • 53

upvoted a paper 5 months ago

STITCH: Simultaneous Thinking and Talking with Chunked Reasoning for Spoken Language Models

Paper • 2507.15375 • Published Jul 21 • 30

upvoted 7 papers 6 months ago

Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs

Paper • 2506.19290 • Published Jun 24 • 52

Matrix-Game: Interactive World Foundation Model

Paper • 2506.18701 • Published Jun 23 • 72

SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks

Paper • 2506.10954 • Published Jun 12 • 52

VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models

Paper • 2505.23656 • Published May 29 • 25

Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations

Paper • 2506.04633 • Published Jun 5 • 19

Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning

Paper • 2506.04207 • Published Jun 4 • 48

CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs

Paper • 2505.24120 • Published May 30 • 49

upvoted 3 papers 7 months ago

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Paper • 2505.19641 • Published May 26 • 67

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28 • 131

Skywork Open Reasoner 1 Technical Report

Paper • 2505.22312 • Published May 28 • 54