qingyang zhang

qingyangzhang

https://qingyangzhang.github.io

AI & ML interests

LLM Reasoning

Recent Activity

upvoted a paper about 11 hours ago

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

authored a paper 10 days ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

authored a paper 10 days ago

TEMPO: Scaling Test-time Training for Large Reasoning Models

Organizations

None yet

commented 2 papers 11 months ago

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19
