Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
qingyang zhang's picture
4 16 2

qingyang zhang

qingyangzhang
John6666's profile picture Stars321123's profile picture Haitao999's profile picture
·
https://qingyangzhang.github.io

AI & ML interests

LLM Reasoning

Recent Activity

upvoted a paper about 11 hours ago
Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows
authored a paper 10 days ago
P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads
authored a paper 10 days ago
TEMPO: Scaling Test-time Training for Large Reasoning Models
View all activity

Organizations

None yet

commented 2 papers 11 months ago

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19 •
16

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19 •
16
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs