
Yuguang Yao

yaoyugua

AI & ML interests

None yet

Organizations

None yet

Recent Activity

authored a paper 4 days ago

ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents

Paper • 2601.12294 • Published 7 days ago • 16

liked a dataset 10 days ago

THU-KEG/BlendQA

Updated Feb 22, 2025 • 14 • 3

liked a dataset 5 months ago

dnkdnk/CVTG-2K

Updated Apr 1, 2025 • 44 • 5

liked a model 5 months ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18, 2025 • 225k • 2.36k

upvoted 2 papers 6 months ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 158

MCP-RADAR: A Multi-Dimensional Benchmark for Evaluating Tool Use Capabilities in Large Language Models

Paper • 2505.16700 • Published May 22, 2025 • 1

liked a dataset 8 months ago

Team-ACE/ToolACE

Viewer • Updated Sep 4, 2024 • 11.3k • 710 • 161

liked a model 9 months ago

ServiceNow-AI/Apriel-Nemotron-15b-Thinker

Text Generation • 15B • Updated Nov 10, 2025 • 1.49k • 124

liked a dataset 10 months ago

bytedance-research/ToolHop

Updated Aug 26, 2025 • 289 • 18

liked a model 11 months ago

Qwen/QwQ-32B

Text Generation • 33B • Updated Mar 11, 2025 • 72.5k • 2.88k

liked 2 datasets 11 months ago

dgslibisey/MuSiQue_subset

Viewer • Updated Jun 18, 2023 • 1.25k • 9 • 1

dgslibisey/MuSiQue

Viewer • Updated Jun 16, 2023 • 22.4k • 2.42k • 30

liked 6 models 11 months ago

upvoted a collection 11 months ago

MIRage

Collection

Official model collection of paper: Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models • 2 items • Updated Feb 4, 2025 • 1

liked a dataset 11 months ago

JailbreakV-28K/JailBreakV-28k

Viewer • Updated Jul 10, 2024 • 30.3k • 3.37k • 60
