Yanjun Zhao
yanjunzhao97
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
10 days ago
Latent Collaboration in Multi-Agent Systems
upvoted
a
paper
about 2 months ago
RiskPO: Risk-based Policy Optimization via Verifiable Reward for LLM
Post-Training
upvoted
a
paper
about 2 months ago
Demystifying Reinforcement Learning in Agentic Reasoning
Organizations
None yet