Zhang Xu
texzhang
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
11 days ago
On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models
upvoted
a
paper
25 days ago
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
upvoted
a
paper
about 1 month ago
From Trial-and-Error to Improvement: A Systematic Analysis of LLM
Exploration Mechanisms in RLVR
Organizations
None yet