Zhang Xu's picture

4 4

Zhang Xu

texzhang

·

CheungXu

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

upvoted a paper 25 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

upvoted a paper about 1 month ago

From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR

View all activity

Organizations

None yet

texzhang 's models

None public yet