Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zhang Xu's picture
3 4

Zhang Xu

texzhang
·
  • CheungXu

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
upvoted a paper 10 days ago
From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR
upvoted a paper about 2 months ago
Agent Learning via Early Experience
View all activity

Organizations

None yet

Collections 2

LLM
  • On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

    Paper • 2508.05629 • Published Aug 7 • 180
multi-image
  • MANTIS: Interleaved Multi-Image Instruction Tuning

    Paper • 2405.01483 • Published May 2, 2024 • 6
LLM
  • On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

    Paper • 2508.05629 • Published Aug 7 • 180
multi-image
  • MANTIS: Interleaved Multi-Image Instruction Tuning

    Paper • 2405.01483 • Published May 2, 2024 • 6

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs