Add model card for ExGRPO: Learning to Reason from Experience

#1
by nielsr HF Staff - opened

This PR adds a comprehensive model card for the ExGRPO model.
It includes:

These additions will significantly improve the discoverability and usability of this model on the Hugging Face Hub. Please review and merge.

rzzhan changed pull request status to merged

Sign up or log in to comment