Add model card for ExGRPO: Learning to Reason from Experience

by nielsr HF Staff - opened Oct 4, 2025

←

nielsr

Oct 4, 2025

This PR adds a comprehensive model card for the ExGRPO model.
It includes:

Relevant metadata (license: apache-2.0, library_name: transformers, pipeline_tag: text-generation).
A link to the paper: ExGRPO: Learning to Reason from Experience.
A link to the official GitHub repository: https://github.com/ElliottYan/LUFFY/tree/main/ExGRPO.
An overview of the model, including key highlights and an illustrative image from the GitHub README.
A link to the Hugging Face Collection associated with ExGRPO.
The BibTeX citation for the paper.

These additions will significantly improve the discoverability and usability of this model on the Hugging Face Hub. Please review and merge.

rzzhan changed pull request status to merged Oct 24, 2025

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment