di-zhang-fdu/PPRM-gemma-2-2b-it
Tags: PEFT, Safetensors, llama-factory, lora, Generated from Trainer
arxiv: 2410.02884, 2406.07394
License: other
Branch: main
Path: PPRM-gemma-2-2b-it/checkpoint-1300/latest
Last commit: Di Zhang, "Upload folder using huggingface_hub" (2056078, verified), about 1 year ago
Scan status: Safe
Size: 15 Bytes
global_step1300
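
The `latest` file holds a single 15-byte string, `global_step1300`. This matches the convention DeepSpeed uses for its `latest` marker, which names the subdirectory containing the most recently saved training state. The page does not state which tool wrote the file, so treat that as an assumption. Under that assumption, a minimal sketch of reading such a marker might look like the following; `resolve_latest_tag` is a hypothetical helper name, and the temporary directory only mimics the layout shown here rather than downloading anything from the repository.

```python
from pathlib import Path
import tempfile


def resolve_latest_tag(checkpoint_dir):
    """Return the tag stored in `<checkpoint_dir>/latest`, stripped of whitespace.

    Hypothetical helper: assumes the marker is a plain-text file holding
    only the tag name, as the file on this page does.
    """
    return (Path(checkpoint_dir) / "latest").read_text(encoding="utf-8").strip()


# Build a throwaway directory that mimics checkpoint-1300/latest.
with tempfile.TemporaryDirectory() as tmp:
    ckpt = Path(tmp) / "checkpoint-1300"
    ckpt.mkdir()
    (ckpt / "latest").write_text("global_step1300", encoding="utf-8")

    demo_tag = resolve_latest_tag(ckpt)
    # Assumption: the tagged training state would live in a subdirectory
    # named after the tag; this sketch does not create or check it.
    demo_state_dir = ckpt / demo_tag
    # Size of the marker file as written above, in bytes.
    demo_size = (ckpt / "latest").stat().st_size

print(demo_tag, demo_size)
```

Stripping whitespace keeps the helper tolerant of a trailing newline, although the size reported on this page suggests the file has none.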