nicholasKluge/Harmless-RewardModel
Tags: Text Classification, Transformers, Safetensors, nicholasKluge/harmless-aira-dataset, Anthropic/hh-rlhf, English, roberta, reward model, alignment, preference model, RLHF, Carbon Emissions
License: apache-2.0
Branch: main
Total size: 1.51 GB
1 contributor
History: 13 commits
Latest commit: Update README.md (3cf2a1e, verified) by nicholasKluge, 6 months ago
File                     Size       Storage  Last commit                                Updated
.gitattributes           1.52 kB    -        initial commit                             over 1 year ago
LICENSE                  10.8 kB    -        Upload LICENSE                             over 1 year ago
README.md                8.84 kB    -        Update README.md                           6 months ago
config.json              789 Bytes  -        Update config.json                         6 months ago
emissions.csv            770 Bytes  -        Upload emissions.csv with huggingface_hub  over 1 year ago
hh_rlhf_eval.parquet     9.41 MB    xet      Upload hh_rlhf_eval.parquet                over 1 year ago
merges.txt               456 kB     -        Upload folder using huggingface_hub        over 1 year ago
model.safetensors        499 MB     xet      Upload folder using huggingface_hub        over 1 year ago
optimizer.pt             997 MB     xet      Upload folder using huggingface_hub        over 1 year ago
rng_state.pth            14.2 kB    xet      Upload folder using huggingface_hub        over 1 year ago
scheduler.pt             1.06 kB    xet      Upload folder using huggingface_hub        over 1 year ago
special_tokens_map.json  280 Bytes  -        Upload folder using huggingface_hub        over 1 year ago
tokenizer.json           2.11 MB    -        Upload folder using huggingface_hub        over 1 year ago
tokenizer_config.json    1.22 kB    -        Upload folder using huggingface_hub        over 1 year ago
trainer_state.json       4.93 kB    -        Upload folder using huggingface_hub        over 1 year ago
training_args.bin        5.11 kB    xet      Upload folder using huggingface_hub        over 1 year ago
vocab.json               798 kB     -        Upload folder using huggingface_hub        over 1 year ago