Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

nicholasKluge
/
Harmless-RewardModel

Text Classification
Transformers
Safetensors
English
roberta
reward model
alignment
preference model
RLHF
Model card Files Files and versions
xet
Community
Harmless-RewardModel
1.51 GB
  • 1 contributor
History: 13 commits
nicholasKluge's picture
nicholasKluge
Update README.md
3cf2a1e verified 6 months ago
  • .gitattributes
    1.52 kB
    initial commit over 1 year ago
  • LICENSE
    10.8 kB
    Upload LICENSE over 1 year ago
  • README.md
    8.84 kB
    Update README.md 6 months ago
  • config.json
    789 Bytes
    Update config.json 6 months ago
  • emissions.csv
    770 Bytes
    Upload emissions.csv with huggingface_hub over 1 year ago
  • hh_rlhf_eval.parquet
    9.41 MB
    xet
    Upload hh_rlhf_eval.parquet over 1 year ago
  • merges.txt
    456 kB
    Upload folder using huggingface_hub over 1 year ago
  • model.safetensors
    499 MB
    xet
    Upload folder using huggingface_hub over 1 year ago
  • optimizer.pt
    997 MB
    xet
    Upload folder using huggingface_hub over 1 year ago
  • rng_state.pth
    14.2 kB
    xet
    Upload folder using huggingface_hub over 1 year ago
  • scheduler.pt
    1.06 kB
    xet
    Upload folder using huggingface_hub over 1 year ago
  • special_tokens_map.json
    280 Bytes
    Upload folder using huggingface_hub over 1 year ago
  • tokenizer.json
    2.11 MB
    Upload folder using huggingface_hub over 1 year ago
  • tokenizer_config.json
    1.22 kB
    Upload folder using huggingface_hub over 1 year ago
  • trainer_state.json
    4.93 kB
    Upload folder using huggingface_hub over 1 year ago
  • training_args.bin
    5.11 kB
    xet
    Upload folder using huggingface_hub over 1 year ago
  • vocab.json
    798 kB
    Upload folder using huggingface_hub over 1 year ago