qgallouedec
/

Qwen2.5-0.5B-GRPO-2873

Generated from Trainer

Model card Files Files and versions

Metrics Training metrics Community

Qwen2.5-0.5B-GRPO-2873

18.1 MB

1 contributor

History: 2 commits

qgallouedec's picture

qgallouedec HF Staff

End of training

d60dea3 verified 10 months ago

runs
End of training 10 months ago
.gitattributes

1.57 kB

End of training 10 months ago
README.md

2.22 kB

End of training 10 months ago
adapter_config.json

814 Bytes

End of training 10 months ago
adapter_model.safetensors

2.18 MB
xet

End of training 10 months ago
added_tokens.json

605 Bytes

End of training 10 months ago
merges.txt

1.67 MB

End of training 10 months ago
special_tokens_map.json

616 Bytes

End of training 10 months ago
tokenizer.json

11.4 MB
xet

End of training 10 months ago
tokenizer_config.json

7.29 kB

End of training 10 months ago
training_args.bin
Detected Pickle imports (10)
- "torch.device",
- "trl.trainer.grpo_config.GRPOConfig",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "transformers.training_args.OptimizerNames",
- "transformers.trainer_utils.HubStrategy",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.trainer_utils.SaveStrategy",
- "transformers.trainer_utils.SchedulerType",
- "transformers.trainer_utils.IntervalStrategy",
- "accelerate.state.PartialState"
How to fix it?
5.82 kB
xet

End of training 10 months ago
vocab.json

2.78 MB

End of training 10 months ago