InfiX-ai/InfiAlign-Qwen-7B-DPO
Text Generation
Transformers
Safetensors
English
qwen2
large-language-models
DPO
direct-preference-optimization
reasoning
long-CoT
conversational
text-generation-inference
arxiv: 2508.05496
License: apache-2.0
main
InfiAlign-Qwen-7B-DPO
Commit History
Update README.md (616560d, verified): sslu committed on Aug 11
Update README.md (43f1c77, verified): sslu committed on Aug 11
Update tokenizer_config.json (fb5d60f): sslu committed on Aug 9
Update README.md (bd48c6e, verified): sslu committed on Aug 9
Update README.md (b0ede44, verified): sslu committed on Aug 9
Update README.md (3ab6ff1, verified): sslu committed on Aug 5
Update README.md (120d8f6, verified): sslu committed on Aug 5
Update README (7e812d3): sslu committed on Aug 5
Upload model checkpoint (37a0b8d): sslu committed on Aug 5
Update README.md (0af9f8f, verified): sugavvvv committed on Jul 28
update dpo-version infi-align using new template (99b3f6e, verified): sugavvvv committed on Jul 28
Update README.md (ee522d1, verified): sslu committed on Jul 28
Update README.md (a4d48e3, verified): sslu committed on Jul 28
Update README.md (9dbae95, verified): sslu committed on Jul 28
Update README.md (60bcf9d, verified): sslu committed on Jul 28
Update README.md (2dd2b57, verified): sslu committed on Jul 28
initial commit (4f06235, verified): baicaihaochi121 committed on Jul 27