Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
Haokai Zhao
jz666
Follow
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 2 hours ago
jz666/mistral-ultrafeedback-templated-ppl
published
a dataset
about 2 hours ago
jz666/mistral-ultrafeedback-templated-ppl
updated
a dataset
about 4 hours ago
jz666/llama3-ultrafeedback-templated-ppl-margin-5
View all activity
Organizations
None yet
jz666
's models
16
Sort: Recently updated
jz666/dpo-grad-acc-128-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 21
•
6
jz666/dpo-grad-acc-32-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 21
•
4
jz666/dpo-grad-acc-64-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 21
•
8
jz666/dpo-grad-acc-16-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 21
•
2
jz666/gemma-2-9b-it-dpo-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 20
•
6
jz666/gemma-2-9b-it-simpo-split-10-train_filtered_full
Text Generation
•
9B
•
Updated
Oct 17
•
7
jz666/simpo-train-large-wrong
Text Generation
•
9B
•
Updated
Oct 16
•
10
jz666/simpo-train-filtered-full
Text Generation
•
9B
•
Updated
Oct 14
•
6
jz666/simpo-train-large-correct
Text Generation
•
9B
•
Updated
Oct 14
•
8
jz666/simpo-train-small-wrong
Text Generation
•
9B
•
Updated
Oct 14
•
7
jz666/simpo-train-small-correct
Text Generation
•
9B
•
Updated
Oct 14
•
8
jz666/simpo-train-smallest-30-abs-diff
Text Generation
•
9B
•
Updated
Oct 14
•
6
jz666/simpo-train-largest-30-abs-diff
Text Generation
•
9B
•
Updated
Oct 14
•
9
jz666/simpo-train-largest-30-ppl-chosen
Text Generation
•
9B
•
Updated
Oct 14
•
11
jz666/simpo-train-largest-30-ppl-rejected
Text Generation
•
9B
•
Updated
Oct 14
•
8
jz666/simpo
Text Generation
•
9B
•
Updated
Sep 29
•
6