arxiv:2502.18356
Kang
JaxonK
AI & ML interests
RL, LLMs, Model generalization
Recent Activity
published
a model
about 7 hours ago
JaxonK/cua-reward-7b-sft-instruct_2
updated
a model
about 24 hours ago
JaxonK/cua-reward-7b-sft-instruct_2