hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-R1-QwQ-Seed-42-MLR Text Generation • 3B • Updated 7 days ago • 48
hazyresearch/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-LR-Retrained Text Generation • 3B • Updated 4 days ago • 51
Kazuki1450/Qwen2.5-3B-Instruct_lightr1_bsz192_1p0_0p0_1p0_grpo_42_rule Text Generation • 3B • Updated about 21 hours ago • 45
Kazuki1450/Qwen2.5-3B-Instruct_lightr1_1p0_0p0_1p0_grpo_42_Qwen2.5-7B-Instruct Text Generation • 3B • Updated about 22 hours ago • 28
Kazuki1450/Qwen2.5-3B-Instruct_lightr1_1p0_0p0_1p0_grpo_42_Qwen3-4B Text Generation • 3B • Updated about 24 hours ago • 11
Kazuki1450/Qwen2.5-3B-Instruct_lightr1_1p0_0p0_1p0_grpo_42_Qwen3-8B Text Generation • 3B • Updated about 21 hours ago • 28