SURESHBEEKHANI/Gemma_2B_Medical_ORPO_RLHF_Fine_Tuning • Question Answering • 3B • Updated Feb 3, 2025 • 4 • 1
SURESHBEEKHANI/Deep-seek-R1-Medical-reasoning-SFT • Text Generation • 8B • Updated Jan 30, 2025 • 219 • 1