A collection of multiple benchmarks for evaluating large reasoning models
datasets-and-models
non-profit
models 94
guanning-ai/Qwen3_1.7B_PRM_chenwu-dapo14k-hard06_4gpu
guanning-ai/Qwen2.5_Math_1.5B_PRM_chenwu-dapo14k-hard04_4gpu
guanning-ai/Qwen3_1.7B_PRM_chenwu-dapo14k-hard04_4gpu
guanning-ai/Qwen3_1.7B_PRM_chenwu-dapo14k-hard02_4gpu
guanning-ai/Qwen3_1.7B_PRM_chenwu-dapo14k_4gpu
guanning-ai/Qwen3_1.7B_PRM_chenwu-dapo14k
guanning-ai/Qwen3_1.7B_MMLU_Pro_4gpu
guanning-ai/Qwen3_1.7B_SciQA_4gpu_v2
guanning-ai/Qwen3_1.7B_SciQA_4gpu_short
guanning-ai/addition-random-init-seed1950 • 250k • 274
datasets 142
guanning-ai/addition-data • 70M • 8
guanning-ai/sciknoweval_l3 • 4.14k • 20
guanning-ai/dapo17k_splited • 17.2k • 11
guanning-ai/hmmt2025feb • 30 • 8
guanning-ai/gsm8k-platinum • 1.21k • 17
guanning-ai/math500_level5 • 134 • 5
guanning-ai/math500_level4 • 128 • 5
guanning-ai/math500_level3 • 105 • 4
guanning-ai/math500_level2 • 90 • 4
guanning-ai/math500_level1 • 43 • 4