P1: Mastering Physics Olympiads with Reinforcement Learning Paper • 2511.13612 • Published 23 days ago • 132
BroRL: Scaling Reinforcement Learning via Broadened Exploration Paper • 2510.01180 • Published Oct 1 • 18
Sphere Prover Collection The dataset and ckpt in Sphere-Prover-V1: Training LLM-based Prover for Formal Mathematics via Exploration-based Reinforcement Learning • 10 items • Updated Aug 21
laion/CLIP-ViT-H-14-laion2B-s32B-b79K Zero-Shot Image Classification • 1.0B • Updated Jan 22 • 740k • 425