C2: Scalable Rubric-Augmented Reward Modeling from Binary Preferences Paper • 2604.13618 • Published 14 days ago • 4
C2: Scalable Rubric-Augmented Reward Modeling from Binary Preferences Paper • 2604.13618 • Published 14 days ago • 4