Walkie
Cyrile/dataset-the-stack-v2-dedup-sub • Viewer • Updated Apr 1, 2025 • 82.8M • 2.01k • 6
OpenCoder-LLM/opc-annealing-corpus • Viewer • Updated May 29, 2025 • 15.6M • 1.65k • 43
OpenCoder-LLM/opc-fineweb-math-corpus • Viewer • Updated Nov 24, 2024 • 5.24M • 488 • 30
OpenCoder-LLM/opc-fineweb-code-corpus • Viewer • Updated Nov 24, 2024 • 101M • 3.3k • 51
Overall
Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Paper • 2604.13602 • Published 26 days ago • 32
Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks Paper • 2604.11610 • Published 28 days ago • 7