Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning (arXiv:2506.01939)
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics (arXiv:2506.01844)
Taming LLMs by Scaling Learning Rates with Gradient Grouping (arXiv:2506.01049)
ARIA: Training Language Agents with Intention-Driven Reward Aggregation (arXiv:2506.00539)
Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control (arXiv:2506.01943)
SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning (arXiv:2506.01713)
AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning (arXiv:2505.24298)
MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning (arXiv:2505.24846)
(arXiv:2506.01928)
Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models (arXiv:2504.17789)
TiDAR: Think in Diffusion, Talk in Autoregression (arXiv:2511.08923)
CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding (arXiv:2405.02384)
A Stable, Fast, and Fully Automatic Learning Algorithm for Predictive Coding Networks (arXiv:2212.00720)
Prodigy: An Expeditiously Adaptive Parameter-Free Learner (arXiv:2306.06101)
Hierarchical Reasoning Model (arXiv:2506.21734)
Why Diffusion Models Don't Memorize: The Role of Implicit Dynamical Regularization in Training (arXiv:2505.17638)
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models (arXiv:2508.06471)
Learning to Optimize: A Primer and A Benchmark (arXiv:2103.12828)