On the Optimal Reasoning Length for RL-Trained Language Models Paper • 2602.09591 • Published Feb 10 • 5
mmnga-o/NVIDIA-Nemotron-Nano-9B-v2-Japanese-gguf Text Generation • 9B • Updated 24 days ago • 14.6k • 49
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 306