Memorization Dynamics in Knowledge Distillation for Language Models Paper • 2601.15394 • Published 14 days ago • 2
Prat 9B Collection Prat 9B model for Norwegian TTS, currently in preview • 2 items • Updated 3 days ago • 1
FP8 quants Collection A collection of my FP8 quants for models missing this. • 2 items • Updated 15 days ago • 1
Janus Collection Janus is a novel autoregressive framework that unifies multimodal understanding and generation. • 8 items • Updated Nov 27, 2025 • 21
Co-PatcheR Collection Co-PatcheR: Collaborative Software Patching with Component(s)-specific Small Reasoning Models • 3 items • Updated May 29, 2025 • 1
VulnLLM-R Collection Data and model for VulnLLM-R: Specialized Reasoning LLM with Agent Scaffold for Vulnerability Detection • 9 items • Updated Dec 17, 2025 • 8
Definition modeling Collection Models to generate contextualized word definitions • 22 items • Updated Nov 4, 2025 • 1
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 149