Gemma 4 Assistant GGUF Collection Gemma 4 MTP assistant drafters as GGUF (F16/Q8_0/Q5_K_M/Q4_K_M/Q4_K_S). Speculative-decoding heads for the atomic-llama-cpp-turboquant fork. • 4 items • Updated 5 days ago • 8
Granite 4.1 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 6 items • Updated 13 days ago • 49
Gemma 4 Collection Gemma 4 is Google's new model family, including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated 20 days ago • 177
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 18 items • Updated 3 days ago • 286
Mistral Small 4 Collection A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated Mar 16 • 73
200+ Roleplay, Creative Writing, Uncensored, NSFW models. Collection Oldest models are listed first, with the newest models at the bottom of the page. Most repos have full examples, instructions, best settings, and so on. • 276 items • Updated 1 day ago • 729
Ministral 3 Collection Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 36 items • Updated 20 days ago • 35
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 166
The Bestiary Collection Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 6 items • Updated Nov 16, 2025 • 113