view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand 5 days ago • 52
The Bestiary Collection Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 6 items • Updated 22 days ago • 69
GLiNER-PII Collection PII detection models developed in collaboration with Wordcab • 5 items • Updated Sep 24 • 21
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 391
SauerkrautLM-Multilingual-(Reason)-ColBERT Collection SauerkrautLM ColBERT is a suite of Late-Interaction retrieval models built with PyLate’s ColBERT architecture and tuned for seven European languages. • 7 items • Updated Aug 3 • 18
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face +3 Jul 29 • 202
GLM-4.5 Collection GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated Aug 11 • 249
NuExtract-2.0 Collection Models specialized in extracting structured information (JSON) from text, PDFs, scans, spreadsheets, etc. • 15 items • Updated Sep 26 • 26
view article Article Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever. Jul 16 • 144
Qwen3 Quantized Collection Collection of quantized Qwen 3 models from Alibaba Cloud. • 14 items • Updated 28 days ago • 7