From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published 17 days ago • 251
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 10 days ago • 234
Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining Paper • 2511.21613 • Published 14 days ago • 2
view article Article Building for an Open Future - our new partnership with Google Cloud 28 days ago • 46
Sample-Efficient Language Modeling with Linear Attention and Lightweight Enhancements Paper • 2511.05560 • Published Nov 4 • 1
Pre-training Dataset Samples Collection A collection of pre-training datasets samples of sizes 10M, 100M and 1B tokens. Ideal for use in quick experimentation and ablations. • 19 items • Updated 29 days ago • 15
BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data Paper • 2510.10159 • Published Oct 11 • 3
Gaperon: A Peppered English-French Generative Language Model Suite Paper • 2510.25771 • Published Oct 29 • 15
gpt-oss-safeguard Collection gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss • 2 items • Updated Oct 29 • 58
Gaperon Collection Our French-English LLM suite (SFT models are coming soon) • 16 items • Updated 8 days ago • 16
Huxley-Gödel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine Paper • 2510.21614 • Published Oct 24 • 22