Small multilingual LLMs for annotating and curating LLM training data.
AI & ML interests
Open, Multilingual, European, Generative, Foundational LLM
Recent Activity
Organization Card
Europe's leading AI companies and research institutions combine their forces and expertise to develop next-generation open-source language models in an unprecedented collaboration to advance European AI capabilities, the OpenEuroLLM project
models 13
openeurollm/OLMo-3-7B-Think-SFT
Updated
openeurollm/OLMo-3-7B-Instruct-SFT
Updated
openeurollm/tokenizer-256k
Updated • 1
openeurollm/tokenizer-128k
Updated
openeurollm/datamix-2b-80-20
Updated • 96
openeurollm/datamix-2b-50-50
Updated
openeurollm/datamix-2b-60-40
Updated • 114
openeurollm/datamix-2b-70-30
Updated • 31
openeurollm/datamix-2b-90-10
Updated • 32
openeurollm/datamix-2b-en-80pct-DPO-HelpSteer3-16k
Text Generation • 2B • Updated
datasets 10
openeurollm/propella-annotations
Viewer • Updated • 3.17B • 4.57k • 14
openeurollm/dolci-think-sft-tokenized
Updated • 24
openeurollm/dolci-instruct-sft-tokenized
Updated • 20
openeurollm/evaluation_singularity_images
Updated • 186
openeurollm/battle-annotations
Viewer • Updated • 163 • 24
openeurollm/contaminated-documents
Viewer • Updated • 40.3k • 17
openeurollm/ArenaHard-EU-v0-bis
Viewer • Updated • 30 • 13
openeurollm/ArenaHard-EU-v0
Viewer • Updated • 360 • 128
openeurollm/nemotron-cc-10K-sample-translated-judged
Viewer • Updated • 9.07M • 13
openeurollm/nemotron-cc-10K-sample-translated
Viewer • Updated • 450k • 116