Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Open Language Data Initiative

community
https://oldi.org/
openlanguagedata
Activity Feed

AI & ML interests

Multilingual NLP, underserved languages

Recent Activity

cointegrated  updated a collection 10 days ago
OLDI and friends
cointegrated  updated a dataset 25 days ago
openlanguagedata/flores_plus
cointegrated  new activity 25 days ago
openlanguagedata/flores_plus:Add Khakas data (kjh_Cyrl)
View all activity

Laurie Burchell's profile pictureJean's profile pictureSkyler Wang's profile pictureDavid Dale's profile pictureIsaac Caswell's profile picture

openlanguagedata 's collections 1

OLDI and friends
This collection groups the datasets that have been featured as part of WMT’s Open Language Data Initiative shared task.
  • openlanguagedata/flores_plus

    Viewer • Updated 25 days ago • 893k • 21.4k • 120
  • openlanguagedata/oldi_seed

    Viewer • Updated 27 days ago • 564k • 1.13k • 11
  • google/smol

    Viewer • Updated Oct 31, 2025 • 798k • 1.84k • 93
  • google/wmt24pp

    Viewer • Updated Jan 22 • 54.9k • 5.04k • 86
OLDI and friends
This collection groups the datasets that have been featured as part of WMT’s Open Language Data Initiative shared task.
  • openlanguagedata/flores_plus

    Viewer • Updated 25 days ago • 893k • 21.4k • 120
  • openlanguagedata/oldi_seed

    Viewer • Updated 27 days ago • 564k • 1.13k • 11
  • google/smol

    Viewer • Updated Oct 31, 2025 • 798k • 1.84k • 93
  • google/wmt24pp

    Viewer • Updated Jan 22 • 54.9k • 5.04k • 86
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs