OpenEvals

community

AI & ML interests

LLM evaluation

Recent Activity

SaylorTwift updated a Space about 11 hours ago

OpenEvals/open_benchmark_index

SaylorTwift new activity 30 days ago

OpenEvals/MuSR:[bot] Conversion to Parquet

SaylorTwift updated a dataset about 1 month ago

OpenEvals/MuSR

OpenEvals's Spaces (9)

Benchmark Finder

📚

A space to view and inspect all the tasks in lighteval

244

Evaluation Guidebook

📝

Display benchmark evaluation data for LLMs

127

Find a leaderboard

🔍

Explore and discover all leaderboards from the HF community

README

⚖

Aa Omniscience

🐠

Display and inspect log files

InferenceProviderTestingBackend

📈

Launch and monitor model evaluation jobs

Evals

🐨

Run your LLM evaluations on the hub

🐢

Generate a command to run model evaluations

Tokenizers Languages

🐠

Compare tokenization lengths across languages