pinned
Running
27
Benchmark Finder
๐
A space to view and inspect all the tasks in lighteval
LLM evaluation
A space to view and inspect all the tasks in lighteval
Display benchmark evaluation data for LLMs
Explore and discover all leaderboards from the HF community
Display and inspect log files
Launch and monitor model evaluation jobs
Generate a command to run model evaluations
Compare tokenization lengths across languages