HUME: Measuring the Human-Model Performance Gap in Text Embedding Task
Paper • 2510.10062 • Published • 10
The HUME benchmark is designed to evaluate the performance of text embedding models and humans on a comparable set of tasks. This captures areas wh...