sugatoray
's Collections
Papers-LLMEval
updated
Latxa: An Open Language Model and Evaluation Suite for Basque
Paper
•
2403.20266
•
Published
•
3
TrustLLM: Trustworthiness in Large Language Models
Paper
•
2401.05561
•
Published
•
69
Prometheus 2: An Open Source Language Model Specialized in Evaluating
Other Language Models
Paper
•
2405.01535
•
Published
•
123
Beyond Scaling Laws: Understanding Transformer Performance with
Associative Memory
Paper
•
2405.08707
•
Published
•
34
tinyBenchmarks: evaluating LLMs with fewer examples
Paper
•
2402.14992
•
Published
•
17
meta-llama/Llama-3.3-70B-Instruct-evals
Viewer
•
Updated
•
41.3k
•
89
•
41
RUC-NLPIR/OmniEval-HallucinationEvaluator
Text Generation
•
Updated
•
1
Viewer
•
Updated
•
92
•
1.2k
•
24
Viewer
•
Updated
•
17.6k
•
444k
•
1k
Preview
•
Updated
•
37
•
4
KRLabsOrg/lettucedect-base-modernbert-en-v1
Token Classification
•
0.1B
•
Updated
•
1.77k
•
17
Viewer
•
Updated
•
269
•
1.18k
•
47