Published July 25, 2024
| Version v1
Dataset
Open
metabench - Paper Data
Creators
Contributors
Contact person:
Description
Item-wise accuracies in six benchmarks from Open LLM Leaderboard 1 scraped from huggingface.co and used for metabench analyses and construction. Datasets with RMSE's for random benchmark subsets are used as reference in the paper and are included here.
Files
Files
(620.4 MB)
Name | Size | Download all |
---|---|---|
md5:9f2d5d6bbf6cf730494e0c29507850c7
|
620.4 MB | Download |
Additional details
Related works
- Is part of
- Publication: arXiv:2407.12844 (arXiv)
Software
- Repository URL
- https://github.com/adkipnis/metabench