Published July 25, 2024 | Version v1
Dataset Open

metabench - Paper Data

Contributors

Contact person:

Description

Item-wise accuracies in six benchmarks from Open LLM Leaderboard 1 scraped from huggingface.co and used for metabench analyses and construction. Datasets with RMSE's for random benchmark subsets are used as reference in the paper and are included here. 

Files

Files (620.4 MB)

Name Size Download all
md5:9f2d5d6bbf6cf730494e0c29507850c7
620.4 MB Download

Additional details

Related works

Is part of
Publication: arXiv:2407.12844 (arXiv)