Published June 5, 2024 | Version v1
Dataset Open

Einsum Benchmark: Enabling the Development of Next-Generation Tensor Execution Engines

Description

Modern artificial intelligence and machine learning workflows rely on efficient tensor libraries. However, tuning tensor libraries without considering the actual problems they are meant to execute can lead to a mismatch between expected performance and the actual performance. Einsum libraries are tuned to efficiently execute tensor expressions with only a few, relatively large, dense, floating-point tensors. But, practical applications of einsum cover a much broader range of tensor expressions than those that can currently be executed efficiently. For this reason, we have created a benchmark dataset that encompasses this broad range of tensor expressions, allowing future implementations of einsum to build upon and be evaluated against. In addition, we also provide generators for einsum expression and converters to einsum expressions in our repository, so that additional data can be generated as needed. The benchmark dataset, the generators and converters are released openly and are publicly available at https://benchmark.einsum.org.

The broader data collection process included contributions from individuals whose data was transformed. We duly acknowledge the following for making their data publicly available:

Moreover, we thank the following authors of open source software used to generated instances:

 

Files

instances.zip

Files (551.8 MB)

Name Size Download all
md5:8b2ec1a812b3fa2d61cc84981d805196
1.0 kB Download
md5:b476f37d39a5b8a978e8230375326eb3
551.8 MB Preview Download

Additional details

Software