Published December 10, 2024 | Version v1
Dataset Open

Evaluation Dataset A and B for ChatTS

Creators

Description

Evaluation Datasets for ChatTS
  • We provide the two evaluation datasets we collected, as mentioned in the paper. Each dataset sample contains the following fields: `timeseries`, `question`, `answer` (standard answers in text format, for reference only), `attributes` (structured labels used for result evaluation), and `ability_types` (the tasks included in the question). Please note that, to reduce the evaluation cost, we have merged different questions for the same time series into a single `question`, using numbering to distinguish between the different questions. Therefore, the actual number of questions in the evaluation dataset may be greater than the number of `timeseries` entries. Additionally, please note that some tasks in inductive reasoning and alignment are grouped into the same question, as the inductive reasoning tasks involve explaining the physical meanings of time series attributes.
  • The `MCQ2` dataset is a third-party open-source dataset, we do not provide it in this repository. Please download it directly via https://github.com/behavioral-data/TSandLanguage.

Data Source

Dataset A includes real-world time series data collected from the following sources:

  • NAB (https://github.com/numenta/NAB)
  • Weather (https://www.bgc-jena.mpg.de/wetter/)
  • Oracle (https://zenodo.org/records/6955909)
  • AIOps (https://github.com/netmanaiops/kpi-anomaly-detection)
License
Our datasets uses Creative Commons Attribution 4.0 International License (CC BY 4.0). If you use this dataset, you must:
1. Give Attribution: When using the dataset, clearly mention the dataset name and the sources NAB, Weather, Oracle, and AIOps.
2. Share Alike: If you make something new based on this dataset, share it under the same CC BY 4.0 license.
3. Follow Source Restrictions: If any source dataset has special rules (like some parts of the Weather dataset under CC-BY-4.0), follow those too.

By using this dataset, you agree to follow all laws and the license terms of each source dataset and this overall license.

Files

README.md

Files (45.9 MB)

Name Size Download all
md5:1978d2ae5f2760ef944477fdcab223b2
7.4 MB Preview Download
md5:ffbd572226837c5526dfcbc40c8f156a
38.5 MB Preview Download
md5:b754f438424544d664819a1c5fd324f8
2.1 kB Preview Download