There is a newer version of the record available.

Published June 8, 2022 | Version v1
Dataset Open

Datasets used for the performance evaluation of kataegis detection tools

  • 1. Erasmus MC

Description

These two datasets were used for the performance evaluation of katdetectr and other publicly available kataegis detection tools (link to paper, not yet published).

The synthetic data was generated using the generateSyntheticData function from the katdetectr package (https://github.com/ErasmusMC-CCBC/katdetectr).

The scripts that generate the synthetic data, and import and process the Alexandrov et al. dataset can be found on GitHub at: https://github.com/ErasmusMC-CCBC/evaluation_katdetectr.

Files

alexandrov_data_processed_katdetectr_evaluation.csv.zip

Files (264.5 MB)

Additional details

References

  • Alexandrov LB, Nik-Zainal S, Wedge DC, Aparicio SA, Behjati S, Biankin AV, Bignell GR, Bolli N, Borg A, Børresen-Dale AL, Boyault S. Signatures of mutational processes in human cancer. Nature. 2013 Aug;500(7463):415-21.