Published December 5, 2020 | Version v1
Dataset Open

Uniprot datasets with variable patch sizes for testing taxonomic classification

  • 1. Center for Bio-Medical image and Information processing (CBMI), HTW University of Applied Sciences, Berlin, Germany

Description

These datasets can be used to test how the taxonomic classification model deposited at https://zenodo.org/record/4306499, trained on the data deposited at https://zenodo.org/record/4306240, performs on inputs of different patch sizes:

  • 300 ± 10 bases
  • 300 ± 25 bases
  • 300 ± 50 bases

These are the same sequences as in the test dataset mentioned above, but sampled at varying lengths.
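To illustrate what "300 ± 10 bases" could mean in practice, here is a minimal sketch of variable-length patch sampling. It is not the authors' sampling code: the function name `sample_patch`, the uniform choice of length, and the uniform choice of start position are all assumptions made for illustration.

```python
import random


def sample_patch(sequence, center=300, spread=10, rng=None):
    """Return a random contiguous patch of `sequence`.

    Assumption (not from the record): the patch length is drawn uniformly
    from [center - spread, center + spread] and the start position is
    drawn uniformly from all positions where the patch still fits.
    """
    rng = rng or random.Random()
    length = rng.randint(center - spread, center + spread)
    # Sequences shorter than the drawn length are returned whole.
    length = min(length, len(sequence))
    start = rng.randint(0, len(sequence) - length)
    return sequence[start:start + length]
```

Calling `sample_patch(seq, spread=25)` on a sufficiently long sequence would then yield a patch of between 275 and 325 characters, which corresponds to the second of the three variants listed above.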

Notes

The authors acknowledge financial support from the Federal Ministry of Education and Research of Germany (BMBF) in the project deep.Health (project number 13FH770IX6).

Files

Files (3.4 MB)

md5:8aa4f75d0aae3abdfd073e7c23260d23 (3.4 MB)
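The MD5 checksum listed above can be used to confirm that a download is intact. The following is a minimal sketch using Python's standard `hashlib` module. The helper names `md5_of_file` and `verify_download` are hypothetical, and the path of the downloaded file must be supplied by the user, since the file name is not shown in this record.

```python
import hashlib

# Checksum as listed in the Files section of this record.
EXPECTED_MD5 = "8aa4f75d0aae3abdfd073e7c23260d23"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file at `path` matches the expected checksum."""
    return md5_of_file(path) == expected
```

If `verify_download` returns `False`, the file is likely incomplete or corrupted and should be downloaded again.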