Microscopy Imaging Dataset: Trypanosoma brucei Bloodstream Form Classification Using Deep Learning
- 1. 1. Department of Biological Physics, Eötvös University (ELTE), H-1117, Budapest, Hungary
- 2. Gulbenkian Institute for Molecular Medicine, Lisbon, Portugal
Description
This dataset provides a comprehensive collection of microscopic images and associated labels, specifically designed to facilitate the automated classification of Trypanosoma brucei bloodstream forms—slender and stumpy. Accurate differentiation of these life cycle stages is vital for understanding the parasite's biology, transmission dynamics, and adaptation mechanisms in its mammalian host.
Contents:
- Image Data: Microscopic images of T. brucei bloodstream forms captured under standard imaging conditions, encompassing a broad array of image quality, cellular arrangements, and morphological characteristics.
- Label Data: Annotation files for each image, specifying cellular forms as slender or stumpy, essential for supervised machine learning applications.
- Supplementary Files: Additional Excel files providing information on training, testing, and validation splits, alongside test results for model evaluation.
Purpose:
This dataset serves as a valuable resource for researchers in parasitology, machine learning, and computational biology. It supports investigations into the biology and life cycle of T. brucei, while also providing a robust testbed for developing, validating, and benchmarking image processing and classification algorithms tailored to parasite morphology.
Data Collection and Methodology:
The dataset was compiled using advanced deep learning techniques, integrating the Cellpose segmentation algorithm with a custom-trained Xception model optimized for classifying T. brucei forms. The model achieved 97% classification accuracy, demonstrating effective application in handling complex cell images and distinguishing between slender and stumpy forms in challenging imaging conditions.
Usage:
Researchers are encouraged to use this dataset to:
- Analyze and classify the life cycle stages of T. brucei bloodstream forms in microscopic images.
- Develop and test deep learning models for single-cell image segmentation and classification.
- Explore cellular morphology patterns and refine machine learning approaches for other single-cell imaging applications.
Citation:
Please cite the original dataset if you utilize this resource in your research to acknowledge its contribution to the field.
Access and Availability:
This dataset is openly available through Zenodo, enabling researchers to download, explore, and apply it in various fields, from parasitology to advanced computational biology.
Files
Parasite_CLF(V3).ipynb
Files
(595.7 MB)
Name | Size | Download all |
---|---|---|
md5:9763bebb9b7443bc2c84d612308b1fde
|
45.8 kB | Preview Download |
md5:2a3818ba20d418dc56862b98d576b598
|
42.0 MB | Preview Download |
md5:3df5eac8603aa493cf741643aa10f6c0
|
106.0 MB | Preview Download |
md5:763cfe94c2420b1990191c40a80d1f5d
|
342.1 MB | Preview Download |
md5:b5b0a54e2b9a9347132ee6e9ea524f49
|
105.6 MB | Preview Download |
Additional details
Dates
- Available
-
2025-02-23