Published January 1, 2023 | Version v1
Dataset Open

A large publicly available single-cell peripheral blood dataset (MLL23)

  • 1. Roche
  • 2. ROR icon Helmholtz Zentrum München
  • 3. TUM
  • 4. ROR icon Ludwig-Maximilians-Universität München
  • 5. ROR icon Deutschen Konsortium für Translationale Krebsforschung
  • 6. ROR icon German Cancer Research Center
  • 7. ROR icon Munich Leukemia Laboratory (Germany)
  • 8. Helmholtz Zentrum München - German Research Center for Environmental Health

Description

Distinguishing cell types in peripheral blood smears is critical for differential diagnosis of blood diseases, such as leukemia subtypes. Machine learning can assist physicians in automating cell classification. Still, the generalizability of deep neural networks remains challenging, particularly concerning domain shifts emerging from variations in, e.g., patient cohorts, staining protocols, scanning procedures, and image resolution. We introduce a large, publicly available, fully annotated peripheral blood dataset comprising over 40,000 single-cell images classified into 18 classes by cytomorphology experts.

In the group of lymphoid cells, there are mature ‘typical lymphocytes’ (number of single-cell images = 5,532) and atypical lymphocytes like plasma cells (1,658), large granular lymphocytes (1,849), reactive lymphocytes (33), hairy cells (3,265) and other neoplastic lymphocytes (180), as well as smudge cells (988). In comparison, the group of myeloid cells is divided into mature cells like band neutrophil granulocytes (687), segmented neutrophil granulocytes (7,170), eosinophil granulocytes (2,448), basophil granulocytes (616), monocytes (2510), and immature cells like myeloblasts (8,606), metamyelocytes (483), promyelocytes (745), myelocytes (747), and atypical promyelocytes (2,033). Lastly, normoblasts (2071) are also present in the dataset. The cell types occur with specific frequencies in the peripheral blood in healthy and pathological patients.

Files

basophil.zip

Files (9.7 GB)

Name Size Download all
md5:7bb731078be264c977fa72230c49c25c
146.6 MB Preview Download
md5:1d8a11afcf90a618c80b41510620f0f7
559.7 MB Preview Download
md5:0ef69f54b296ae09b42bef4a5e8e74d5
739.9 MB Preview Download
md5:3dc47820ee9805cfa6a20acc3436e282
1.3 GB Preview Download
md5:e4ce87d300fa95b47e0e3d224fe55107
425.6 MB Preview Download
md5:c479f033de12ec2ad7e7335b2b2085c2
45.6 MB Preview Download
md5:ef3daecf3e8cba5b2110ea3fbf4e8870
7.7 MB Preview Download
md5:92f7f022370754abce0eb2bcdacc196d
107.3 MB Preview Download
md5:ec8a85d1d06889a00e9edb7f3bdde365
615.1 MB Preview Download
md5:8cfc239610093f92d374831549895b04
2.1 GB Preview Download
md5:92e5f6dac522c3deabd55f8be5f634c4
176.4 MB Preview Download
md5:c8d94d9bbc4eb2d05ea5ea949680a41f
158.7 MB Preview Download
md5:0962400d8fe56484d6b4f233b1532af2
1.8 GB Preview Download
md5:1ab785e0c9b7529e995551f37dd2dab5
374.9 MB Preview Download
md5:00ef51b611352ea25e5f6cb9bc538019
352.7 MB Preview Download
md5:b8d985eaed723e8c5f484d3f7ad3d134
183.5 MB Preview Download
md5:7b1ca8c1fbb13e67ee1f246dd3dc9538
475.5 MB Preview Download
md5:4642b73dd940cb69f7f0b3991756cba2
229.2 MB Preview Download

Additional details

Funding

European Commission
CompHematoPathology - Computational Hematopathology for Improved Diagnostics 866411