A Global Reference Balanced Gene Expression Dataset for Acute Lymphoblastic Leukemia Subtypes
Description
This dataset is a global, balanced reference gene expression resource for Acute Lymphoblastic Leukemia (ALL) subtypes. It integrates 13 publicly available GEO microarray datasets into a single harmonized dataset of 258 samples spanning 9 ALL subtypes and normal controls. The dataset includes 54,675 genes and is balanced so that each subtype is fairly represented, which makes it suitable for machine learning, biomarker discovery, and translational leukemia research.
Files provided include both the balanced dataset and the complete dataset with metadata, enabling reproducibility and global benchmarking in leukemia genomics research.
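Because the description emphasizes balance across subtypes, a quick check of the label distribution can be useful before training a model. The following is a minimal sketch, not the authors' procedure: it assumes the per-sample subtype labels have already been read from the metadata into a Python list, and the function name `class_balance` and the labels `subtype_a` and `subtype_b` are hypothetical placeholders, since the actual file layout and label names are not stated here.

```python
from collections import Counter


def class_balance(labels):
    """Count samples per class and report the largest-to-smallest class ratio.

    `labels` is any iterable of per-sample class labels (an assumption:
    the real metadata column holding them is not specified in this record).
    A ratio of 1.0 means every class has the same number of samples.
    """
    counts = Counter(labels)
    if not counts:
        raise ValueError("no labels provided")
    smallest = min(counts.values())
    largest = max(counts.values())
    return counts, largest / smallest


# Placeholder labels for illustration only; these are not values from the dataset.
example_labels = ["subtype_a"] * 3 + ["subtype_b"] * 2
example_counts, example_ratio = class_balance(example_labels)
```

A ratio close to 1.0 indicates that no class dominates; how close is close enough depends on the downstream method.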
Files
Name | Size | Checksum |
---|---|---|
FINAL_balanced_all_metadata_V2.zip | 1.2 GB | md5:912220a13ee2255a005429d07f10f256 |
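The MD5 checksum listed for the archive can be used to confirm that a 1.2 GB download completed without corruption. The sketch below uses only Python's standard `hashlib` module; the helper names `md5_of_file` and `verify_download` are illustrative, and the local path passed to them is whatever location the archive was saved to.

```python
import hashlib

# MD5 value as listed for FINAL_balanced_all_metadata_V2.zip in this record.
EXPECTED_MD5 = "912220a13ee2255a005429d07f10f256"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that a large archive never has to fit in memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()
```

For example, calling `verify_download("FINAL_balanced_all_metadata_V2.zip")` from the download directory returns `True` only if the file matches the listed checksum.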
Additional details
Related works
- Is described by
- Dataset: 10.5281/zenodo.16999485 (DOI)
- Is supplemented by
- https://github.com/AlirezaRahi/A-Global-Reference-Balanced-Gene-Expression-Dataset-for-Acute-Lymphoblastic-Leukemia-Subtypes (URL)
References
- https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE135294
- https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE51866
- https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE19475
- https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE26713
- https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE4698
- https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE13159
- https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE79533
- https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE28497
- https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE60926
- https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE3910