Automated cell annotation in scRNA-seq data using unique marker gene sets
Description
Single-cell RNA sequencing has revolutionized the study of cellular heterogeneity, yet accurate cell type annotation remains a significant challenge. Inconsistent labels, technological variability, and limitations in transferring annotations from reference datasets hinder precise annotation. This study presents a novel approach for accurate cell type annotation in scRNA-seq data using unique marker gene sets. By manually curating cell type names and markers from 280 publications, we verified marker expression profiles across these datasets and unified nomenclatures to consistently identify 166 cell types and subtypes. Our customized algorithm, which builds on the AUCell method, achieves accurate cell labeling at single-cell resolution and surpasses the performance of reference-based tools like Azimuth, especially in distinguishing closely related subtypes. To enhance accessibility and practical utility for researchers, we have also developed a user-friendly application that automates the cell typing process, enabling efficient verification and supporting comprehensive downstream analyses. The desktop application can be accessed at https://omnibusx.com.
Files
data.zip
Files
(6.4 GB)
Name | Size | Download all |
---|---|---|
md5:e1b1558f22283f1e0162e11f9e66847f
|
6.4 GB | Preview Download |
Additional details
Software
- Repository URL
- https://github.com/OmnibusXLab/celltype-prediction-benchmark
- Programming language
- Python
- Development Status
- Active