Published June 14, 2024 | Version v1
Dataset Open

scTransformers - Dataset : Cross Tissue Immune Cells

  • Centre d'Immunologie de Marseille-Luminy

Description

Annotating cell types in single-cell RNA-seq (scRNA-seq) data is a complex, uncertain and time-consuming task that requires several methods and tools to annotate cells accurately and efficiently. Numerous tools and scientific articles have emerged over the years to address these difficulties. The rise of artificial intelligence (notably through ChatGPT) has also reached the scientific world, bringing novelty and innovation to existing techniques and tools. These tools need to be tested to verify their effectiveness. In this project, two scRNA-seq cell annotation tools, scBERT and scGPT, are of interest to CB2M because of their potential to reduce the uncertainties and problems mentioned above. Through various analyses, including cross-validation and the use of multiple qualitative and numerical indicators, we examine how effectively these tools annotate cells from scRNA-seq data.

Provided files:

  • Human_Thymus_Development_Atlas_reference.tar.gz: Cell atlas of human thymic development, built from 15 embryonic and fetal thymuses covering stages of thymic development from 7 to 17 post-conceptional weeks (PCW), and 9 postnatal thymuses from human pediatric and adult samples. It contains 255,901 cells, 32,922 genes and 33 different cell types.
  • Human_Thymus_Development_Atlas_output.tar.gz: All analysis output files
  • Human_Thymus_Development_Atlas_container.tar.gz: Docker image and Singularity images used for the analysis
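
Because the archives are large (the full record is 33.8 GB), it can be useful to look at the contents of a downloaded archive before extracting it. The following is a minimal sketch using Python's standard `tarfile` module. The function name `list_archive_members` and the `limit` parameter are illustrative assumptions, and the internal layout of the archives is not documented here, so the output is whatever the archive actually contains.

```python
import tarfile


def list_archive_members(archive_path, limit=20):
    """Return up to `limit` member names from a .tar.gz archive.

    Members are read sequentially from the archive; nothing is
    extracted to disk.
    """
    names = []
    with tarfile.open(archive_path, mode="r:gz") as archive:
        for member in archive:
            names.append(member.name)
            if len(names) >= limit:
                break
    return names
```

For example, `list_archive_members("Human_Thymus_Development_Atlas_output.tar.gz")` would return the first 20 entries, assuming the file has been downloaded to the current working directory.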

See https://github.com/CIML-bioinformatic/CB2M_scTransformers for more details.

Files (33.8 GB)

MD5 checksum                              Size
md5:f830b7e8ee0963a6ea2362cf170c2d0b      21.5 GB
md5:2e50c1458ad53602cc8f7a659a7bf68e      9.5 GB
md5:1a968b4b34e91f42c531ee38c3c64655      2.9 GB
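
The MD5 checksums above can be used to confirm that a download completed without corruption. The sketch below uses Python's standard `hashlib` module and reads the file in chunks so that a multi-gigabyte archive is never loaded into memory at once. The listing does not say which checksum belongs to which archive, so the check only tests whether a file's digest is one of the three listed values. The names `md5_of_file`, `is_listed_checksum` and `LISTED_MD5` are illustrative assumptions.

```python
import hashlib

# Checksums copied from the file listing of this record.
LISTED_MD5 = {
    "md5:f830b7e8ee0963a6ea2362cf170c2d0b",
    "md5:2e50c1458ad53602cc8f7a659a7bf68e",
    "md5:1a968b4b34e91f42c531ee38c3c64655",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_listed_checksum(path, listed=LISTED_MD5):
    """Return True if the file's MD5 matches one of the listed checksums."""
    # Strip the "md5:" prefix used in the listing before comparing.
    expected = {entry.split(":", 1)[-1].strip().lower() for entry in listed}
    return md5_of_file(path) in expected
```

A `False` result for a downloaded archive suggests a truncated or corrupted download, and the file should be fetched again.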

Additional details

Software

Repository URL
https://github.com/CIML-bioinformatic/CB2M_scTransformers
Programming language
Python