Published January 10, 2025 | Version v2.5
Dataset Open

TCR-DeepInsight Reference Datasets

Creators

  • 1. Zhejiang University

Description

Reference single-cell TCR immune profiling datasets for the TCR-DeepInsight analysis. We have excluded two controlled-access datasets (1) AML dataset from Abbas et al. 2021 (EGAS00001004894) and (2) Kawasaki disease dataset from Wang et al., 2021 (OEP001162).

  • human_gex_reference_v2.h5ad: Processed H5AD file for transcriptome features from the single-cell TCR immune profiling datasets. anndata.read_h5ad
  • human_tcr_reference_v2.h5ad: Processed H5AD file for unique TCR clonotypes single-cell TCR immune profiling dataset. anndata.read_h5ad
  • Yi_2023_Ankylosing_Spondylitis.h5ad: Processed H5AD file from Yi et al., 2023 (GSE216885). anndata.read_h5ad
  • GSE272993_cd8_nn_labeled_FINAL.fl_tcr.match_v2_5.transfered.h5ad: Processed H5AD file from Wang et al., 2024 (GSE272993). anndata.read_h5ad

  • human_bulk_tcr_reference.parquet: PARQUET file for bulk TCRβ sequencing. pandas.read_parquet
    • human_bulk_tcr_reference.cd4.parquet: PARQUET file for bulk TCRβ sequencing for CD4-sorted T cells
      human_bulk_tcr_reference.mait.parquet: PARQUET file for bulk TCRβ sequencing for MAIT cells
      human_bulk_tcr_reference.treg.parquet: PARQUET file for bulk TCRβ sequencing for Treg cells
      human_bulk_tcr_reference.cd8.parquet: PARQUET file for bulk TCRβ sequencing for CD8-sorted T cells

  • Yi_2023_Ankylosing_Spondylitis.h5ad. anndata.read_h5ad
    Processed dataset from K. Yi et al. Analysis of Single‐Cell Transcriptome and Surface Protein Expression in Ankylosing Spondylitis Identifies OX40 ‐Positive and Glucocorticoid‐Induced Tumor Necrosis Factor Receptor–Positive Pathogenic Th17 Cells. Arthritis & Rheumatology 75, 1176–1186 (2023).
  • GSE272993_cd8_nn_labeled_FINAL.fl_tcr.match_v2_5.transfered.h5ad. anndata.read_h5ad
    Processed dataset from K. Wang et al. Combination anti-PD-1 and anti-CTLA-4 therapy generates waves of clonal responses that include progenitor-exhausted CD8+ T cells. Cancer Cell, S1535610824003064 (2024).
  • human_gex_reference_v2.scatlasvae.ckpt: pretrained weight state dict from scAtlasVAE model for human_gex_reference_v2.h5ad. torch.load
  • human_bert_pseudosequence.tcr_v2.ckpt: pretrained weight state dict from BERT for human_tcr_reference_v2.h5ad. torch.load
  • human_bert_pseudosequence_pca.tcr_v2.pkl: pretrained PCA weight for human_tcr_reference_v2.h5ad. pickle.load

Files

Files (7.3 GB)

Name Size Download all
md5:998a7a0391766468f999c2a17c318442
456.0 MB Download
md5:45536cc1555f78790a0f8c4ab136dc5a
11.3 MB Download
md5:8f6c8fc8f9cead4e7e5e087baf56aa0d
41.1 kB Download
md5:41b2490273f62e9f080450e1494ec738
131.5 MB Download
md5:72841609f77c9995e866a30bb2a97f5b
163.3 MB Download
md5:927e4dc799842bbd85459821a14e57f4
387.6 kB Download
md5:b5b4a32106375ab9a484889669b848c9
1.9 GB Download
md5:d630a194f65f46f91e012a22b380ba72
7.6 MB Download
md5:9144ba1cf0a601ec8ee4516e2152c8b5
2.9 GB Download
md5:a5a695d396b03bff37f732a6051c7d62
16.8 MB Download
md5:b5da56054534e28010b8265bc7c0f783
1.7 GB Download
md5:a4d0c1c4197436b620b40223947c27b7
22.9 MB Download