Published March 9, 2022 | Version 2.0.0
Dataset Open

Control T-cell receptor (TCR) alpha and beta chain nucleotide and amino acid sequences from human and mouse

Authors/Creators

  • 1. Institute of Bioorganic Chemistry, Russian Academy of Sciences

Description

A dataset of pooled T-cell receptor (TCR) sequences for TCR alpha and beta chains of human and mouse.

Sequences are obtained from various samples of healthy individuals/mice using our conventional protocols: see for example [Britanova et al "Dynamics of individual T cell repertoires: from cord blood to centenarians" The Journal of Immunology 2016] and [Izraelson et al. "Comparative analysis of murine T‐cell receptor repertoires." Immunology 2018].

The sequences are stored as gzipped clonotype tables in VDJtools format, see [https://vdjtools-doc.readthedocs.io/en/master/input.html#vdjtools-format].

This control dataset can be used as a proxy for a generative VDJ rearrangement model to estimate the expected frequency distribution of TCRs and check for enrichment of rare TCR clonotypes and groups of similar TCR sequences. For the implementation of the enrichment analysis, please see CalcDegreeStats routine from VDJtools software, see [https://vdjtools-doc.readthedocs.io/en/master/annotate.html#calcdegreestats].

Files named "human.tra.strict.txt.gz", etc are pools of random/naive TCR clonotypes containing unique V/J/CDR3 nucleotide sequence combinations observed in data. The pools.zip file is used for TCR motif inference in VDJdb database [https://github.com/antigenomics/vdjdb-motifs], it contains human.tra.aa.txt, etc files that contain random/naive TCR clonotypes grouped by CDR3 amino acid sequence with the most frequent representative V and J.

Files

pools.zip

Files (1.3 GB)

Name Size Download all
md5:3464f2bfa15d77a602448df3e4b52280
79.2 MB Download
md5:44298995a19d00674b32e59b98fbddf9
625.2 MB Download
md5:e61789984c6f32b2f24a3dd86a116b4a
15.1 MB Download
md5:edf4a24a1fbda61097c58be65f11f147
25.4 MB Download
md5:ba734c3ad5a5c80d6625b635adf27eb9
518.2 MB Preview Download

Additional details

Related works

Is described by
Journal article: 10.3389/fimmu.2019.02159 (DOI)