There is a newer version of the record available.

Published July 22, 2018 | Version v1
Dataset Open

Control T-cell receptor sequences

  • 1. Institute of Bioorganic Chemistry, Russian Academy of Sciences

Description

A dataset of pooled T-cell receptor (TCR) sequences for TCR alpha and beta chains of human and mouse.

Sequences were obtained from various samples of healthy individuals and mice using our conventional protocols; see, for example, [Britanova et al. "Dynamics of individual T cell repertoires: from cord blood to centenarians." The Journal of Immunology 2016] and [Izraelson et al. "Comparative analysis of murine T‐cell receptor repertoires." Immunology 2018].

The sequences are stored as gzipped clonotype tables in VDJtools format; see [https://vdjtools-doc.readthedocs.io/en/master/input.html#vdjtools-format].
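As a minimal sketch of how such a table might be loaded, the Python snippet below reads a gzipped, tab-delimited file with a header line into a list of dictionaries. The function name `read_clonotype_table` is hypothetical, and the column names `count` and `freq` are assumptions based on the format description linked above; check the header of the actual files before relying on them.

```python
import csv
import gzip


def read_clonotype_table(path):
    """Read a gzipped, tab-delimited clonotype table into a list of dicts.

    Column names come from the file's header line. A leading '#' on a
    header name is stripped. The 'count' and 'freq' columns (assumed
    names) are converted to int and float when present.
    """
    rows = []
    with gzip.open(path, mode="rt", newline="") as handle:
        for raw in csv.DictReader(handle, delimiter="\t"):
            # Skip overflow fields (key None) and normalise header names.
            row = {k.lstrip("#"): v for k, v in raw.items() if k is not None}
            if "count" in row:
                row["count"] = int(row["count"])  # assumed to be an integer
            if "freq" in row:
                row["freq"] = float(row["freq"])
            rows.append(row)
    return rows
```

A call such as `read_clonotype_table("control.txt.gz")` (the file name is a placeholder) returns one dictionary per clonotype. The largest file in this record is several hundred megabytes compressed, so iterating row by row instead of building a list may be preferable there.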

This control dataset can be used as a proxy for a generative VDJ rearrangement model to estimate the expected frequency distribution of TCRs and check for enrichment of rare TCR clonotypes and groups of similar TCR sequences. For an implementation of the enrichment analysis, see the CalcDegreeStats routine in the VDJtools software: [https://vdjtools-doc.readthedocs.io/en/master/annotate.html#calcdegreestats].
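To illustrate the idea of checking for groups of similar sequences against a control, the Python sketch below counts, for each sequence in a sample, how many distinct sequences differ from it by exactly one substitution, both within the sample and within the control set. This is not the CalcDegreeStats implementation; the function names and the one-mismatch neighbourhood are assumptions chosen for illustration only.

```python
from collections import defaultdict


def build_index(sequences):
    """Index sequences by (position, sequence with that position removed).

    Two equal-length sequences share a key exactly when they are identical
    or differ only at that position.
    """
    index = defaultdict(set)
    for seq in set(sequences):
        for i in range(len(seq)):
            index[(i, seq[:i] + seq[i + 1:])].add(seq)
    return index


def degree(seq, index):
    """Count distinct indexed sequences at Hamming distance exactly 1."""
    neighbors = set()
    for i in range(len(seq)):
        neighbors |= index.get((i, seq[:i] + seq[i + 1:]), set())
    neighbors.discard(seq)  # the sequence itself is not its own neighbour
    return len(neighbors)


def degree_table(sample, control):
    """Map each sample sequence to (degree in sample, degree in control)."""
    sample_index = build_index(sample)
    control_index = build_index(control)
    return {
        seq: (degree(seq, sample_index), degree(seq, control_index))
        for seq in set(sample)
    }
```

A sequence with many neighbours in the sample but few in the much larger control set is a candidate for enrichment. Judging that properly requires normalising by the sizes of the two sets and applying a statistical test, which this sketch leaves out.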

Files

Files (744.9 MB)

Checksum Size
md5:3464f2bfa15d77a602448df3e4b52280 79.2 MB
md5:44298995a19d00674b32e59b98fbddf9 625.2 MB
md5:e61789984c6f32b2f24a3dd86a116b4a 15.1 MB
md5:edf4a24a1fbda61097c58be65f11f147 25.4 MB
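
The MD5 checksums listed above can be used to confirm that a download completed without corruption. The Python sketch below is one way to do this with the standard library; the function names are hypothetical, and the local file path is whatever name the downloaded file was saved under.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 with an expected value such as 'md5:3464...'."""
    expected = expected.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

For example, `matches_checksum(local_path, "md5:3464f2bfa15d77a602448df3e4b52280")` should return `True` for an intact copy of the 79.2 MB file.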