There is a newer version of the record available.

Published February 27, 2024 | Version v1
Workflow Open

High-resolution scRNA-seq reveals genomic determinants of antigen expression hierarchy in African Trypanosomes

  • 1. ROR icon Ludwig-Maximilians-Universität München
  • 2. ROR icon University of Edinburgh
  • 3. Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA

Description

Description of the repository:

This repository contains the data analysis workflows for the manuscript entitled "High-resolution scRNA-seq reveals genomic determinants of antigen expression hierarchy in African Trypanosomes".

Due to space limitations on Zenodo, all the input raw sequencing data files were deposited in the European Nucleotide Archive (ENA) under project number PRJEB72370.

The folder "Switching_Hierarchy_Paper_Analysis_Pipelines.tar.gz" contains three subfolders: "Genomes" ; "Pre-processing" and "Downstream_Analysis" :

  • The "Genomes" folder contains all the input fasta files and annotation files used in this project.
  • The "Data_Pre-processing" folder contains worklows for preprocessing the sequencing data (i.e., subsampling of reads, alignments, production of count matrices, generation of single-cell coverage files, de-novo single-cell transcriptome assembly). The raw sequencing data files need to be downloaded from ENA (PRJEB72370) and placed in the "input/reads" folder of each respective pre-processing folder. The workflow for each dataset is in a bash script named "run.sh" , designed to run on a High-performance computer cluster with the SLURM job scheduling system. Accessory scripts are located in the "bin" folder. Final output files used in downstream analysis are in the "output" folder. Intermediate output files were not retained due to space limiatations. The required software environment can be recreated using the provided yaml file: "SS3x_mapping_counting_VSGhierarchy_env.yaml".
  • The "Downstream_Analysis" folder contains, for each dataset, Jupyter notebooks and input files to generate all figures and analysis conducted during this project based on the sequencing data. The required software environment can be recreated using the provided yaml file: "scAnalysis_env.yaml"

 

Abstract

Antigenic variation is an immune evasion strategy used by many different pathogens. It involves the periodic, non-random switch in the expression of different antigens throughout an infection. How the observed hierarchy in antigen expression is achieved has remained a mystery. A key challenge in uncovering this process has been the inability to track transcriptome changes and potential genomic rearrangements in individual cells during a switch event. Here, we report the establishment of a highly sensitive single-cell RNA-seq (scRNA-seq) approach for the model protozoan parasite Trypanosoma brucei. This approach has revealed genomic rearrangements that occur in individual cells during a switch event. Our data show that following a double-strand break (DSB) in the transcribed antigen-coding gene – an important trigger for antigen switching – the type of repair mechanism and the resultant antigen expression depend on the availability of a homologous repair template in the genome. When such a template was available, repair proceeded through segmental gene conversion, creating new, mosaic antigen-coding genes. Conversely, in the absence of a suitable template, a telomere-adjacent antigen-coding gene from a different part of the genome was activated by break-induced replication. Our results reveal the critical role of available repair sequence in the antigen selection mechanism. Additionally, our study demonstrates the power of highly sensitive scRNA-seq methods in detecting genomic rearrangements that drive transcriptional changes at the single-cell level.

Notes

This work was funded by the German Research Foundation [SI 1610 / 2-2], an ERC Starting Grant (3D_Tryps 715466) and an ERC Consolidator Grant (SwitchDecoding 101044320) awarded to TNS. ZK and AD were supported by MSCA ITN Cell2Cell fellowships. KRM was supported by European Union’s Framework Programme for Research and Innovation Horizon 2020 (2014-2020) under the Marie Skłodowska-Curie Grant Agreement No. 754388 (LMUResearchFellows) and from LMUexcellent, funded by the Federal Ministry of Education and Research (BMBF) and the Free State of Bavaria under the Excellence Strategy of the German Federal Government and the Länder.

Files

Files (3.2 GB)

Name Size Download all
md5:1795bdc0c80fbabd8e1c44f23af05ed9
433 Bytes Download
md5:d254f187981b758c8508266a8e65a082
434 Bytes Download
md5:6e3d702616689eb10baf11a922f0bc7c
3.2 GB Download

Additional details

Software

Programming language
Shell , Jupyter Notebook , Python