Published December 26, 2024 | Version v1
Dataset Open

A proposed workflow to robustly analyze bacterial transcripts in RNAseq data from Extracellular Vesicles

  • 1. Biogipuzkoa Health Research Intitute
  • 2. ROR icon Biogipuzkoa Health Research Institute
  • 3. Biodonostia Research Institute

Description

The microbiota has been unequivocally linked to various diseases, yet the mechanisms underlying these associations remain incompletely understood. One potential contributor to this relationship is the extracellular vesicles produced by bacteria (bEVs). However, the detection of these bEVs is challenging. Therefore, we propose a novel workflow to identify bacterial RNA present in circulating extracellular vesicles using Total EV RNA-seq data. As a proof of concept, we applied this workflow to a dataset from individuals with multiple sclerosis (MS).

We analyzed Total EV RNA-seq data from blood samples of healthy controls and individuals with MS, encompassing both the Relapsing-Remitting (RR) and Secondary Progressive (SP) phases of the disease. Our workflow incorporates multiple reference mapping steps against the host genome, followed by a consensus selection of bacterial genera based on various taxonomic profiling tools. This consensus approach utilizes a flagging system to exclude species with low abundance or high variability across profilers.  Additionally, including biological samples from known cultured species and generating artificial reads constitute two key aspects of this workflow to validate the high specificity of the approach.

Our findings demonstrate that bacterial RNA can indeed be detected in Total EV RNA-seq from blood samples, suggesting that this workflow can be a powerful tool for reanalyzing RNA-seq data from EV studies. Additionally, we identified promising bacterial candidates with differential expression between the RR and SP phases of MS. This approach provides valuable insights into the potential role of bEVs in the microbiota-host communication. Finally, this approach is translatable to other experiments using total RNA, where the lack of a robust pipeline can lead to an increased false positive detection of microbial species. 

The workflow is available at the following repository: https://github.com/NanoNeuro/EV_taxprofiling

Files

Files (24.5 GB)

Name Size Download all
md5:bc28e2f5c34fdff14a37bb9835ac142b
4.6 MB Download
md5:314fa261edc98a59c62e4518e67dabd4
360.4 kB Download
md5:9d2aea9a10d587c490c71251dd25e421
3.5 MB Download
md5:31444e8329bbb45295a9e761037c4556
24.5 GB Download
md5:8c1624896b07e3c3030658585ed07fc1
2.8 MB Download

Additional details

Funding

European Committee for Treatment and Research in Multiple Sclerosis
Postdoctoral Research Fellowship Programme
Basque Government
POS_2020_1_0008
Instituto de Salud Carlos III
Sara Borrell
Basque Government
PREDOC BERRI
Basque Government
IKR Nanoneuro
Instituto de Salud Carlos III
PI23/00903
Instituto de Salud Carlos III
PI20/00327

Software

Repository URL
https://github.com/NanoNeuro/EV_taxprofiling
Programming language
Python, Shell