Project deliverable Open Access

euCanSHare. Deliverable 4.4 - Bioinformatics Toolbox

Zeller, Tanja; Spiess, Andrej; Engels, Anna Lena

Citation Style Language JSON Export

  "publisher": "Zenodo", 
  "DOI": "10.5281/zenodo.4299922", 
  "language": "eng", 
  "title": "euCanSHare. Deliverable 4.4 - Bioinformatics Toolbox", 
  "issued": {
    "date-parts": [
  "abstract": "<p>The purpose of this deliverable is to demonstrate the applicability of a bioinformatics tool (part of a larger toolbox) that can either analyse external data through an upload mechanism or automatically analyse internal, server-housed data.<br>\nFor this initial case, we selected the analysis of RNA sequencing (RNAseq) data, the de facto standard of today&rsquo;s gene expression measurement, as it is widely applied in the scientific community.<br>\nWe have programmed a tool that, as it currently stands, can analyse differential gene expression between two groups, based on a provided &ldquo;raw count&rdquo; RNAseq matrix and three additional files containing gene annotation data, group definitions and covariates. All data are matched automatically, and an extensive analysis is then conducted, including visualizations of expression levels, variance structure analysis by decomposition (PCA), variance contribution analysis, hierarchical clustering of the top differential transcripts, profile plots, and diagnostic plots (MA plot, volcano plot). During the analysis, the data used to generate these plots are also exported automatically and named accordingly. Differential gene expression is calculated with covariate-adjusted linear models and multiple-testing-corrected p-values. Finally, a large result matrix is generated, in which the original count matrix is augmented with annotations, gene names and the complete statistical data and sorted in ascending order of the corrected p-value, so that the most differential transcripts are at the top.<br>\nIn future, it is envisaged that the user will select RNAseq data deposited alongside clinical variables and define the desired grouping of the samples, which will then be sufficient to create a complete analysis output as described above.</p>", 
  "author": [
    {
      "family": "Zeller, Tanja"
    }, 
    {
      "family": "Spiess, Andrej"
    }, 
    {
      "family": "Engels, Anna Lena"
    }
  ], 
  "note": "This deliverable has been produced in the context of the euCanSHare (An EU-Canada joint infrastructure for next-generation multi-Study Heart research) Research and Innovation Action, funded by the European Union's Horizon 2020 programme (grant agreement No 825903), the Canadian Institutes of Health Research (CIHR) and the Fonds de recherche du Qu\u00e9bec \u2013 Sant\u00e9 under the framework of Canada\u2010EU Commission Flagship Collaboration for Human data storage, integration and sharing.", 
  "type": "report", 
  "id": "4299922"
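
The abstract mentions multiple-testing-corrected p-values and a result matrix sorted in ascending order of the corrected p-value. The record does not say which correction the tool applies, so the following is only a minimal sketch that assumes the Benjamini-Hochberg procedure. The function names, gene identifiers and p-values are hypothetical and are not taken from the deliverable.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the input order.

    Assumption: BH is used here for illustration only; the deliverable
    does not name its correction method.
    """
    m = len(p_values)
    # Indices of the p-values from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def rank_by_adjusted_p(gene_p_values):
    """Return (gene, p, p_adj) tuples sorted ascending by adjusted p-value."""
    genes = list(gene_p_values)
    raw = [gene_p_values[g] for g in genes]
    adj = benjamini_hochberg(raw)
    rows = list(zip(genes, raw, adj))
    return sorted(rows, key=lambda row: row[2])


# Hypothetical raw p-values for four made-up gene identifiers.
example = {"geneA": 0.04, "geneB": 0.001, "geneC": 0.03, "geneD": 0.5}
for gene, p, p_adj in rank_by_adjusted_p(example):
    print(f"{gene}\t{p:.3f}\t{p_adj:.3f}")
```

In a full analysis the raw p-values would come from the per-gene covariate-adjusted linear models. Here they are supplied directly to keep the sketch self-contained.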
                  All versions  This version
Views             61            61
Downloads         44            44
Data volume       20.8 MB       20.8 MB
Unique views      60            60
Unique downloads  41            41

