Project deliverable Open Access
Zeller, Tanja; Spiess, Andrej; Engels, Anna Lena
<?xml version='1.0' encoding='UTF-8'?> <record xmlns="http://www.loc.gov/MARC21/slim"> <leader>00000nam##2200000uu#4500</leader> <datafield tag="041" ind1=" " ind2=" "> <subfield code="a">eng</subfield> </datafield> <controlfield tag="005">20201202122713.0</controlfield> <datafield tag="500" ind1=" " ind2=" "> <subfield code="a">This deliverable has been produced in the context of the euCanSHare (An EU-Canada joint infrastructure for next-generation multi-Study Heart research) Research and Innovation Action, funded by the European Union's Horizon 2020 programme (grant agreement No 825903), the Canadian Institutes of Health Research (CIHR) and the Fonds de recherche du Québec – Santé under the framework of Canada‐EU Commission Flagship Collaboration for Human data storage, integration and sharing.</subfield> </datafield> <controlfield tag="001">4299922</controlfield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="u">Medical Center Hamburg-Eppendorf</subfield> <subfield code="a">Spiess, Andrej</subfield> </datafield> <datafield tag="700" ind1=" " ind2=" "> <subfield code="u">Medical Center Hamburg-Eppendorf</subfield> <subfield code="a">Engels, Anna Lena</subfield> </datafield> <datafield tag="856" ind1="4" ind2=" "> <subfield code="s">473711</subfield> <subfield code="z">md5:497b9ed6591b953b7b0dd17a2de410de</subfield> <subfield code="u">https://zenodo.org/record/4299922/files/D4.4_euCanSHare_UKE_30112020.pdf</subfield> </datafield> <datafield tag="542" ind1=" " ind2=" "> <subfield code="l">open</subfield> </datafield> <datafield tag="260" ind1=" " ind2=" "> <subfield code="c">2020-11-30</subfield> </datafield> <datafield tag="909" ind1="C" ind2="O"> <subfield code="p">openaire</subfield> <subfield code="p">user-eucan</subfield> <subfield code="p">user-eucanshare</subfield> <subfield code="o">oai:zenodo.org:4299922</subfield> </datafield> <datafield tag="100" ind1=" " ind2=" "> <subfield code="u">Medical Center Hamburg-Eppendorf</subfield> <subfield code="a">Zeller, Tanja</subfield> </datafield> <datafield tag="245" ind1=" " ind2=" "> <subfield code="a">euCanSHare. Deliverable 4.4 - Bioinformatics Toolbox</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">user-eucan</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">user-eucanshare</subfield> </datafield> <datafield tag="536" ind1=" " ind2=" "> <subfield code="c">825903</subfield> <subfield code="a">An EU-Canada joint infrastructure for next-generation multi-Study Heart research</subfield> </datafield> <datafield tag="540" ind1=" " ind2=" "> <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield> <subfield code="a">Creative Commons Attribution 4.0 International</subfield> </datafield> <datafield tag="650" ind1="1" ind2="7"> <subfield code="a">cc-by</subfield> <subfield code="2">opendefinition.org</subfield> </datafield> <datafield tag="520" ind1=" " ind2=" "> <subfield code="a"><p>Meaning and purpose of this deliverable is to demonstrate the applicability of a bioinformatical tool (part of a larger toolbox) that can either analyse external data through an upload mechanism or offer the automatic analysis of internal server-housed data.<br> For this initial case, we selected the analysis of RNA sequencing (RNAseq) data, the de facto standard of today&rsquo;s gene expression measurement, as it is widely applied in the scientific community.<br> We have programmed a tool that (as it currently stands) can analyse differential gene expression between two groups, based on a provided &ldquo;raw count&rdquo; RNAseq matrix and three additional files containing gene annotation data, group definitions and covariates. All data is automatically matched and a subsequent extensive analyses of the data is conducted, including visualizations of expression levels, variance structure analysis by decomposition (PCA), variance contribution analysis, hierarchical clustering of top differential transcripts, profile plots, and diagnostic plots (MA plot, Volcano plot). During analysis, the obtained data to generate these exported plots is also automatically exported and named accordingly. The differential gene expression is calculated by covariate-adjusted linear models with multiple testing-corrected p-values. Finally, a large result matrix is generated, with the original count matrix augmented with annotations, gene names and the complete statistical data and sorted ascendingly by the corrected p-value, so that the most differential transcripts reside on the top of the data.<br> In future, it is envisaged that the user selects RNAseq data deposited alongside clinical variables and defines the desired grouping of the samples, which then is sufficient to create a complete analysis output as described above.</p></subfield> </datafield> <datafield tag="773" ind1=" " ind2=" "> <subfield code="n">doi</subfield> <subfield code="i">isVersionOf</subfield> <subfield code="a">10.5281/zenodo.4299921</subfield> </datafield> <datafield tag="024" ind1=" " ind2=" "> <subfield code="a">10.5281/zenodo.4299922</subfield> <subfield code="2">doi</subfield> </datafield> <datafield tag="980" ind1=" " ind2=" "> <subfield code="a">publication</subfield> <subfield code="b">deliverable</subfield> </datafield> </record>
All versions | This version | |
---|---|---|
Views | 61 | 61 |
Downloads | 44 | 44 |
Data volume | 20.8 MB | 20.8 MB |
Unique views | 60 | 60 |
Unique downloads | 41 | 41 |