There is a newer version of the record available.

Published November 12, 2020 | Version 2
Dataset Open

Gene expression and splicing counts from the Kremer et al study

  • 1. Technical University of Munich

Description

File description:

  1. geneCounts: gene-level counts 

  2. k_j: split counts spanning from one exon to another.

  3. k_theta: non-split counts covering a splice site

  4. n_psi3: total split counts from a given acceptor site

  5. n_psi5: total split counts from a given donor site

  6. n_theta: total split and non-split counts for a given splice site

  7. Sample annotation describing each sample from the dataset

  8. Description file with global information from the dataset

 

The gene counts were originated using the GTF file from release 34 of GENCODE https://www.gencodegenes.org/human/release_34, and the split and non-split counts contain only the annonated junctions from the same release.

Use: The count matrices are intended to help researchers that are interested in using RNA-Seq data with the purpose of diagnostics. Researchers can merge their own dataset with the downloaded ones, provided the tissue, genome build, strand, and paired-end specifications match. Afterwards, the DROP pipeline can be used to compute expression and splicing outliers (https://github.com/gagneurlab/drop).

Number of samples: 119
Tissue: Fibroblast
Organism: Homo sapiens
Genome assembly: hg19
Gene annotation: gencode34
Disease (ICD-10: N): E75: 1, E79: 13, E88: 84, G31: 9, K72: 3, NONE: 9
Strand specific: FALSE
Paired end: TRUE
Contact person: Vicente Yepez, yepez@in.tum.de; Christian Mertes, mertes@in.tum.de
 

 

Files

Files (111.1 MB)

Name Size Download all
md5:810b90c274263a3e5e6cd924c9d247d8
111.1 MB Download

Additional details

References