Gene expression and splicing counts from the Kremer et al study
Description
File description:
-
geneCounts: gene-level counts
-
k_j: split counts spanning from one exon to another.
-
k_theta: non-split counts covering a splice site
-
n_psi3: total split counts from a given acceptor site
-
n_psi5: total split counts from a given donor site
-
n_theta: total split and non-split counts for a given splice site
-
Sample annotation describing each sample from the dataset
-
Description file with global information from the dataset
The gene counts were originated using the GTF file from release 34 of GENCODE https://www.gencodegenes.org/human/release_34, and the split and non-split counts contain only the annonated junctions from the same release.
Use: The count matrices are intended to help researchers that are interested in using RNA-Seq data with the purpose of diagnostics. Researchers can merge their own dataset with the downloaded ones, provided the tissue, genome build, strand, and paired-end specifications match. Afterwards, the DROP pipeline can be used to compute expression and splicing outliers (https://github.com/gagneurlab/drop).
Number of samples: 119
Tissue: Fibroblast
Organism: Homo sapiens
Genome assembly: hg19
Gene annotation: gencode34
Disease (ICD-10: N): E75: 1, E79: 13, E88: 84, G31: 9, K72: 3, NONE: 9
Strand specific: FALSE
Paired end: TRUE
Contact person: Vicente Yepez, yepez@in.tum.de; Christian Mertes, mertes@in.tum.de
Files
Files
(111.1 MB)
Name | Size | Download all |
---|---|---|
md5:810b90c274263a3e5e6cd924c9d247d8
|
111.1 MB | Download |
Additional details
References
- Kremer, L., Bader, D., Mertes, C. et al. Genetic diagnosis of Mendelian disorders via RNA sequencing. Nat Commun 8, 15824 (2017). https://doi.org/10.1038/ncomms15824