Published March 7, 2018 | Version 3.0
Dataset Open

Conserved regulation of RNA processing in somatic cell reprogramming

  • 1. Biozentrum, University of Basel, Basel, Switzerland; RNA Regulatory Networks, Swiss Institute of Bioinformatics, Lausanne, Switzerland
  • 2. MRC Centre for Regenerative Medicine, University of Edinburgh, Edinburgh, Scotland, United Kingdom

Description

Data set 1. Transcript expression across human RNA-Seq samples: estimated read counts. The file contains estimated read counts, generated by kallisto (https://pachterlab.github.io/kallisto/), for human transcripts and RNA-Seq samples used in this study (see Additional file 2 of the accompanying publication). The format is a compressed (GZIP) tab-separated transcript-by-sample matrix. Ensembl transcript identifiers and a combined Sequence Read Archive study/sample name identifier serve as row and column names, respectively.

Data set 2. Transcript expression across murine RNA-Seq samples: estimated read counts. As in Data set 1, but for mouse transcripts.

Data set 3. Transcript expression across simian RNA-Seq samples: estimated read counts. As in Data set 1, but for chimpanzee transcripts.

Data set 4. Transcript expression across across human RNA-Seq samples: estimated transcript abundances. As in Data set 1, but instead of read counts, transcript abundances in transcripts per million (TPM), as estimated by kallisto (https://pachterlab.github.io/kallisto/), are listed. Format, column and row names as in Data set 1.

Data set 5. Transcript expression across murine RNA-Seq samples: estimated transcript abundances. As in Data set 4, but for mouse transcripts.

Data set 6. Transcript expression across simian RNA-Seq samples: estimated transcript abundances. As in Data set 4, but for chimpanzee transcripts.

Data set 7. Differential expression analyses across human RNA-Seq sample groups: log fold changes. The file contains log fold changes, inferred by edgeR (http://bioconductor.org/packages/release/bioc/html/edgeR.html), for human genes and the RNA-Seq sample group contrasts listed in Additional file 3 of the accompanying publication in a compressed (GZIP) TSV gene-by-comparison matrix. Ensembl gene identifiers and a descriptive contrast identifier serve as row and column names, respectively.

Data set 8. Differential expression analyses across murine RNA-Seq sample groups: log fold changes. As in Data set 7, but for mouse genes.

Data set 9. Differential expression analyses across simian RNA-Seq sample groups: log fold changes. As in Data set 7, but for chimpanzee genes.

Data set 10. Differential expression analyses across human RNA-Seq sample groups: false discovery rates. The file contains false discovery rates (FDR) for the differential expression analyses summarized in Data set 7. Format, column and row names as in Data set 7.

Data set 11. Differential expression analyses across murine RNA-Seq sample groups: false discovery rates. As in Data set 10, but for mouse genes.

Data set 12. Differential expression analyses across simian RNA-Seq sample groups: false discovery rates. As in Data set 10, but for chimpanzee genes.

Data set 13. Quantification of alternative splicing events across human RNA-Seq samples. The file contains ‘percent spliced in’ (PSI) values computed by SUPPA (https://github.com/comprna/SUPPA) for annotated alternative splicing events (inferred from the transcript annotation of the human genome, Ensembl release 84; http://www.ensembl.org/). The format is a compressed (GZIP) tab-separated transcript-by-sample matrix. SUPPA-provided event identifiers and a combined Sequence Read Archive study/sample name identifier serve as row and column names, respectively.

Data set 14. Quantification of alternative splicing events across murine RNA-Seq samples. As in Data set 13, but for mouse alternative splicing events.

Data set 15. Differential splicing analyses across human RNA-Seq sample groups: differences in ‘percent spliced in’ (ΔPSI). The file contains ΔPSI values for human alternative splicing events (as in Data set 13). The RNA-Seq sample group contrasts are listed in Additional file 3 of the accompanying publication. Values were inferred by SUPPA’s diffSplice functionality (https://github.com/comprna/SUPPA). The format is a compressed (GZIP) tab-separated gene-by-comparison matrix. SUPPA event identifiers and a descriptive contrast identifier serve as row and column names, respectively.

Data set 16. Differential splicing analyses across murine RNA-Seq sample groups: differences in ‘percent spliced in’ (ΔPSI). As in Data set 15, but for mouse alternative splicing events.

Data set 17. Differential splicing analyses across human RNA-Seq sample groups: P values. The file contains P values for the differential splicing analysis of human alternative splicing events summarized in Data set 15. Format, column and row names as in Data set 15.

Data set 18. Differential splicing analyses across murine RNA-Seq sample groups: P values. The file contains P values for the differential splicing analysis of mouse alternative splicing events summarized in Data set 16. Format, column and row names as in Data set 15.

Data set 19. Transcript expression across murine RNA-Seq time course data: estimated read counts. As in Data set 2, but for the time course data generated for the accompanying publication.

Data set 20. Transcript expression across murine RNA-Seq time course data: estimated transcript abundances. As in Data set 5, but for the time course data generated for the accompanying publication.

Data set 21. Quantification of alternative splicing events across murine RNA-Seq time course data. As in Data set 14, but for the time course data generated for the accompanying publication.

Notes

All files are in compressed (GZIP) tab-separated values format.

Files

Files (133.6 MB)

Name Size Download all
md5:5350222364f2e54e61da53fe4a3288e0
16.9 MB Download
md5:7348fd39ffc2fbbfa418e67f8e43c83f
614.0 kB Download
md5:f4631ff35ebea3f4442236e6af0672eb
673.6 kB Download
md5:45d82934eca216a8da6a8017f30da69d
109.7 kB Download
md5:9074f554606d2d2e3a445fda4e2b50b6
19.8 MB Download
md5:2c5311e85576d43f9f29905a0af1ddb7
5.3 MB Download
md5:073e5c0cb8678c421f3a7af27ce44de2
2.1 MB Download
md5:4c89f9c47b267de18bd95ca036d6befb
1.2 MB Download
md5:7546e466334ea49be154a61c4030c7ec
1.9 MB Download
md5:c981c08f377e5d73b759b6a4e24f4e6e
1.2 MB Download
md5:22351794d4496f9e5c32071f61b4986d
2.1 MB Download
md5:c03b58b6f9b477ad810832346e288ff0
6.6 MB Download
md5:892365fe012ec46b6db70f7237c9ea91
5.7 MB Download
md5:ac174408752d5cdeb610161acaa4f032
4.8 MB Download
md5:76aea7ff87ff9a028c5a896f7544c076
1.5 MB Download
md5:a344c806f6f165118f74a979ffb63ea8
42.8 MB Download
md5:f6722c6c882f4776beb4fa602f1162ff
15.4 MB Download
md5:19af336a8fe48f2255648f158cc88508
3.6 MB Download
md5:a11506ba1cc97d2bc2c95cdebfeffb3e
579.8 kB Download
md5:f7dbce9f1d2cc00b88dbcffc9bd8d54c
708.6 kB Download
md5:9847d0cc4ed9893cc2a24ec7eca434ca
101.9 kB Download

Additional details

Related works

Is cited by
10.1186/s12864-019-5438-2 (DOI)
Is documented by
10.1186/s12864-019-5438-2 (DOI)
Is supplement to
10.1186/s12864-019-5438-2 (DOI)

Funding

European Commission
WHYMIR - Interplays between miRNAs and transcription factors in the determination and maintenance of cell identity 310510

References

  • Alamancos, G. P., Pagès, A., Trincado, J. L., Bellora, N. & Eyras, E. Leveraging transcript quantification for fast computation of alternative splicing profiles. RNA 21, 1521–1531 (2015).
  • Bray, N. L., Pimentel, H., Melsted, P. & Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527 (2016).
  • Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2010).
  • Yates, A. et al. Ensembl 2016. Nucleic Acids Res. 44, D710–6 (2016).
  • Trincado, J. L. et al. SUPPA2: fast, accurate, and uncertainty-aware differential splicing analysis across multiple conditions. Genome Biol. 19(1):40 (2018).