Published May 24, 2018 | Version v1.5.8-SUPcaller
Dataset Open

Supplementary Data - Nanopore cDNA With and Without mtDNA

  • 1. Malaghan Institute of Medical Research
  • 2. Victoria University of Wellington, Malaghan Institute of Medical Research

Contributors

Other:

Description

Update (v1.5.8-SUPcaller): added scripts and data files for mt-Nd1 comparisons for different basecaller versions:

  • LAST_gci_*.csv - gap-compressed identity statistics
  • compStats_*.csv - called base-level statistics
  • process.txt - script example file demonstrating how to generate statistics
  • smealgol.r - R script to generate gap-compressed identities from LAST output
  • est_vs_actual.png - density plots comparing gap-compressed identity with mean estimated accuracy from the caller
  • GCI_albacore_guppy.png - violin plots comparing gap-compressed identity with different callers
  • BaseQ_vs_GCI.png - scatter plots comparing gap-compressed identity with mean estimated accuracy from the caller

Update (v1.5.7-Nd1Reads): added mt-Nd1 read subset from CGDec2017 run:

  • 4T1-WT_reads_CGDec2017_singlePlusMulti.tar

Update (v1.5.6-countAnalysis): added / updated a script to convert transcript mappings to a count table:

  • count_analysis.r

Update (v1.5.5-Int): added intermediate (post-demultiplexing) demonstration dataset:

  • SampleDESet_60k_demultiplexed.tar.gz

Also updated count tables and DESeq2 results:

  • raw_counts_2019-Nov-29.csv
  • DE_orig_VST_GRCm38_CG_GL261_2019-Nov-29.csv.gz
  • DE_shrunk_VST_GRCm38_CG_GL261_2019-Nov-29.csv.gz

Update (v1.5.4-MT): updated demonstration dataset (now 40Mb) to include mitochondrial transcripts, and exclude non-target sequences from chimeric reads:

  • reads_SampleDESet_sampled_60k.fastq.gz
  • Mm_subset.GRCm38.ensembl_v98.fa

Update (v1.5.3-Ensembl): added/updated Ensembl reference transcripts, including non-coding RNA:

  • ensembl_mm10_geneFeatureLocations.txt.gz - genome annotation
  • Mus_musculus.GRCm38.ensembl_v98.fa.gz - transcript sequences

Update (v1.5.2-Demo): added 60Mb demonstration dataset for teaching / learning purposes (reads_SampleDESet_sampled_60k.fastq.gz). This dataset can be used for testing demultiplexing, or carrying out differential expression analysis on nanopore reads. There are 15k reads sampled from each run (subset on ~300 genes), plus 7.5k chimeric reads from the Dec17 run, plus 7.5k reads sampled from from CG009/BC08. The gene subset includes ~100 genes differentially-expressed between 4T1WT and 4T1ρ0 cell lines, plus an additional ~200 genes with minimal differential expression observed between any 4T1 cell lines (i.e. a housekeeping gene set):

  • BC01, BC02, BC04, BC06 - 4T1 cell line reads from three sequencing runs (005, 005, 004, Dec17)
  • BC03, BC05, BC07 - 4T1ρ0 (without mtDNA) cell line reads from three sequencing runs (005, 004, Dec17)
  • BC08 - 4T1ρ0SC (from subcutaneous tumour) cell line reads from one sequencing run (009)

[note: v1.5.1-Demo has the wrong sequence ID for BC08 reads]

Update (v1.5.0-Called): added full 4T1 call set (called using guppy high-accuracy v3.2.2), gene transcript count tables, and differential expression results

Update (v1.4.1-Chimeric): added more example called chimeric reads of various [unclassified] categories

Update (v1.4.0-Chimeric): added example chimeric reads, one formed [presumably] during sample prep ["conjugation"], another formed in-silico ["double read"]

Update (v1.3.0-DESeq2): added count table and metadata file for DESeq2 analysis

Update (v1.2.1-10k): added 10k of FASTQ reads from a more recent run (part of the same differential expression project).

Supplementary data to go with David Eccles' poster about nanopore cDNA sequencing with and without mitochondrial DNA.

Reference mouse mtDNA and mtDNA reads from 4T1 cells (from MinION cDNA sequencing, and from SRA run SRR6747859).

  • fwd_4T1_BC06.correctedReads.uniqueOnly.fasta.gz -- Canu-corrected cDNA reads from Wildtype 4T1 strain, filtered as transcript-forward orientation
  • rev_4T1_BC06.correctedReads.uniqueOnly.fasta.gz -- Canu-corrected cDNA reads from Wildtype 4T1 strain, filtered as transcript-reverse orientation
  • fwd_4T1_BC07.correctedReads.uniqueOnly.fasta.gz -- Canu-corrected cDNA reads from 4T1ρ0 strain (no mitochondrial DNA), filtered as transcript-forward orientation
  • rev_4T1_BC07.correctedReads.uniqueOnly.fasta.gz -- Canu-corrected cDNA reads from 4T1ρ0 strain (no mitochondrial DNA), filtered as transcript-reverse orientation

Other intermediate data files are also included, see `4way_mapping_process.txt` for more details.

The mpileup differential coverage script can be found here.

Files

metadata_4T1.csv

Files (23.3 GB)

Name Size Download all
md5:661395631a72e5402b49ee6ecba3e17a
64.6 MB Download
md5:00ac1bdc1cca43e24c2f96b1f75a75c8
856 Bytes Download
md5:ac4241538cda6164c182f2ed05f47afc
404 Bytes Download
md5:529c22c7cd0c16acaa5c0b0922528d70
286 Bytes Download
md5:3dd575115054017ddce64965a946eda9
2.2 kB Download
md5:678c64e87746f82afecb97edf1f8f882
287 Bytes Download
md5:9a166ceca53982b27bd92b2f42c21e50
4.9 GB Download
md5:322de0619fc70559a2a00ad81d93792e
1.4 GB Download
md5:a4012480d7d640fe15204cd35bf0ec76
3.0 GB Download
md5:0a13fbf621c8fa724fcda4c32544a169
7.5 GB Download
md5:12f8bdf5222fd202ecc4522b6def8bbc
6.3 GB Download
md5:783e0e6db0c3602b09ac9d61c7437595
16.6 kB Download
md5:2f759fce597a692bc3e7f10bd7be2efe
1.9 kB Download
md5:443f501087910a41d76a54b1a2ad6d1b
1.0 MB Download
md5:81fbcf2f03e904709798e13597d70ca5
1.6 kB Download
md5:6917535a21f275bf6f89c4c022e640d5
998.9 kB Download
md5:8d889ded737e7209af6693ce792948c4
8.6 kB Download
md5:dda3312bc69339afd318804e69112e9c
2.4 MB Download
md5:90601ede7f3b9b18efde4c595bdbd4f8
1.1 kB Preview Download
md5:03cd32145b4ef959b5c05ad7af5b52e6
670.5 kB Download
md5:a63c2f1e6a9597ff15c498f81eda5a9a
54.0 MB Download
md5:e89c4dff50ffc0074b54fd1d987ffee2
7.2 MB Preview Download
md5:cbd500bd529ec4d3571d806a8b2a3ed3
42.7 MB Download
md5:0498de619e7ff573197d16b1ae581aa3
736.7 kB Preview Download
md5:6087f5d6e6d626423192dd7dabb05037
361.7 kB Preview Download
md5:46fd813a7cdac987eaa85009a307f701
173.3 kB Download
md5:2de00e8651270b60b915bc0ee8c65f7e
8.8 kB Download
md5:f537a25b28c915e3b9d1d35de73fadb4
15.7 MB Download
md5:7602dd11e7dcc0873b5d3838f47c314c
42.9 MB Download
md5:168fe98fe83b11406ad780b1614a956c
2.8 kB Download

Additional details

Related works

Is cited by
10.7490/f1000research.1115509.1 (DOI)
Is supplemented by
Software documentation: 10.17504/protocols.io.8azhsf6 (DOI)