Supplementary Data - Nanopore cDNA With and Without mtDNA
Creators
- 1. Malaghan Institute of Medical Research
- 2. Victoria University of Wellington, Malaghan Institute of Medical Research
Contributors
Other:
Description
Update (v1.5.8-SUPcaller): added scripts and data files for mt-Nd1 comparisons for different basecaller versions:
- LAST_gci_*.csv - gap-compressed identity statistics
- compStats_*.csv - called base-level statistics
- process.txt - script example file demonstrating how to generate statistics
- smealgol.r - R script to generate gap-compressed identities from LAST output
- est_vs_actual.png - density plots comparing gap-compressed identity with mean estimated accuracy from the caller
- GCI_albacore_guppy.png - violin plots comparing gap-compressed identity with different callers
- BaseQ_vs_GCI.png - scatter plots comparing gap-compressed identity with mean estimated accuracy from the caller
Update (v1.5.7-Nd1Reads): added mt-Nd1 read subset from CGDec2017 run:
- 4T1-WT_reads_CGDec2017_singlePlusMulti.tar
Update (v1.5.6-countAnalysis): added / updated a script to convert transcript mappings to a count table:
- count_analysis.r
Update (v1.5.5-Int): added intermediate (post-demultiplexing) demonstration dataset:
- SampleDESet_60k_demultiplexed.tar.gz
Also updated count tables and DESeq2 results:
- raw_counts_2019-Nov-29.csv
- DE_orig_VST_GRCm38_CG_GL261_2019-Nov-29.csv.gz
- DE_shrunk_VST_GRCm38_CG_GL261_2019-Nov-29.csv.gz
Update (v1.5.4-MT): updated demonstration dataset (now 40Mb) to include mitochondrial transcripts, and exclude non-target sequences from chimeric reads:
- reads_SampleDESet_sampled_60k.fastq.gz
- Mm_subset.GRCm38.ensembl_v98.fa
Update (v1.5.3-Ensembl): added/updated Ensembl reference transcripts, including non-coding RNA:
- ensembl_mm10_geneFeatureLocations.txt.gz - genome annotation
- Mus_musculus.GRCm38.ensembl_v98.fa.gz - transcript sequences
Update (v1.5.2-Demo): added 60Mb demonstration dataset for teaching / learning purposes (reads_SampleDESet_sampled_60k.fastq.gz). This dataset can be used for testing demultiplexing, or carrying out differential expression analysis on nanopore reads. There are 15k reads sampled from each run (subset on ~300 genes), plus 7.5k chimeric reads from the Dec17 run, plus 7.5k reads sampled from from CG009/BC08. The gene subset includes ~100 genes differentially-expressed between 4T1WT and 4T1ρ0 cell lines, plus an additional ~200 genes with minimal differential expression observed between any 4T1 cell lines (i.e. a housekeeping gene set):
- BC01, BC02, BC04, BC06 - 4T1 cell line reads from three sequencing runs (005, 005, 004, Dec17)
- BC03, BC05, BC07 - 4T1ρ0 (without mtDNA) cell line reads from three sequencing runs (005, 004, Dec17)
- BC08 - 4T1ρ0SC (from subcutaneous tumour) cell line reads from one sequencing run (009)
[note: v1.5.1-Demo has the wrong sequence ID for BC08 reads]
Update (v1.5.0-Called): added full 4T1 call set (called using guppy high-accuracy v3.2.2), gene transcript count tables, and differential expression results
Update (v1.4.1-Chimeric): added more example called chimeric reads of various [unclassified] categories
Update (v1.4.0-Chimeric): added example chimeric reads, one formed [presumably] during sample prep ["conjugation"], another formed in-silico ["double read"]
Update (v1.3.0-DESeq2): added count table and metadata file for DESeq2 analysis
Update (v1.2.1-10k): added 10k of FASTQ reads from a more recent run (part of the same differential expression project).
Supplementary data to go with David Eccles' poster about nanopore cDNA sequencing with and without mitochondrial DNA.
Reference mouse mtDNA and mtDNA reads from 4T1 cells (from MinION cDNA sequencing, and from SRA run SRR6747859).
- fwd_4T1_BC06.correctedReads.uniqueOnly.fasta.gz -- Canu-corrected cDNA reads from Wildtype 4T1 strain, filtered as transcript-forward orientation
- rev_4T1_BC06.correctedReads.uniqueOnly.fasta.gz -- Canu-corrected cDNA reads from Wildtype 4T1 strain, filtered as transcript-reverse orientation
- fwd_4T1_BC07.correctedReads.uniqueOnly.fasta.gz -- Canu-corrected cDNA reads from 4T1ρ0 strain (no mitochondrial DNA), filtered as transcript-forward orientation
- rev_4T1_BC07.correctedReads.uniqueOnly.fasta.gz -- Canu-corrected cDNA reads from 4T1ρ0 strain (no mitochondrial DNA), filtered as transcript-reverse orientation
Other intermediate data files are also included, see `4way_mapping_process.txt` for more details.
The mpileup differential coverage script can be found here.
Files
metadata_4T1.csv
Files
(23.3 GB)
Name | Size | Download all |
---|---|---|
md5:661395631a72e5402b49ee6ecba3e17a
|
64.6 MB | Download |
md5:00ac1bdc1cca43e24c2f96b1f75a75c8
|
856 Bytes | Download |
md5:ac4241538cda6164c182f2ed05f47afc
|
404 Bytes | Download |
md5:529c22c7cd0c16acaa5c0b0922528d70
|
286 Bytes | Download |
md5:3dd575115054017ddce64965a946eda9
|
2.2 kB | Download |
md5:678c64e87746f82afecb97edf1f8f882
|
287 Bytes | Download |
md5:9a166ceca53982b27bd92b2f42c21e50
|
4.9 GB | Download |
md5:322de0619fc70559a2a00ad81d93792e
|
1.4 GB | Download |
md5:a4012480d7d640fe15204cd35bf0ec76
|
3.0 GB | Download |
md5:0a13fbf621c8fa724fcda4c32544a169
|
7.5 GB | Download |
md5:12f8bdf5222fd202ecc4522b6def8bbc
|
6.3 GB | Download |
md5:783e0e6db0c3602b09ac9d61c7437595
|
16.6 kB | Download |
md5:2f759fce597a692bc3e7f10bd7be2efe
|
1.9 kB | Download |
md5:443f501087910a41d76a54b1a2ad6d1b
|
1.0 MB | Download |
md5:81fbcf2f03e904709798e13597d70ca5
|
1.6 kB | Download |
md5:6917535a21f275bf6f89c4c022e640d5
|
998.9 kB | Download |
md5:8d889ded737e7209af6693ce792948c4
|
8.6 kB | Download |
md5:dda3312bc69339afd318804e69112e9c
|
2.4 MB | Download |
md5:90601ede7f3b9b18efde4c595bdbd4f8
|
1.1 kB | Preview Download |
md5:03cd32145b4ef959b5c05ad7af5b52e6
|
670.5 kB | Download |
md5:a63c2f1e6a9597ff15c498f81eda5a9a
|
54.0 MB | Download |
md5:e89c4dff50ffc0074b54fd1d987ffee2
|
7.2 MB | Preview Download |
md5:cbd500bd529ec4d3571d806a8b2a3ed3
|
42.7 MB | Download |
md5:0498de619e7ff573197d16b1ae581aa3
|
736.7 kB | Preview Download |
md5:6087f5d6e6d626423192dd7dabb05037
|
361.7 kB | Preview Download |
md5:46fd813a7cdac987eaa85009a307f701
|
173.3 kB | Download |
md5:2de00e8651270b60b915bc0ee8c65f7e
|
8.8 kB | Download |
md5:f537a25b28c915e3b9d1d35de73fadb4
|
15.7 MB | Download |
md5:7602dd11e7dcc0873b5d3838f47c314c
|
42.9 MB | Download |
md5:168fe98fe83b11406ad780b1614a956c
|
2.8 kB | Download |
Additional details
Related works
- Is cited by
- 10.7490/f1000research.1115509.1 (DOI)
- Is supplemented by
- Software documentation: 10.17504/protocols.io.8azhsf6 (DOI)