Published May 16, 2024 | Version 1.0.0
Dataset Open

lr-kallisto Simulation Dataset

Description

The NanoSim pretrained model human_NA12878_dRNA_Bham1_guppy was used to generate transcriptomic reads, stored in the file human_NA12878_dRNA_Bham1_guppy_reads.fastq.gz.

The modified uLTRA simulators, included in the repository https://github.com/pachterlab/LSRRSRLFKOTWMWMP_2024 (Figures folder, SupplementaryFigure4), were used to generate:

ultra_sim_ONT_homo_2M.fq.gz (2 million ONT InDel Profile reads at 1% sequencing error),
ultra_sim_ONT_homo_2M.02.fq.gz (2 million ONT InDel Profile reads at 2% sequencing error),
ultra_sim_ONT_homo_2M.04.fq.gz (2 million ONT InDel Profile reads at 4% sequencing error),

ultra_sim_PB_homo_2M.001.fq.gz (2 million PacBio InDel Profile reads at 0.1% sequencing error),
ultra_sim_PB_homo_2M.005.fq.gz (2 million PacBio InDel Profile reads at 0.5% sequencing error),
ultra_sim_PB_homo_2M.015.fq.gz (2 million PacBio InDel Profile reads at 1.5% sequencing error), and
ultra_sim_PB_homo_2M.02.fq.gz (2 million PacBio InDel Profile reads at 2% sequencing error).
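
As a quick sanity check after download, the read count of each simulated file can be compared against the 2 million reads stated above. The following is a minimal sketch, not part of the dataset's own tooling: the function name count_fastq_reads is hypothetical, and it assumes standard four-line FASTQ records (no wrapped sequence lines) in a gzip-compressed file.

```python
import gzip
import sys


def count_fastq_reads(path):
    """Count records in a gzip-compressed FASTQ file.

    Assumes each record occupies exactly four lines
    (header, sequence, '+', qualities).
    """
    n_lines = 0
    with gzip.open(path, "rt") as handle:
        for _ in handle:
            n_lines += 1
    if n_lines % 4 != 0:
        raise ValueError(
            f"{path}: {n_lines} lines is not a multiple of 4; "
            "file may be truncated or use wrapped FASTQ"
        )
    return n_lines // 4


if __name__ == "__main__":
    # Example: python count_reads.py ultra_sim_ONT_homo_2M.fq.gz
    for fastq_path in sys.argv[1:]:
        print(fastq_path, count_fastq_reads(fastq_path))
```

Streaming line by line keeps memory use flat, which matters for files of this size.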

Lastly, two simulation files were not uploaded because of their size: Mouse.ONT.R10.4.simulated.shuffled.fastq.gz and human_NA12878_cDNA_Bham1_guppy_reads.fastq.gz. Mouse.PB.simulated.shuffled.fastq.gz and Mouse.ONT.R10.4.simulated.shuffled.fastq.gz are the simulations performed in Prjibelski et al. 2023 (https://doi.org/10.1038/s41587-022-01565-y), converted from BAM to FASTQ with samtools and shuffled with bbtools. The NanoSim pretrained model human_NA12878_cDNA_Bham1_guppy was used to generate the 10 million transcriptomic reads stored in human_NA12878_cDNA_Bham1_guppy_reads.fastq.gz.

Files (19.6 GB)

Checksum Size
md5:032246232805aa7bd4e3cacaee9d1ac9 6.0 GB
md5:afcfdbd377bc468b9134ed1356ef11b7 5.6 GB
md5:4b46c140826f8fc509cd0d852809735a 1.1 GB
md5:d8f87a9517ed1d4816ec8fac370480a4 1.2 GB
md5:2efeb3e5155dbd44cbc87e7c651032c9 1.1 GB
md5:f90ccb8e05d68c4f9d8463988e16358e 1.1 GB
md5:f311fb893418a37366a2b5197077a666 1.1 GB
md5:963577743ee6077c3685b610f2166787 1.1 GB
md5:15a7a2fc7fe4f094f868cbf8a0eb31f2 1.2 GB
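
The md5 checksums listed above can be used to confirm that a download completed intact. The following is a minimal sketch under stated assumptions: the helper names md5_of_file and matches_listed_checksum are hypothetical, and which checksum belongs to which file has to be taken from the record page itself, since the mapping is not reproduced here.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, matches_listed_checksum("downloaded_file.fq.gz", "md5:032246232805aa7bd4e3cacaee9d1ac9") returns True only if the local file hashes to that value; the file name here is a placeholder.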

Additional details

Software