Published November 17, 2021
| Version v2
Dataset
Open
Salmon Softcliping Tutorial - Prokaryotic Dataset
Description
This is a paired-end short read simulation dataset of Salmonella along with two different representations of Salmonella transcriptome. "genes.fasta" is the regular gene/CDS-based transcriptome that is commonly used in prokaryotic expression quantification. "operons.fasta" is an operon-based transcriptome in which the sequences of the co-transcribed genes are concatenated together to form a united operon sequence. The gene annotation and gene to operon assignments are provided in "salmonella_operons.gff" file.