Datasets for the Carpentry-style RNA-seq lesson
Description
Lesson files
To unpack any of the `.tar.gz` archives, open a shell and run `tar -xzvf myarchive.tar.gz`, replacing `myarchive.tar.gz` with the name of the archive.
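As a minimal sketch of the extraction step, the helper below unpacks an archive into its own directory so that files from different archives do not get mixed up. The function name `extract_archive` and the destination directory `lesson_files` are illustrative assumptions, not part of the lesson material.

```shell
# Hypothetical helper: extract a .tar.gz archive into a chosen directory.
# $1 = archive path, $2 = destination directory (defaults to the current one).
extract_archive() {
    mkdir -p "${2:-.}" || return 1
    # -x extract, -z decompress with gzip, -f archive file, -C target directory
    tar -xzf "$1" -C "${2:-.}"
}

# Example usage (assumes the archive has already been downloaded):
# extract_archive bioinformatic_tutorial_files.tar.gz lesson_files
```

Extracting into a dedicated directory is only a convenience; running the plain `tar` command in the download folder works just as well.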
1) Bioinformatic files: bioinformatic_tutorial_files.tar.gz
This archive contains the following datasets:
FASTQ files from Arabidopsis leaf RNA-seq:
- Arabidopsis_sample1.fq.gz
- Arabidopsis_sample2.fq.gz
- Arabidopsis_sample3.fq.gz
- Arabidopsis_sample4.fq.gz
Arabidopsis thaliana genome assembly and genome annotation:
- AtChromosome1.fa.gz
- ath_annotation.gff3.gz
The sequencing adapter sequences are provided in adapters.fasta.
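After extraction, a quick sanity check is to count the reads in each FASTQ file. The sketch below assumes the standard four-line FASTQ record layout (no wrapped sequence lines); the function name `count_fastq_reads` is hypothetical.

```shell
# Hypothetical helper: count the reads in a gzip-compressed FASTQ file.
# ASSUMPTION: each read occupies exactly four lines (header, sequence, '+', qualities).
count_fastq_reads() {
    # Decompress to stdout, count lines, divide by four lines per read.
    echo $(( $(gzip -dc "$1" | wc -l) / 4 ))
}

# Example usage on the lesson files (assumes they are in the current directory):
# for f in Arabidopsis_sample*.fq.gz; do
#     printf '%s\t%s\n' "$f" "$(count_fastq_reads "$f")"
# done
```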
2) Gene counts usable with DESeq2 and R: tutorial.tar.gz
This archive contains the following datasets:
- raw_counts.csv: a dataframe of the raw counts for each sample. It is a comma-separated values (CSV) file, so fields are separated by commas ','.
- samples_to_conditions.csv: a dataframe that indicates the correspondence between samples and experimental conditions (e.g. control, treated).
- differential_genes.csv: a dataframe that contains the result of the DESeq2 analysis, obtained by specifying this contrast in the `DESeq2::results()` function: `contrast = c("infected", "Pseudomonas_syringae_DC3000", "mock")`
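Before loading the counts and the sample table into R, it can help to confirm that both files refer to the same samples. The sketch below is one possible check and rests on assumptions about the file layout that the description does not confirm: that the header of raw_counts.csv holds a gene identifier column followed by one column per sample, and that samples_to_conditions.csv has a header row with sample names in its first column. The function name `check_sample_names` is hypothetical.

```shell
# Hypothetical helper: check that two CSV files refer to the same set of samples.
# ASSUMPTIONS (not confirmed by the dataset description):
#   - counts file ($1): header row = gene identifier column, then one column per sample
#   - samples file ($2): header row, then sample names in the first column
check_sample_names() {
    # Sample names from the counts header: strip quotes/CR, one name per line, skip first field.
    counts_samples=$(head -n 1 "$1" | tr -d '"\r' | tr ',' '\n' | tail -n +2 | sort)
    # Sample names from the first column of the samples file, skipping its header row.
    design_samples=$(tail -n +2 "$2" | tr -d '"\r' | cut -d ',' -f 1 | sort)
    if [ "$counts_samples" = "$design_samples" ]; then
        echo "OK: sample names match"
    else
        echo "MISMATCH: sample names differ" >&2
        return 1
    fi
}

# Example usage (assumes both files are in the current directory):
# check_sample_names raw_counts.csv samples_to_conditions.csv
```

The comparison ignores sample order; if the column layout of the real files differs from the assumptions above, adjust the field numbers accordingly.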
The raw_counts.csv file was obtained by running version `v0.1.1` of an RNA-Seq bioinformatic pipeline on the mRNA-Seq sequencing files from Vogel et al. (2016): https://www.ebi.ac.uk/ena/data/view/PRJEB13938.
Please read the original study (Vogel et al. 2016): https://nph.onlinelibrary.wiley.com/doi/full/10.1111/nph.14036
====
Exercise files
1) NASA spaceflight
In the NASA GeneLab experiment GLDS-38, transcriptomics and proteomics were performed on Arabidopsis seedlings exposed to microgravity by sending them to the International Space Station (ISS).
The raw counts, scaled counts and sample-to-conditions files are available in the ZIP archive.
2) Deforges 2019 hormone treatments: deforges_2019.tar.gz
This archive contains:
- arabidopsis_root_hormones_raw_counts.csv
- arabidopsis_shoot_hormones_raw_counts.csv
- arabidopsis_root_hormones_sample2condition.csv
- arabidopsis_shoot_hormones_sample2condition.csv
- dataset01_IAA_arabidopsis_root_raw_counts.csv
- dataset02_ABA_arabidopsis_root_raw_counts.csv
- dataset03_ACC_arabidopsis_root_raw_counts.csv
- dataset04_MeJA_arabidopsis_root_raw_counts.csv
The arabidopsis_root_hormones_raw_counts.csv file contains all gene counts from all hormones. Separate datasets were made for each hormone for convenience.
Files
Araport11_genes_ko.txt
Files (138.4 MB)
| MD5 checksum | Size |
|---|---|
| 3194e27d3339ab39470be50a7994c4dd | 736.5 kB |
| 56eaeb64d39ec90605c378409310246b | 104.2 MB |
| ced2a569fc52c31653be573161ddee1a | 3.1 MB |
| dac4bf0e6cb0d09b0ddad99550bf6562 | 28.1 MB |
| 649076fe36315124923e573b69481190 | 2.2 MB |
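The md5 checksums listed above can be used to confirm that a download is complete and uncorrupted. The sketch below assumes GNU coreutils `md5sum` is available (on macOS, `md5 -q` is the usual equivalent); the function name `verify_md5` and the file name in the usage line are hypothetical.

```shell
# Hypothetical helper: compare a file's MD5 checksum with an expected value.
# ASSUMPTION: GNU coreutils `md5sum` is installed.
# $1 = file path, $2 = expected checksum (32 hexadecimal characters).
verify_md5() {
    actual=$(md5sum "$1" | cut -d ' ' -f 1)
    if [ "$actual" = "$2" ]; then
        echo "OK: $1"
    else
        echo "FAILED: $1 (got $actual)" >&2
        return 1
    fi
}

# Example usage: take the expected value from the file listing of the record.
# verify_md5 downloaded_file.tar.gz <md5-from-the-listing>
```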