Published November 30, 2022 | Version 1.0.0
Dataset Open

16S rRNA sequencing gene datasets for CRC data

  • 1. Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, Madrid, Spain.

Description

Used datasets: 

Dataset 16S rRNA Region Control (n) Adenoma (n) CRC (n) Available metadata
Baxter V4 171 198 120 Gender, age, weight, height, BMI, country, race
Zackular V4 30 30 30 Gender, age, weight, height, BMI, country, race, FOBT, medication
Zeller V4 50 38 41 Gender, age, BMI, country, FOBT
TOTAL V4 251 266 191 All of the above
 

Data processing & sharing

All datasets were processed using qiime2 pipeline with DADA2 for Sequence quality control and feature table construction and SILVA database for taxonomic assignment, and then a phyloseq object was constructed.

  • Abundance table at genus level is in file genus.csv (Sample counts with NO filtering).
  • Clean metadata is in metadata.csv file (Countries: CA - Canada. USA - United States of America. FRA - France.)
  • Phyloseq object is in file physeq.RDS (Saved as an RDS object in R)

More information is here

 

Files

genus.csv

Files (1.1 MB)

Name Size Download all
md5:fbaaa8b884ba892cd4b3a3ece5098935
763.3 kB Preview Download
md5:0d62fc946388bbe5a86ea554340d1979
60.8 kB Preview Download
md5:b5a36c58f0a414d5e5ebc84c121e1bed
279.3 kB Download