Published January 21, 2019 | Version 1.0.1
Dataset Open

Simulated metagenomes with quality and abundance distributions derived from real samples

Description

Species abundances and quality values were derived from the following list of samples:

SAMEA2466896
SAMEA2466916
SAMEA2466952
SAMEA2466953
SAMEA2466965
SAMEA2466996
SAMEA2467015
SAMEA2467039
SAMEA2621010
SAMEA2621033
SAMEA2621107
SAMEA2621155
SAMEA2621229
SAMEA2621247
SAMEA2621300
SAMEA2622357

Reference abundances (.abund files) were generated using the mOTUs profiler.
Metagenomes were simulated with cMESSi using proGenomes' representative contigs for species and the aforementioned abundances. In cases where a ref_mOTU_v2 corresponded to more than one genome, the abundance of said ref_mOTU was distributed equally over all genomes.
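The equal split described above can be illustrated with a minimal sketch. This is not cMESSi code or the authors' pipeline; the function name, the dictionary layout and the identifiers in the example are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def distribute_abundance(motu_abundance, motu_to_genomes):
    """Split each ref_mOTU abundance equally over its genomes.

    motu_abundance: {ref_mOTU id: abundance}           (hypothetical layout)
    motu_to_genomes: {ref_mOTU id: [genome id, ...]}   (hypothetical layout)
    """
    genome_abundance = defaultdict(float)
    for motu, abundance in motu_abundance.items():
        genomes = motu_to_genomes.get(motu, [])
        if not genomes:
            # Assumption of this sketch: mOTUs without a known genome are skipped.
            continue
        share = abundance / len(genomes)
        for genome in genomes:
            genome_abundance[genome] += share
    return dict(genome_abundance)


# Illustrative identifiers only; they do not come from the dataset.
example = distribute_abundance(
    {"motu_1": 0.6, "motu_2": 0.4},
    {"motu_1": ["genome_a", "genome_b"], "motu_2": ["genome_c"]},
)
```

In this example the abundance of "motu_1" is shared by two genomes, while "motu_2" passes its whole abundance to its single genome, so the total abundance is preserved.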
GFF location files were produced using location information generated by cMESSi.
Two variants of truth values were obtained by intersecting coordinates of simulated reads with coordinates of eggNOG orthologous groups (OG at NOG level) as predicted by eggNOG-mapper.

  1. .cog-simulated files contain the NOG distribution that was effectively simulated, i.e. a count of the number of reads overlapping with genes annotated with each NOG. A read overlapping multiple genes is counted once for each gene. If a gene possesses multiple NOG annotations, each annotation is assigned the total number of overlapping reads. Longer genes will (in expectation) generate more reads, all else being equal.
  2. The .cog-distribution file contains the expected distribution of every NOG across all samples. The number of genes annotated with each NOG is multiplied by the abundance of the corresponding species. Gene length is not taken into account.
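The two counting rules above can be sketched as follows. This is a rough illustration under stated assumptions, not the code used to build the dataset: the tuple layouts, function names and example values are hypothetical, coordinates are treated as closed intervals, and summing contributions over species in the second function is an assumption of this sketch.

```python
from collections import Counter


def overlaps(a_start, a_end, b_start, b_end):
    """Return True if two closed intervals share at least one position."""
    return a_start <= b_end and b_start <= a_end


def count_simulated(reads, genes):
    """Count reads per NOG (the .cog-simulated rule).

    reads: [(contig, start, end), ...]
    genes: [(contig, start, end, [NOG, ...]), ...]
    A read is counted once per overlapping gene, and every NOG
    annotation of that gene receives the count.
    """
    counts = Counter()
    for r_contig, r_start, r_end in reads:
        for g_contig, g_start, g_end, nogs in genes:
            if r_contig == g_contig and overlaps(r_start, r_end, g_start, g_end):
                for nog in nogs:
                    counts[nog] += 1
    return counts


def expected_distribution(genes_by_species, species_abundance):
    """Expected value per NOG (the .cog-distribution rule).

    genes_by_species: {species: [[NOG, ...] for each gene]}
    Each annotated gene contributes the abundance of its species;
    gene length is ignored.
    """
    expected = Counter()
    for species, genes in genes_by_species.items():
        abundance = species_abundance.get(species, 0.0)
        for nogs in genes:
            for nog in nogs:
                expected[nog] += abundance
    return expected


# Illustrative values only.
genes = [("c1", 1, 100, ["NOG_A"]), ("c1", 90, 200, ["NOG_A", "NOG_B"])]
reads = [("c1", 50, 95), ("c1", 150, 180), ("c2", 1, 50)]
simulated = count_simulated(reads, genes)

expected = expected_distribution(
    {"sp1": [["NOG_A"], ["NOG_A", "NOG_B"]], "sp2": [["NOG_B"]]},
    {"sp1": 0.25, "sp2": 0.5},
)
```

The nested loop is quadratic in the number of reads and genes; for data of this size an interval index or a sorted sweep would be needed in practice. It is kept here because it makes the counting rule easy to read.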

If you use this dataset, please cite: NG-meta-profiler: fast processing of metagenomes using NGLess, a domain-specific language

Files

Files (26.9 GB)

Checksum Size
md5:84ccadea2f436c416e79c79d2e694b88 8.5 MB
md5:2a47d7082a3a7c42c0ba3a1342c5cd99 270.9 MB
md5:a7540187d118aefbd3b2cbb2ecfad743 161.5 kB
md5:4eb3f1f224b1a752d7a38df27ac6e7ec 733.8 MB
md5:6917fdaf2ebd70c9c3a6095a1138ff62 733.8 MB
md5:80ef0d0e7f04533d9887b492d3f12cf5 235.1 MB
md5:118b713a31873345ff83bca141c630d6 180.8 kB
md5:e7c09f0982c5a6de9fce87104c7c9af7 673.6 MB
md5:77548660ce677299303d24984980dab8 673.6 MB
md5:05a72e34942a84239a2a13f86f5b3e29 231.3 MB
md5:f6092f21e0c7540af18d2ff23751462c 195.5 kB
md5:37d4580cf17502016c60f6b87cdf7960 726.2 MB
md5:a2696fc39814706c89c4edd43dc7ff63 726.2 MB
md5:57235e08c0876d4c22296c757f10408c 228.7 MB
md5:70269122a76f297bac01fd8b5e58b136 189.9 kB
md5:0f4d58da19a9a494d5dc5cbbe764703f 732.0 MB
md5:d1334faeafb95cf26b017669accd10ba 732.0 MB
md5:a96eb4112d9a036a53a3199df3d4d82a 230.6 MB
md5:01bc6eb97c1eeac697b128b09d627bae 211.0 kB
md5:24b36ae144d5b0ec13656aeb44715d2b 619.8 MB
md5:35173df131c72229d87f7b9175caade3 619.8 MB
md5:4aa63f9e61ce26179539ff36bff4429a 230.1 MB
md5:a48ec6feb65cb48e23a077b92d74c567 224.5 kB
md5:a182d8a701a93dd6cb15ebcd1d6336ec 683.1 MB
md5:b641ed7decdeb48e377a16937984d4d5 683.1 MB
md5:d7242491bcd250663e17923050005d3a 232.4 MB
md5:a612d93ee5ed5e00149bf74c7013214e 195.3 kB
md5:deb192f5b1789a6cc7acbf62d846874e 594.4 MB
md5:3a1369b18596a633df2ffe870529ac06 594.4 MB
md5:d5316a7c27b401da8138d86e2ce2791b 228.2 MB
md5:847f09a9b47ea93a86dc02d6d02f470e 187.3 kB
md5:c2744f4f8f8e9f629791d12ec98d3f2b 613.9 MB
md5:18cfde22522492b1927b3b3c2f61d61a 613.9 MB
md5:a22b7f88c96d399337233b19e9ac7ce7 230.7 MB
md5:9cb749447376a1431869fc408df9dad6 218.4 kB
md5:fe4305038654c83d9f7e9a84e093c208 730.7 MB
md5:49d0b415c3acf9f9a58868d56e54eabb 730.7 MB
md5:e4e1785621a6a8c47e792f5bac8549c1 233.0 MB
md5:08caeaf795969d72370e96e881249280 161.1 kB
md5:8697145aafef49568d9805cbb6621edd 837.3 MB
md5:f4b5c6019f13a17f0924095237b27823 837.3 MB
md5:2aebb10412e07074852899e3581a0dfa 236.6 MB
md5:079db1ea8ffdf232daac85dd178cd599 192.5 kB
md5:686a437ca551678f79c5adac4ae5613c 757.0 MB
md5:7b634b34181593f114aed538836a0134 757.0 MB
md5:3c05a49ae72c7df87589ea036b662e63 227.3 MB
md5:4c6243ad5d6f723f77cfd9fa55a6f3b0 147.6 kB
md5:cc97a145a7c2ab8ec98c7e8fc11bbc01 741.6 MB
md5:90edb4a814be5840584e5813643d6a5b 741.6 MB
md5:a171738cbf950cac7ca293f49d0e75ae 224.4 MB
md5:741e45209dbe8490682aedf5883e203b 212.3 kB
md5:e47d59cf0f92973a33d74d827db9b5f4 714.7 MB
md5:414ab10d6f66012184022e2382e42f99 714.7 MB
md5:4060765e0c2ee9306f7fb31e0d9a9eb3 232.2 MB
md5:39196cb819517ad8b3d04966e16e9765 234.3 kB
md5:96c6573cb97d85fcd92c058fb3491eda 713.3 MB
md5:c4796b9554f0e41a36194205e0a967e7 713.3 MB
md5:6a67445b9269b13f52145a2aaf1956b8 236.3 MB
md5:f05c5e3ac2d8a03547762a03d7cd4a17 203.6 kB
md5:1fe170f78e1bab21b9a3828417b9f6d2 726.6 MB
md5:bfa92684b515f484db1f4f1d38cac445 726.6 MB
md5:5944e6d5b564de10f1294a2a22d85cc2 227.0 MB
md5:8bd0187f89c47890f22d4f58f29958b7 113.9 kB
md5:809285bc423daee50b9fe610bf29be39 845.2 MB
md5:78c961f2345357ab07d0f74eaec80a5a 845.2 MB
md5:be408c401b2eadae517d7e84d4ff6e1b 222.2 MB
md5:12cbb64b77cf2e0b5566b2370f48dadd 3.0 kB
md5:e721d2f82f5a92a2ce32f8af0d3c2f1e 5.5 kB
md5:ea7b9a1abe9dd62504a43f50e89dfb8d 4.7 kB
md5:b63ff921c9d9890de9b6d188dfb2bb29 5.0 kB
md5:5d64939033cf22ae15c0d6206abef1d1 7.3 kB
md5:0199966e3792e4b735c3dbfff6d43a1c 8.1 kB
md5:3d3b65f50208c8e8ecbce63e2f7e6d2f 5.5 kB
md5:cff5dd9897b1df0f61deac88d981349a 4.4 kB
md5:856aae6cfd0b0377fec2412b77dde210 4.4 kB
md5:908edf64a960a57c552ba867022ca885 1.5 kB
md5:4fbb04870e99a70d60d4703184e949cf 4.8 kB
md5:e45edcf34eaeb00e8d53a65b44103c3d 2.9 kB
md5:65ff5406316aeb511e07be47255392c3 3.2 kB
md5:71695aaf713d78ae02290acae2159274 4.8 kB
md5:43ae18b7eb8569758d2cb6a77d4fdcaf 5.3 kB
md5:777fd4f2d1463b93cd510df3a38c5b82 785 Bytes
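
The md5 checksums listed above can be used to check a downloaded file for corruption. The following is a minimal sketch using only the Python standard library; the function names are hypothetical, and because file names are not shown in this listing, the path passed in must be matched to the right checksum by the user.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the md5 hex digest of a file, reading it in chunks
    so that multi-hundred-megabyte files do not need to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed_checksum):
    """Compare a file against a listed checksum such as 'md5:84cc...'.

    The 'md5:' prefix used in the listing is optional here.
    """
    expected = listed_checksum.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

A call such as matches_listing(local_path, "md5:777fd4f2d1463b93cd510df3a38c5b82") returns True only if the local file has that digest; local_path is a placeholder for wherever the file was saved.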

Additional details

Related works

Is referenced by
10.1101/367755 (DOI)

Funding

MicrobioS – Exploring the human gut microbiome at strain resolution 669830
European Commission
DD-DeCaF – Bioinformatics Services for Data-Driven Design of Cell Factories and Communities 686070
European Commission

References

  • Coelho LP, et al. NG-meta-profiler: fast processing of metagenomes using NGLess, a domain-specific language. bioRxiv. doi:10.1101/367755