Simulated metagenomes with quality and abundance distributions derived from real samples
Description
Species abundances and quality values were derived from the following list of samples:
SAMEA2466896
SAMEA2466916
SAMEA2466952
SAMEA2466953
SAMEA2466965
SAMEA2466996
SAMEA2467015
SAMEA2467039
SAMEA2621010
SAMEA2621033
SAMEA2621107
SAMEA2621155
SAMEA2621229
SAMEA2621247
SAMEA2621300
SAMEA2622357
Reference abundances (.abund files) were generated using mOTUs profiler.
Metagenomes were simulated with cMESSi using proGenomes' representative contigs for species and the aforementioned abundances. In cases where a ref_mOTU_v2 corresponded to more than one genome, the abundance of said ref_mOTU was distributed equally over all genomes.
GFF location files were produced using location information generated by cMESSi.
Two variants of truth values were obtained by intersecting coordinates of simulated reads with coordinates of eggNOG orthologous groups (OG at NOG level) as predicted by eggNOG-mapper.
- .cog-simulated files contain the NOG distribution that was effectively simulated, i.e. a count of the number of reads overlapping with genes annotated with each NOG. A read overlapping multiple genes is considered for each gene. If a gene possesses multiple NOG annotations, each annotation gets assigned the total number of overlapping reads. Longer genes will (in expectation) generate more reads, all else being equal.
- .cog-distribution file contains the expected distribution for every NOG on all samples. The number of genes annotated with each NOG is multiplied by the abundance of the corresponding species. Length of the gene is not taken into account.
If you use this dataset, please cite: NG-meta-profiler: fast processing of metagenomes using NGLess, a domain-specific language
Files
Files
(26.9 GB)
Name | Size | Download all |
---|---|---|
md5:84ccadea2f436c416e79c79d2e694b88
|
8.5 MB | Download |
md5:2a47d7082a3a7c42c0ba3a1342c5cd99
|
270.9 MB | Download |
md5:a7540187d118aefbd3b2cbb2ecfad743
|
161.5 kB | Download |
md5:4eb3f1f224b1a752d7a38df27ac6e7ec
|
733.8 MB | Download |
md5:6917fdaf2ebd70c9c3a6095a1138ff62
|
733.8 MB | Download |
md5:80ef0d0e7f04533d9887b492d3f12cf5
|
235.1 MB | Download |
md5:118b713a31873345ff83bca141c630d6
|
180.8 kB | Download |
md5:e7c09f0982c5a6de9fce87104c7c9af7
|
673.6 MB | Download |
md5:77548660ce677299303d24984980dab8
|
673.6 MB | Download |
md5:05a72e34942a84239a2a13f86f5b3e29
|
231.3 MB | Download |
md5:f6092f21e0c7540af18d2ff23751462c
|
195.5 kB | Download |
md5:37d4580cf17502016c60f6b87cdf7960
|
726.2 MB | Download |
md5:a2696fc39814706c89c4edd43dc7ff63
|
726.2 MB | Download |
md5:57235e08c0876d4c22296c757f10408c
|
228.7 MB | Download |
md5:70269122a76f297bac01fd8b5e58b136
|
189.9 kB | Download |
md5:0f4d58da19a9a494d5dc5cbbe764703f
|
732.0 MB | Download |
md5:d1334faeafb95cf26b017669accd10ba
|
732.0 MB | Download |
md5:a96eb4112d9a036a53a3199df3d4d82a
|
230.6 MB | Download |
md5:01bc6eb97c1eeac697b128b09d627bae
|
211.0 kB | Download |
md5:24b36ae144d5b0ec13656aeb44715d2b
|
619.8 MB | Download |
md5:35173df131c72229d87f7b9175caade3
|
619.8 MB | Download |
md5:4aa63f9e61ce26179539ff36bff4429a
|
230.1 MB | Download |
md5:a48ec6feb65cb48e23a077b92d74c567
|
224.5 kB | Download |
md5:a182d8a701a93dd6cb15ebcd1d6336ec
|
683.1 MB | Download |
md5:b641ed7decdeb48e377a16937984d4d5
|
683.1 MB | Download |
md5:d7242491bcd250663e17923050005d3a
|
232.4 MB | Download |
md5:a612d93ee5ed5e00149bf74c7013214e
|
195.3 kB | Download |
md5:deb192f5b1789a6cc7acbf62d846874e
|
594.4 MB | Download |
md5:3a1369b18596a633df2ffe870529ac06
|
594.4 MB | Download |
md5:d5316a7c27b401da8138d86e2ce2791b
|
228.2 MB | Download |
md5:847f09a9b47ea93a86dc02d6d02f470e
|
187.3 kB | Download |
md5:c2744f4f8f8e9f629791d12ec98d3f2b
|
613.9 MB | Download |
md5:18cfde22522492b1927b3b3c2f61d61a
|
613.9 MB | Download |
md5:a22b7f88c96d399337233b19e9ac7ce7
|
230.7 MB | Download |
md5:9cb749447376a1431869fc408df9dad6
|
218.4 kB | Download |
md5:fe4305038654c83d9f7e9a84e093c208
|
730.7 MB | Download |
md5:49d0b415c3acf9f9a58868d56e54eabb
|
730.7 MB | Download |
md5:e4e1785621a6a8c47e792f5bac8549c1
|
233.0 MB | Download |
md5:08caeaf795969d72370e96e881249280
|
161.1 kB | Download |
md5:8697145aafef49568d9805cbb6621edd
|
837.3 MB | Download |
md5:f4b5c6019f13a17f0924095237b27823
|
837.3 MB | Download |
md5:2aebb10412e07074852899e3581a0dfa
|
236.6 MB | Download |
md5:079db1ea8ffdf232daac85dd178cd599
|
192.5 kB | Download |
md5:686a437ca551678f79c5adac4ae5613c
|
757.0 MB | Download |
md5:7b634b34181593f114aed538836a0134
|
757.0 MB | Download |
md5:3c05a49ae72c7df87589ea036b662e63
|
227.3 MB | Download |
md5:4c6243ad5d6f723f77cfd9fa55a6f3b0
|
147.6 kB | Download |
md5:cc97a145a7c2ab8ec98c7e8fc11bbc01
|
741.6 MB | Download |
md5:90edb4a814be5840584e5813643d6a5b
|
741.6 MB | Download |
md5:a171738cbf950cac7ca293f49d0e75ae
|
224.4 MB | Download |
md5:741e45209dbe8490682aedf5883e203b
|
212.3 kB | Download |
md5:e47d59cf0f92973a33d74d827db9b5f4
|
714.7 MB | Download |
md5:414ab10d6f66012184022e2382e42f99
|
714.7 MB | Download |
md5:4060765e0c2ee9306f7fb31e0d9a9eb3
|
232.2 MB | Download |
md5:39196cb819517ad8b3d04966e16e9765
|
234.3 kB | Download |
md5:96c6573cb97d85fcd92c058fb3491eda
|
713.3 MB | Download |
md5:c4796b9554f0e41a36194205e0a967e7
|
713.3 MB | Download |
md5:6a67445b9269b13f52145a2aaf1956b8
|
236.3 MB | Download |
md5:f05c5e3ac2d8a03547762a03d7cd4a17
|
203.6 kB | Download |
md5:1fe170f78e1bab21b9a3828417b9f6d2
|
726.6 MB | Download |
md5:bfa92684b515f484db1f4f1d38cac445
|
726.6 MB | Download |
md5:5944e6d5b564de10f1294a2a22d85cc2
|
227.0 MB | Download |
md5:8bd0187f89c47890f22d4f58f29958b7
|
113.9 kB | Download |
md5:809285bc423daee50b9fe610bf29be39
|
845.2 MB | Download |
md5:78c961f2345357ab07d0f74eaec80a5a
|
845.2 MB | Download |
md5:be408c401b2eadae517d7e84d4ff6e1b
|
222.2 MB | Download |
md5:12cbb64b77cf2e0b5566b2370f48dadd
|
3.0 kB | Download |
md5:e721d2f82f5a92a2ce32f8af0d3c2f1e
|
5.5 kB | Download |
md5:ea7b9a1abe9dd62504a43f50e89dfb8d
|
4.7 kB | Download |
md5:b63ff921c9d9890de9b6d188dfb2bb29
|
5.0 kB | Download |
md5:5d64939033cf22ae15c0d6206abef1d1
|
7.3 kB | Download |
md5:0199966e3792e4b735c3dbfff6d43a1c
|
8.1 kB | Download |
md5:3d3b65f50208c8e8ecbce63e2f7e6d2f
|
5.5 kB | Download |
md5:cff5dd9897b1df0f61deac88d981349a
|
4.4 kB | Download |
md5:856aae6cfd0b0377fec2412b77dde210
|
4.4 kB | Download |
md5:908edf64a960a57c552ba867022ca885
|
1.5 kB | Download |
md5:4fbb04870e99a70d60d4703184e949cf
|
4.8 kB | Download |
md5:e45edcf34eaeb00e8d53a65b44103c3d
|
2.9 kB | Download |
md5:65ff5406316aeb511e07be47255392c3
|
3.2 kB | Download |
md5:71695aaf713d78ae02290acae2159274
|
4.8 kB | Download |
md5:43ae18b7eb8569758d2cb6a77d4fdcaf
|
5.3 kB | Download |
md5:777fd4f2d1463b93cd510df3a38c5b82
|
785 Bytes | Download |
Additional details
Related works
- Is referenced by
- 10.1101/367755 (DOI)
Funding
References
- Coelho LP, et al. NG-meta-profiler: fast processing of metagenomes using NGLess, a domain-specific language, biorXiv doi:10.1101/367755