Published September 3, 2024 | Version 1.0
Dataset Open

Lemonade Creek, Yellowstone National Park, USA - Microbial Community Analysis - Cyanidiophyceae genome data for HGT analysis

  • 1. Rutgers, The State University of New Jersey

Description

This dataset consists of 12 metagenome samples, each collected from one of three environments in Yellowstone National Park:

  • 4 samples (numbered 1, 2, 3, 4) are from the "CreekBiofilm" environment.
  • 4 samples (numbered 1, 2, 3, 4) are from the "Endolithic" environment.
  • 4 samples (numbered 1, 2, 3, 4) are from the "Soil" environment.

We found two species of Cyanidiophyceae in these samples: one *Galdieria sulphuraria* (the `*Gsulp*` files) and one *Cyanidioschyzon merolae* (the `*Cmer*` files). For each species, I extracted contigs from the metagenome assembly if >=10% of their length was covered by hits with >90% identity to the respective reference genome. The majority of contigs have >90% hit coverage; however, to avoid removing contigs with novel sequences (arising via HGT or other processes), I used a lenient threshold of 10%. The file names indicate which sample the contigs are from and which of the two Cyanidiophyceae species they putatively belong to.

NOTE: there are very few predicted proteins in the `YNP_CreekBiofilm_*_Gsulp*` files. This environment is completely dominated by the other algal species, so we recovered very few *G. sulphuraria* contigs from it.
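The contig selection rule described above can be illustrated with a minimal sketch. It is not the pipeline used to build this dataset: the alignment tool, the hit format, and all function and variable names here are assumptions made for illustration. Hits are assumed to be tuples of (contig ID, start, end, percent identity) with 1-based inclusive coordinates on the contig, and overlapping hits are merged so that each base is counted only once.

```python
from collections import defaultdict

MIN_IDENTITY = 90.0  # hits must have >90% identity (threshold from the description)
MIN_COVERAGE = 0.10  # keep a contig if >=10% of its length is covered by such hits


def covered_fraction(intervals, contig_length):
    """Fraction of a contig covered by the union of 1-based inclusive intervals."""
    covered = 0
    current_start = current_end = None
    for start, end in sorted(intervals):
        if current_end is None or start > current_end + 1:
            # Disjoint from the running interval: bank it and start a new one.
            if current_end is not None:
                covered += current_end - current_start + 1
            current_start, current_end = start, end
        else:
            # Overlapping or adjacent: extend the running interval.
            current_end = max(current_end, end)
    if current_end is not None:
        covered += current_end - current_start + 1
    return covered / contig_length


def select_contigs(hits, contig_lengths):
    """Return the IDs of contigs that pass the identity and coverage thresholds.

    hits: iterable of (contig_id, start, end, percent_identity)  [assumed format]
    contig_lengths: dict mapping contig_id -> contig length in bp
    """
    by_contig = defaultdict(list)
    for contig_id, start, end, identity in hits:
        if identity > MIN_IDENTITY:
            # Reverse-strand alignments may be reported with start > end.
            by_contig[contig_id].append((min(start, end), max(start, end)))
    return {
        contig_id
        for contig_id, intervals in by_contig.items()
        if covered_fraction(intervals, contig_lengths[contig_id]) >= MIN_COVERAGE
    }


# Hypothetical toy data, not taken from the dataset.
hits = [
    ("contig_1", 1, 100, 95.0),
    ("contig_1", 51, 150, 92.0),   # overlaps the previous hit
    ("contig_1", 500, 600, 85.0),  # identity too low, ignored
    ("contig_2", 1, 50, 99.0),     # only 5% of the contig covered
]
lengths = {"contig_1": 1000, "contig_2": 1000}
kept = select_contigs(hits, lengths)
print(sorted(kept))  # ['contig_1']
```

In this toy example, contig_1 is kept because 150 of its 1,000 bases (15%) are covered by high-identity hits, while contig_2 is dropped at 5%. Most of a retained contig can lack similarity to the reference, which is the property the lenient 10% threshold is meant to preserve.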

Files

Files (413.7 MB)

Checksum                              Size
md5:70be8da0a777d72b5ac79f36dd9806b2  21.5 MB
md5:a2377bbc5dff4ac1e123201e3f24eb7d  111.4 kB
md5:717b8930b9d67b03b4bba041d0f2d66b  22.6 MB
md5:58e761e9607e8dba5af8a02af35c4d8a  238.6 kB
md5:f4c15bc413fbcd3590c27caad26ba25c  23.2 MB
md5:a81d4022609ff6eb8296a624816176a7  1.2 MB
md5:33d78aa746cdb95ddbdd2947bb91d1da  26.0 MB
md5:1a718b6d37bdca212334787d78849c9a  323.4 kB
md5:053c6a303a0ea917680dc28ad5bfb8df  22.6 MB
md5:078142e24b13955a3b2a7b6c5e9febf2  14.9 MB
md5:1c0c3e4c0dcc078ca92f2309368376fe  27.7 MB
md5:b7a01619f8fb0367b953e3be64188dab  13.3 MB
md5:ea8dddb9051d3c8014c1ee092ba92fef  26.4 MB
md5:2fd6c21c3ebb632e863c433ce81a551c  17.9 MB
md5:1f0328844bb2bee4437d27f985eb0ea8  26.3 MB
md5:0e7fa1b7cd5993c611ea7a047645d573  17.9 MB
md5:6fd86b20dae59dffe75dc1d292c1b6e0  19.9 MB
md5:bcfb2798d068c6a8c32189cc4493d057  18.8 MB
md5:8d0d33c0a1bcdd74bdf39e0c06e3a58d  20.0 MB
md5:20189bb2db5a55798c0ee94ec6c10d55  17.8 MB
md5:e95ff44cc939bfbb50d4f686c0549d32  20.0 MB
md5:ee45ad42c56ef801875d4934d3a3ade4  18.0 MB
md5:3554d10092d93ebf67e5d8893940b061  19.7 MB
md5:15b99cf39a0252ac2b41bcb3c1d59f55  17.0 MB
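
The MD5 checksums listed above can be used to confirm that a downloaded file is intact. The following is a minimal sketch using only the Python standard library. The function names are hypothetical, and the file path passed in is whatever local name the download was saved under.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 to a listed value such as 'md5:70be8da0...'."""
    expected = expected.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

For example, `matches_checksum(local_path, "md5:70be8da0a777d72b5ac79f36dd9806b2")` returns `True` only if the file saved at `local_path` is byte-for-byte identical to the first file in the listing.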