Published April 9, 2019 | Version v1
Journal article Open

Red Sea SAR11 and Prochlorococcus Single-cell Genomes Reflect Globally Distributed Pangenomes

  • 1. Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration

Description

The Red Sea is geographically isolated from the rest of the ocean and has a combination of high irradiance, high temperature, and high salinity that is unique in the world's oceans; we therefore asked whether it harbors endemic gene content. We sequenced and assembled single-cell genomes of 21 SAR11 (subclades Ia, Ib, Id, II) and 5 Prochlorococcus (ecotype HLII) cells from the Red Sea and combined them with globally sourced reference genomes to cluster genes into ortholog groups (OGs) using OrthoMCL (version 2.0). The OrthoMCL configuration settings were percentMatchCutoff=50 and evalueExponentCutoff=-5. This yielded 5272 SAR11 OGs and 10439 Prochlorococcus OGs. This archive contains four files: the protein identifiers associated with each OG (proch_ortholog_groups.txt, sar11_ortholog_groups.txt) and the protein sequences for each protein identifier (proch_protein_sequences.fasta, sar11_protein_sequences.fasta).
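As a rough illustration of how the two file types relate, the sketch below maps each OG to its protein identifiers and then looks up the corresponding sequences. It assumes the ortholog group files follow the usual OrthoMCL layout (one group per line, `GROUP_ID: id1 id2 ...`) and that each FASTA header begins with the protein identifier; neither detail is stated in this record, so check the files before relying on it. The function names and the toy identifiers (`hypA|p1` and so on) are hypothetical.

```python
import io


def parse_ortholog_groups(handle):
    """Return {group_id: [protein_id, ...]}.

    Assumption: OrthoMCL-style lines of the form 'GROUP_ID: id1 id2 id3'.
    """
    groups = {}
    for line in handle:
        line = line.strip()
        if not line:
            continue
        # Split on the first colon only; identifiers may contain '|'.
        group_id, _, members = line.partition(":")
        groups[group_id.strip()] = members.split()
    return groups


def parse_fasta(handle):
    """Return {protein_id: sequence}, using the first header token as the id."""
    seqs = {}
    current = None
    chunks = []
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            if current is not None:
                seqs[current] = "".join(chunks)
            tokens = line[1:].split()
            current = tokens[0] if tokens else ""
            chunks = []
        elif line and current is not None:
            chunks.append(line)
    if current is not None:
        seqs[current] = "".join(chunks)
    return seqs


def sequences_for_group(group_id, groups, seqs):
    """Return {protein_id: sequence} for one OG, skipping ids with no sequence."""
    return {pid: seqs[pid] for pid in groups.get(group_id, []) if pid in seqs}


# Toy data (made up for illustration, not taken from the archive).
toy_groups = io.StringIO("OG0001: hypA|p1 hypB|p2\nOG0002: hypA|p3\n")
toy_fasta = io.StringIO(
    ">hypA|p1 toy protein\nMKT\nLLV\n>hypB|p2\nMSS\n>hypA|p3\nMAA\n"
)

groups = parse_ortholog_groups(toy_groups)
seqs = parse_fasta(toy_fasta)
og1 = sequences_for_group("OG0001", groups, seqs)
print(og1)
```

With the real files, the `io.StringIO` objects would be replaced by `open("sar11_ortholog_groups.txt")` and `open("sar11_protein_sequences.fasta")` (or the `proch_` equivalents). The FASTA parser holds all sequences in memory, which should be acceptable at these file sizes.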

Files

Files (84.8 MB)

Size      Checksum
5.1 MB    md5:f57ac196d307c0973d753eb576e82a0a
61.8 MB   md5:4b5087898ae45b8e329aa1f081772436
1.4 MB    md5:4cdb3b082b90ee8889eae800a146d126
16.5 MB   md5:2addedefbaa3a3156b6724fea02e40ee
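The md5 values listed above can be used to confirm that a download is intact. The following is a minimal sketch using Python's standard `hashlib`; the helper names are hypothetical, and which checksum belongs to which file is not stated in this listing, so the pairing has to be taken from the record itself.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, listed):
    """Compare a file against a listed checksum such as 'md5:abc123...'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

For example, `matches_listed_md5(local_path, "md5:...")` returns `True` when the local copy matches the checksum shown for that file.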