Published June 5, 2023 | Version v2023.153
Dataset Open

Data from Matrishin et al. "Phages are important unrecognized players in the ecology of the oral pathogen Porphyromonas gingivalis"

  • 1. University at Buffalo

Description

Data files associated with Matrishin et al. "Phages are important unrecognized players in the ecology of the oral pathogen Porphyromonas gingivalis".

Please see the table in 00.README.xlsx for key information regarding nomenclature. We caution that the same locus tag identifiers refer to different genes in the Zenodo files than in NCBI. This difference arose because the same locus tag prefixes were used both for the in-house Bakta gene calls underlying the manuscript analyses and for the PGAP annotations ultimately performed upon submission of the assemblies to GenBank. The Supplementary Data Files submitted with the manuscript (see below) were updated to the final GCA and distinct locus tag identifiers. The Zenodo-deposited files could not readily be updated in the same way because of the complexity of some of the included file types, so all files in the Zenodo set retain the nomenclature used in the original in-house Bakta-based analyses.

Directory Contents
00.README Important information regarding nomenclature differences across data types.
01.bax.bakta Results of Bakta annotation of 88 Pg genomes.
02.bax.ppanggolin Results of PPanGGOLiN pangenome analysis of 88 Pg genomes.
03.bax.combo Results of multiple analyses used to inform identification and curation of prophages in Pg genomes, provided as bacterial genome FASTA and GFF files that can be uploaded together to genome viewer tools (e.g. Geneious) and visualized as tracks. Note that these do not include final prophage calls.
04.phage.genomes Pg phage genomes in FASTA format.
05.phage.prots Pg phage proteins in FASTA format. Clipped proteins at the beginnings and ends of genomes are excluded.
06.phage.gbs Pg phage information in GenBank format. Clipped proteins at the beginnings and ends of genomes are excluded.
07.phage.families.virclust Results of VirClust analysis used to inform resolution of family-level units.
08.phage.families.victor Results of VICTOR analysis used to inform resolution of family-level units.
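
Several of the directories above hold FASTA files. As a minimal sketch of how such a file could be read without extra dependencies, the following Python generator yields one (header, sequence) pair per record. The function name read_fasta and the path in the usage note are illustrative assumptions, not part of the deposited data, and the sketch assumes plain uncompressed text FASTA.

```python
def read_fasta(path):
    """Yield (header, sequence) pairs from a plain-text FASTA file.

    Hypothetical helper for illustration; assumes an uncompressed file
    in which each record starts with a '>' header line.
    """
    header, chunks = None, []
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # skip blank lines between records
            if line.startswith(">"):
                if header is not None:
                    yield header, "".join(chunks)
                header, chunks = line[1:], []
            elif header is not None:
                chunks.append(line)  # sequence may wrap over several lines
    if header is not None:
        yield header, "".join(chunks)
```

For example, `for name, seq in read_fasta("some_phage_genome.fasta"): print(name, len(seq))` would list record names and lengths, where the file name is a placeholder. Identifiers read this way follow the in-house Bakta nomenclature described above, not the NCBI locus tags.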

 

Files

01.bax.bakta.zip

Files (2.9 GB)

Checksum Size
md5:816be6ee14bf51917c36c88e4d863ec2 13.2 kB
md5:b52e96b83dbd18b6e53674c574227ff4 814.0 MB
md5:dcc55637c6e6733a468a3df842ccb7ae 120.7 MB
md5:05b266f3ad77617ef2e589691bf070a1 2.0 GB
md5:f506bd2e432c53241d60a2e0db1eebfb 429.9 kB
md5:ac3da00fb60531770b22ac721ca34d83 283.3 kB
md5:dcb0101ed275e8a9333c24545b1f37c7 724.3 kB
md5:829f4577b5f27786b757424023f00a76 15.2 MB
md5:9a598e3f94595834c663cded6e86928a 81.9 kB
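
The MD5 checksums listed above can be used to confirm that a download completed intact. The following is a minimal sketch using Python's standard hashlib module. The helper names md5_of_file and matches_listed_md5 are illustrative assumptions, and because this listing does not show which file name goes with which checksum, the path in the usage comment is a placeholder.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, listed):
    """Compare a file against a checksum written as 'md5:<hex>' or bare hex."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected


# Usage (placeholder path; substitute the downloaded file and its listed checksum):
# ok = matches_listed_md5("path/to/downloaded_file.zip", "md5:<checksum from the list>")
```

Reading in chunks matters here because the largest archive in the set is about 2.0 GB.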

Additional details

Related works

Is supplement to
Preprint: 10.1101/2022.12.30.519816 (DOI)