Dataset Open Access

Functional Association of Prokaryotic Virus Orthologous Groups: A Proof of Concept - Data

Pappas, Nikolaos; Dutilh, E. Bas

MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="">
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <controlfield tag="005">20210304002726.0</controlfield>
  <controlfield tag="001">4576599</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Utrecht University</subfield>
    <subfield code="0">(orcid)0000-0003-2329-7890</subfield>
    <subfield code="a">Dutilh, E. Bas</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">673258119</subfield>
    <subfield code="z">md5:fd79a4325cb648b53040add1dbc1c14c</subfield>
    <subfield code="u"></subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2021-03-03</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="o"></subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Utrecht University</subfield>
    <subfield code="0">(orcid)0000-0002-8540-4324</subfield>
    <subfield code="a">Pappas, Nikolaos</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Functional Association of Prokaryotic Virus Orthologous Groups: A Proof of Concept - Data</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u"></subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2"></subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;Raw data required to reproduce all intermediate results and thereby the final predictions.&lt;/p&gt;

&lt;p&gt;These are used in the accompanying Snakemake pipeline.&lt;/p&gt;

&lt;p&gt;This archive is automatically downloaded and files are extracted when executing the pipeline.&lt;/p&gt;

&lt;p&gt;You can also download and extract the files manually.&lt;/p&gt;

&lt;p&gt;Included files are:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;data/genomes/phages_refseq.fasta: A multi-FASTA file with all phage genomes available on RefSeq on 14/01/2020&lt;/li&gt;
	&lt;li&gt;data/interactions/interactions.txt: A PSI-MI TAB file containing all interactions from IntAct, retrieved on 28/05/2019&lt;/li&gt;
	&lt;li&gt;data/pvogs/all.hmm: All HMMER profiles for pVOGs, retrieved on 13/01/2020&lt;/li&gt;
	&lt;li&gt;data/pvogs/VOGProteinTable.txt: pVOG occurrences in the genomes used in the pVOGs paper, along with their annotations, retrieved on 16/01/2020&lt;/li&gt;
	&lt;li&gt;data/taxonomy_db/taxa.sqlite.db: Taxonomy database created with the ete3 toolkit from data retrieved on 16/05/2019&lt;/li&gt;
	&lt;li&gt;data/taxonomy_db/taxa.traverse.pkl: Created automatically with ete3.&lt;/li&gt;
	&lt;li&gt;data/md5sums.txt: MD5 checksums for all files&lt;/li&gt;
&lt;/ul&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.4576598</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.4576599</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
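
The record lists an MD5 digest for the archive, and the archive itself ships a data/md5sums.txt file, so a manual download can be checked before use. The following is a minimal sketch of such a check, not part of the authors' pipeline. It assumes that data/md5sums.txt follows the usual `md5sum` output layout ("<digest>  <path>") and that its paths are relative to the directory passed as `root`. The function names are hypothetical.

```python
import hashlib
from pathlib import Path

# MD5 digest listed for the archive in this record (field 856, subfield z).
ARCHIVE_MD5 = "fd79a4325cb648b53040add1dbc1c14c"


def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def parse_md5sums(text):
    """Parse `md5sum`-style lines into a {path: digest} mapping.

    Assumption: each non-empty line is "<digest>  <path>".
    """
    expected = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        digest, name = line.split(maxsplit=1)
        # A leading "*" marks binary mode in md5sum output; drop it.
        expected[name.strip().lstrip("*")] = digest.lower()
    return expected


def find_mismatches(md5sums_file, root="."):
    """Return the listed paths that are missing or whose digest differs."""
    expected = parse_md5sums(Path(md5sums_file).read_text())
    bad = []
    for name, digest in expected.items():
        target = Path(root) / name
        if not target.is_file() or md5_of(target) != digest:
            bad.append(name)
    return bad
```

To check the downloaded archive, compare `md5_of(archive_path)` with `ARCHIVE_MD5`, where `archive_path` is wherever the file was saved. After extraction, `find_mismatches("data/md5sums.txt", root=...)` should return an empty list if every listed file is intact. The right value for `root` depends on how the paths in md5sums.txt are written.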