Dataset Open Access

Ortholog data from the tuatara genome project

Patricio, Mateus; Muffato, Matthieu; Rutherford, Kim Matthew; Gemmell, Neil J.

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:creator>Patricio, Mateus</dc:creator>
  <dc:creator>Muffato, Matthieu</dc:creator>
  <dc:creator>Rutherford, Kim Matthew</dc:creator>
  <dc:creator>Gemmell, Neil J.</dc:creator>
  <dc:description>This record contains orthology predictions based on the Maker gene annotation
of the tuatara (Sphenodon punctatus) and a set of 25 other species, using the
Ensembl methodology.

See for the Maker annotation in GFF

The files all_trees.emf.gz and all_homologies.tsv.gz contain the phylogenetic
trees (all_trees.emf.gz, in the EMF alignment format) and the derived pairwise
orthologies and paralogies (all_homologies.tsv.gz, in tabular format).

From the phylogenetic trees, sets of 1-to-1 orthologues across all 26 species
were extracted (pure_one2one_orthologies.txt).  Sets of orthologues that span
all the species but include paralogues were reduced to 1 copy per species
using gene order conservation and sequence similarity. This extra dataset is
available in promoted_one2one_orthologies.txt

The main Ensembl entry point for tuatara is:

This work is supported by Ngatiwai iwi, Allan Wilson Centre, University of
Otago, New Zealand Genomics Limited, Illumina, National eScience
Infrastructure (NeSI NZ).</dc:description>
  <dc:title>Ortholog data from the tuatara genome project</dc:title>
All versions This version
Views 249249
Downloads 123123
Data volume 8.2 GB8.2 GB
Unique views 224224
Unique downloads 101101


Cite as