Published September 20, 2025 | Version 0.0.22
Dataset Open

oggmap/orthomap - example data

  • 1. Max Planck Institute for Evolutionary Biology

Contributors

Project leader:

  • 1. Max Planck Institute for Evolutionary Biology

Description

Example data for python package oggmap/orthomap.

OrthoFinder:  Ensembl release-105 (-S diamond_ultra_sens)

Includes OrthoFinder results (-S diamond_ultra_sens) for all translated coding sequences (CDS) from Ensembl release-105 (keeping only longest isoforms) and Xtropicalisv9.0.Named.primaryTrs.pep.fa from www.xenbase.org:

  • <ensembl_105_orthofinder_Orthogroups.GeneCount.tsv.zip>
  • <ensembl_105_orthofinder_Orthogroups.tsv.zip>

Includes a table specifying the OrthoFinder species file names and its corresponding NCBI taxonomic IDs:

  • <ensembl_105_orthofinder_species_list.tsv>

Includes NCBI taxonomic tree for Ensembl release-105 species analysed:

  • <ensembl_105_species_tree_ncbi_topology_named.nw>
  • <ensembl_105_species_tree_ncbi_topology_named.pdf>

OrthoFinder:  Ensembl release-110 (-S last)

Includes OrthoFinder results (-S last) for all translated coding sequences (CDS) from Ensembl release-110 (keeping only longest isoforms) and Xtropicalisv9.0.Named.primaryTrs.pep.fa from www.xenbase.org:

  • <ensembl_110_orthofinder_last_Orthogroups.GeneCount.tsv.zip>
  • <ensembl_110_orthofinder_last_Orthogroups.tsv.zip>

Includes a table specifying the OrthoFinder species file names and its corresponding NCBI taxonomic IDs:

  • <ensembl_110_orthofinder_last_species_list.tsv>

Includes NCBI taxonomic tree for Ensembl release-110 species analysed:

  • <ensembl_110_species_tree_ncbi_topology_named.nw>
  • <ensembl_110_species_tree_ncbi_topology_named.pdf>

OrthoFinder:  Ensembl release-111 (-S last)

Includes OrthoFinder results (-S last) for all translated coding sequences (CDS) from Ensembl release-111 (keeping only longest isoforms) and Xtropicalisv9.0.Named.primaryTrs.pep.fa from www.xenbase.org:

  • <ensembl_111_orthofinder_last_Orthogroups.GeneCount.tsv.zip>
  • <ensembl_111_orthofinder_last_Orthogroups.tsv.zip>

Includes a table specifying the OrthoFinder species file names and its corresponding NCBI taxonomic IDs:

  • <ensembl_111_orthofinder_last_species_list.tsv>

OrthoFinder:  Ensembl release-112 (-S last)

Includes OrthoFinder results (-S last) for all translated coding sequences (CDS) from Ensembl release-112 (keeping only longest isoforms) and Xtropicalisv9.0.Named.primaryTrs.pep.fa from www.xenbase.org:

  • <ensembl_112_orthofinder_last_Orthogroups.GeneCount.tsv.zip>
  • <ensembl_112_orthofinder_last_Orthogroups.tsv.zip>

Includes a table specifying the OrthoFinder species file names and its corresponding NCBI taxonomic IDs:

  • <ensembl_112_orthofinder_last_species_list.tsv>

OrthoFinder:  Ensembl release-113 (-S last)

Includes OrthoFinder results (-S last) for all translated coding sequences (CDS) from Ensembl release-113 (keeping only longest isoforms) and Xtropicalisv9.0.Named.primaryTrs.pep.fa from www.xenbase.org:

  • <ensembl_113_orthofinder_last_Orthogroups.GeneCount.tsv.zip>
  • <ensembl_113_orthofinder_last_Orthogroups.tsv.zip>

Includes a table specifying the OrthoFinder species file names and its corresponding NCBI taxonomic IDs:

  • <ensembl_113_orthofinder_last_species_list.tsv>

OrthoFinder:  Ensembl release-114 (-S last)

Includes OrthoFinder results (-S last) for all translated coding sequences (CDS) from Ensembl release-114 (keeping only longest isoforms) and Xtropicalisv9.0.Named.primaryTrs.pep.fa from www.xenbase.org:

  • <ensembl_114_orthofinder_last_Orthogroups.GeneCount.tsv.zip>
  • <ensembl_114_orthofinder_last_Orthogroups.tsv.zip>

Includes a table specifying the OrthoFinder species file names and its corresponding NCBI taxonomic IDs:

  • <ensembl_114_orthofinder_last_species_list.tsv>

OrthoFinder:  Ensembl release-115 (-S last)

Includes OrthoFinder results (-S last) for all translated coding sequences (CDS) from Ensembl release-115 (keeping only longest isoforms) and Xtropicalisv9.0.Named.primaryTrs.pep.fa from www.xenbase.org:

  • <ensembl_115_orthofinder_last_Orthogroups.GeneCount.tsv.zip>
  • <ensembl_115_orthofinder_last_Orthogroups.tsv.zip>

Includes a table specifying the OrthoFinder species file names and its corresponding NCBI taxonomic IDs:

  • <ensembl_115_orthofinder_last_species_list.tsv>

OrthoFinder:  WormBase release-WS288 + WormBase ParaSite release-WBPS18 (-S last)

Includes OrthoFinder results (-S last) for all translated coding sequences (CDS) from WormBase release-WS288,  WormBase ParaSite release-WBPS18 (keeping only longest isoforms) and dd_Smed_v6.pcf.contigs.fasta (transdecoder and miniprothint peptides) from https://planmine.mpibpc.mpg.de:

  • <WS288_WBPS18_orthofinder_last_Orthogroups.GeneCount.tsv.zip>
  • <WS288_WBPS18_orthofinder_last_Orthogroups.tsv.zip>

Includes a table specifying the OrthoFinder species file names and its corresponding NCBI taxonomic IDs:

  • <WS288_WBPS18_orthofinder_last_species_list.tsv>

Includes NCBI taxonomic tree for WormBase release-WS288 and WormBase ParaSite release-WBPS18 species analysed:

  • <WS288_WBPS18_species_tree_ncbi_topology_named.nw>
  • <WS288_WBPS18_species_tree_ncbi_topology_named.pdf>

OrthoFinder:  WormBase release-WS293 + WormBase ParaSite release-WBPS19 (-S last)

Includes OrthoFinder results (-S last) for all translated coding sequences (CDS) from WormBase release-WS293,  WormBase ParaSite release-WBPS19 (keeping only longest isoforms):

  • <WS293_WBPS19_orthofinder_last_Orthogroups.GeneCount.tsv.zip>
  • <WS293_WBPS19_orthofinder_last_Orthogroups.tsv.zip>

Includes a table specifying the OrthoFinder species file names and its corresponding NCBI taxonomic IDs:

  • <WS293_WBPS19_orthofinder_last_species_list.tsv>

Includes NCBI taxonomic tree for WormBase release-WS293 and WormBase ParaSite release-WBPS19 species analysed:

  • <WS293_WBPS19_species_tree_ncbi_topology_named.nw>
  • <WS293_WBPS19_species_tree_ncbi_topology_named.pdf>

Broccoli:  Example (default settings)

Includes Broccoli results (default settings) for translated coding sequences (CDS) from four plant sepcies (keeping only longest isoforms) namely A. lyrata, A. thaliana, C. hirsuta and C. rubella:

  • <broccoli_example_table_OGs_protein_counts.txt>
  • <broccoli_example_table_OGs_protein_names.txt>

Includes a table specifying the Broccoli species file names and its corresponding NCBI taxonomic IDs:

  • <broccoli_example_species_list.tsv>

Pre-calculated orthomaps:

Includes pre-calculated gene age assignments for C. elegans (Sun et al. 2021), H. vulgaris (Cazet et al. 2022) and D. rerio (Ensembl-105; Ensembl-110; Ensembl-113):

  • <Sun2021_Orthomap.tsv>
  • <Cazet2022_Orthomap.tsv>
  • <zebrafish_ensembl_105_orthomap.tsv>
  • <zebrafish_ensembl_110_orthomap.tsv>
  • <zebrafish_ensembl_113_orthomap.tsv>

Pre-calculated Ensembl release-113 orthomaps (OrthoFinder -S last):

Includes extracted orthomaps for all species from Ensembl release-113 based on OrthoFinder (-S last):

  • <ensembl_113_orthofinder_last_orthomaps.tsv.zip>

Pre-calculated Ensembl release-114 orthomaps (OrthoFinder -S last):

Includes extracted orthomaps for all species from Ensembl release-114 based on OrthoFinder (-S last):

  • <ensembl_114_orthofinder_last_orthomaps.tsv.zip>

Pre-calculated Ensembl release-115 orthomaps (OrthoFinder -S last):

Includes extracted orthomaps for all species from Ensembl release-115 based on OrthoFinder (-S last):

  • <ensembl_115_orthofinder_last_orthomaps.tsv.zip>

Pre-calculated evolutionary indices:

Includes pre-calculated TajimaD, NormalizedPi, FayWu, Fst for C. elegans (Ma et al. 2021):

  • <Ma2021_Fst.tsv>

eggNOG database version 6.0 orthomaps:

Includes extracted orthomaps for all Eukaryota from eggNOG database version 6.0 (Hernández-Plaza et al. 2022):

  • <eggnog6_eukaryota_orthomaps.tsv.zip>

myTAI example data:

Includes example data from the myTAI R package (Drost et al. 2018)

  • <PhyloExpressionSetExample.h5ad>

PLAZA database version 5.0 orthomaps:

Includes extracted orthomaps for either HOMFAM or ORTHOFAM groups of plants from PLAZA database version 5.0 (Van Bel et al. 2022):

  • <plaza_v5_dicots_HOMFAM_orthomaps.tsv.zip>
  • <plaza_v5_dicots_ORTHOFAM_orthomaps.tsv.zip>
  • <plaza_v5_monocots_ORTHOFAM_orthomaps.tsv.zip>
  • <plaza_v5_monocots_ORTHOFAM_orthomaps.tsv.zip>

Mouse synonyms:

Table of Mus musculus gene synonyms obtained from here https://github.com/mustafapir/geneName/blob/master/data/mouse_synonyms1.rda and converted into a table.

  • <mouse_synonyms.tsv>

Files

broccoli_example_table_OGs_protein_counts.txt

Files (688.8 MB)

Name Size Download all
md5:d36205d05d84e20165b2bf2e1fc812fd
179 Bytes Download
md5:4c619b3eefe61b7f1610b455def0e029
400.6 kB Preview Download
md5:95cbf475afc3136aed2359f36e4cce4c
1.7 MB Preview Download
md5:79b652500ecf8c9adff25088623c8429
264.5 kB Download
md5:9f3e2e8bf9415980845a1477c5dd1b3f
109.1 MB Preview Download
md5:caa971abed6bf34678f34f209b9907fd
1.3 MB Preview Download
md5:e75fafb750d3ebc490e219c0a04aa197
35.4 MB Preview Download
md5:bcb7a19824d483002dd74fc88b252631
15.7 kB Download
md5:e4648a88e1619ff5b77f25d4970bca7f
8.1 kB Download
md5:3d4359df5758dc4db18926af2ff8ee41
16.3 kB Preview Download
md5:aebe6404bbe180e36c39384c1dace219
1.3 MB Preview Download
md5:d087a32e199389aab9d41474d932acf9
36.5 MB Preview Download
md5:1e4843fa4230ead7f2fa0518c99ac69d
11.3 kB Download
md5:2a1abb8340d1b5585de84eeb3b3e97bf
10.2 kB Download
md5:81891e1b1b4b940dc1411d740abd52ec
17.6 kB Preview Download
md5:c50bc64cdf00b7dc1ceaf1dce1fec5bb
1.3 MB Preview Download
md5:1120975a2e06c80e243d1c9388596593
36.4 MB Preview Download
md5:2c02ae0d1133192e13fc6dd03c3ca97e
11.3 kB Download
md5:47eb3e43086137c4bab7015bd812501a
1.4 MB Preview Download
md5:519499bd2ef2fc76dc2d3af97aff22c3
37.5 MB Preview Download
md5:e8229aab0cc51553837f0995fbb08f7b
11.6 kB Download
md5:177aa9bd7a203ff73b3ea21ffa402825
1.2 MB Preview Download
md5:ba6188409dd5b3cbee9a1060e4840798
39.3 MB Preview Download
md5:4f24c723b0745e3a3792fc5c46d0c923
46.8 MB Preview Download
md5:66cdb69a5b219930bb3fa87484bc0ad4
12.4 kB Download
md5:1f4e35700bd9613619e11a4ce48b62bf
1.2 MB Preview Download
md5:d905995decdfafded9d3fb944519bf56
40.3 MB Preview Download
md5:169996ce37d7341803e438ecf521850b
48.0 MB Preview Download
md5:4264be36ecccf17e3067a41eeebcc8bc
12.6 kB Download
md5:9036b9de6ee46a6e91e8236ae8b285fb
1.2 MB Preview Download
md5:3f0a8ecfafda459e04bf6938e0550a88
40.4 MB Preview Download
md5:cd7f46915485c6aa10976807022a18c3
46.3 MB Preview Download
md5:0dd61d5ea1246082f40e5d8ef7e64d40
12.7 kB Download
md5:755905ce61bee50f057c93fd2c2f9530
1.0 MB Download
md5:6e96ca44887272cb4acdf2e69e3f994a
1.1 MB Download
md5:0fe84e401348ede11ddea7130247fdf1
2.9 MB Download
md5:e0002cf24722534732f6328f1659ada3
21.0 MB Preview Download
md5:aa7660f0369569bbbdf187fccb2948c1
22.6 MB Preview Download
md5:d1d2adc746d728083ecc067d56d5a13d
12.4 MB Preview Download
md5:c35c35cfd50b8780d9209b8c5dadb21e
13.4 MB Preview Download
md5:736deffbe886d954bd899785cf927d9b
344.6 kB Download
md5:ef729ef0e903172aadbaecceabe380a2
3.2 MB Preview Download
md5:ccc1a0af6060ab3dc985ea76d37c8bc0
27.5 MB Preview Download
md5:1e76b85782bb3c0c6144ca7be79ec29b
20.6 kB Download
md5:b364d113f88345c6aaf933736ac00d77
6.8 kB Download
md5:9a93a0ff78ccbde7a8db845c5504af27
10.0 kB Preview Download
md5:b574e4321976c7d31ed8dde91ed2a953
3.7 MB Preview Download
md5:d4bb86162f213969a5c991030292146e
44.9 MB Preview Download
md5:4bc8fa296718914f452d948be086db48
16.4 kB Download
md5:4dffd57cd59ea813ac2a94647caa1f64
13.3 kB Download
md5:f79f6b256e4dc3e1498f94ce612a9caf
33.6 kB Preview Download
md5:847704a8dbfee4a13e91a122c55bc4d4
1.9 MB Download
md5:cef1bbd0a10b95c1f6394b98aa3dcca3
1.9 MB Download
md5:0f09a6ce56afab1b9d59004a6aa1dd76
1.9 MB Download
md5:98849540577cb6c3aea8e05c814f4f8f
1.5 MB Download

Additional details

References

  • Emms, D.M. and Kelly, S., 2019. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome biology, 20(1), pp.1-14.
  • Sun, S., Rödelsperger, C. and Sommer, R.J., 2021. Single worm transcriptomics identifies a developmental core network of oscillating genes with deep conservation across nematodes. Genome research, 31(9), pp.1590-1601.
  • Cazet, J., Siebert, S., Little, H.M., Bertemes, P., Primack, A.S., Ladurner, P., Achrainer, M., Fredriksen, M.T., Moreland, R.T., Singh, S. and Zhang, S., 2022. New Hydra genomes reveal conserved principles of hydrozoan transcriptional regulation. bioRxiv.
  • Hernández-Plaza, A., Szklarczyk, D., Botas, J., Cantalapiedra, C.P., Giner-Lamia, J., Mende, D.R., Kirsch, R., Rattei, T., Letunic, I., Jensen, L.J. and Bork, P., 2022. eggNOG 6.0: enabling comparative genomics across 12 535 organisms. Nucleic Acids Research.
  • Drost, H.G., Gabel, A., Liu, J., Quint, M. and Grosse, I., 2018. myTAI: evolutionary transcriptomics with R. Bioinformatics, 34(9), pp.1589-1590.
  • Van Bel, M., Silvestri, F., Weitz, E.M., Kreft, L., Botzki, A., Coppens, F. and Vandepoele, K., 2022. PLAZA 5.0: extending the scope and power of comparative and functional genomics in plants. Nucleic Acids Research, 50(D1), pp.D1468-D1474.
  • Ma, F., Lau, C.Y. and Zheng, C., 2021. Large genetic diversity and strong positive selection in F-box and GPCR genes among the wild isolates of Caenorhabditis elegans. Genome Biology and Evolution, 13(5), p.evab048.
  • Derelle, R., Philippe, H., & Colbourne, J. K., 2020. Broccoli: combining phylogenetic and network analyses for orthology assignment. Molecular Biology and Evolution, 37(11), 3389-3396.