Published June 17, 2024 | Version v2
Dataset Open

Phyloformer: Fast, accurate and versatile phylogenetic reconstruction with deep neural networks

Description

This record is composed of: 

  • The results.tar.gz file which  contains all the output files necessary to reproduce the figures and tables from the linked paper
  • The 3 datasets used to fine tune different versions of Phylofofmer:
    1. cherry_fine_tune.tar.xz used to fine tune Phyloformer on the CherryML model
    2. LG_fine_tune_mre.tar.xz used to fine tune Phyloformer on LG+GC data with an MRE loss
    3. pastek_fine_tune.tar.xz used to fine tune Phyloformer on the SelReg model
  • The paper_test_sets.tar.xz file contains the test sets used to generate data in results.tar.gz, with simulated tree/msa pairs and trees inferred by different methods

Files

Files (39.6 GB)

Name Size Download all
md5:0e1916f0fb26240daee7faf4060d4e1d
6.5 GB Download
md5:34bc9330b81e4f7668805a099b9a21fa
1.2 GB Download
md5:433420525264e200e9ea0b9a01462f6c
11.1 GB Download
md5:ef008cef93732bbfebf40391b55819b8
9.2 GB Download
md5:b44994887561a3ed00b67dfc0211e704
11.5 GB Download

Additional details

Related works

Is supplement to
Preprint: 10.1101/2024.06.17.599404 (DOI)

Software

Repository URL
https://github.com/lucanest/Phyloformer
Programming language
Python