Published October 1, 2023 | Version v2
Dataset Open

Protein families for 2032 Saccharomyces cerevisiae genome assemblies

Creators

Description

Protein families obtained with different cluster cutoffs for different sets of genome assemblies, together with the marker genes, are provided as text files. In each file, each row represents a family and each TAB-separated column represents a genome assembly. The first column gives the family ID and the first row gives the tag for each assembly. Protein IDs for multiple homologues from the same assembly are separated by '|'. The cluster cutoffs used are 50%, 60%, 70%, 80%, and 90%. The genome sets are all genomes (all), non-redundant genomes (nr), medium-high-quality genomes (mhq), and high-quality genomes (hq).
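
The layout described above (family ID in the first column, assembly tags in the first row, '|' between homologues from the same assembly) can be read with a few lines of Python. This is a minimal sketch, not part of the dataset: the function name `read_families` and the example tags and IDs are hypothetical, and it assumes that a family absent from an assembly is represented by an empty cell, which the description does not state.

```python
import io


def read_families(handle):
    """Parse a family table into (assembly_tags, {family_id: {tag: [protein_ids]}}).

    Assumption: an empty cell means the family has no member in that assembly.
    """
    # First row: first cell is a label for the family ID column, the rest are assembly tags.
    header = handle.readline().rstrip("\n").split("\t")
    assemblies = header[1:]

    families = {}
    for line in handle:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines
        fields = line.split("\t")
        family_id, cells = fields[0], fields[1:]
        members = {}
        for tag, cell in zip(assemblies, cells):
            # Multiple homologues from the same assembly are separated by '|'.
            protein_ids = [p for p in cell.split("|") if p]
            if protein_ids:
                members[tag] = protein_ids
        families[family_id] = members
    return assemblies, families


# Tiny in-memory example with made-up tags and IDs (not taken from the dataset).
example = "family\tasmA\tasmB\nfam1\tp1|p2\tp3\nfam2\t\tp4\n"
assemblies, families = read_families(io.StringIO(example))
```

For a real file, the same function could be called on an opened handle, for example `with open(path) as fh: read_families(fh)`, where `path` points to one of the text files in the archive. Reading line by line keeps memory use proportional to the parsed result rather than to the raw file.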

Files (577.8 MB)

md5:bfd86adc7abcb71d66e0b1a63d72094b (577.8 MB)