Published July 10, 2023 | Version v1
Dataset Open

Petascale Homology Search for Structure Prediction - MSAs

  • 1. Seoul National University

Description

Multiple sequence alignments (MSAs) of CASP15 targets used in the publication "Petascale Homology Search for Structure Prediction".

Each MSA is built by combining hits from one or more sequence databases, each searched with its own tool: 1) ColabFoldDB (CFDB), searched with the ColabFold search module; 2) the Sequence Read Archive (SRA), searched with MMseqs2; and 3) UniRef30+BFD, searched with HHblits (HH).

  • Queries: Regular CASP15 TS targets, excluding TBM-easy
  • MSAs (A3M format), one archive per database combination:
    • CFDB (cfdb.tar.gz)
    • SRA + CFDB (sra_cfdb.tar.gz)
    • HH + CFDB (hh_cfdb.tar.gz)
    • HH + SRA + CFDB (hh_sra_cfdb.tar.gz)
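
For readers who want to inspect the alignments programmatically, the following is a minimal sketch of reading A3M files straight out of one of the archives. The internal layout of the archives is not described here, so the sketch assumes (without confirmation) that members end in `.a3m`. It also assumes the standard A3M convention that lowercase letters mark insertions relative to the query, and it skips lines starting with `#`, which some ColabFold-style A3M files carry as a leading comment. The function names are illustrative and not part of any published tool.

```python
import tarfile


def parse_a3m(text):
    """Return a list of (header, sequence) pairs from A3M-formatted text."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines (assumed convention)
        if line.startswith(">"):
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)  # sequences may wrap over several lines
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def strip_insertions(sequence):
    """Drop lowercase insertion states so each row matches the query length."""
    return "".join(c for c in sequence if not c.islower())


def iter_a3m_from_tar(path):
    """Yield (member_name, records) for each .a3m member of a tar.gz archive."""
    with tarfile.open(path, "r:gz") as archive:
        for member in archive:
            # Assumption: alignment files use the .a3m extension.
            if member.isfile() and member.name.endswith(".a3m"):
                handle = archive.extractfile(member)
                text = handle.read().decode("utf-8", errors="replace")
                yield member.name, parse_a3m(text)
```

Calling `iter_a3m_from_tar("cfdb.tar.gz")` on a downloaded archive would then give, per target, the list of aligned sequences; `len(records)` is the MSA depth, including the query.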

Files (187.4 MB)

MD5 checksum                              Size
md5:602e859e98b7914646afad46784dcfc0      21.5 MB
md5:c51090ae522ec04cfb03109d7b5c288f      49.1 MB
md5:efb12bfc87549909363852ad48aaa863      68.4 MB
md5:e13010c3bee96b783d867edde181e3ed      48.4 MB
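
The MD5 checksums above can be used to confirm that a download is intact. The sketch below is one way to do that with Python's standard library. Because this listing does not show which checksum belongs to which archive, the sketch only checks whether a file's digest matches any of the four listed values; the helper names are hypothetical.

```python
import hashlib

# MD5 digests copied from the file listing above.
LISTED_MD5 = {
    "602e859e98b7914646afad46784dcfc0",
    "c51090ae522ec04cfb03109d7b5c288f",
    "efb12bfc87549909363852ad48aaa863",
    "e13010c3bee96b783d867edde181e3ed",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_listed(path):
    """Return True if the file's MD5 digest is one of the listed checksums."""
    return md5_of_file(path) in LISTED_MD5
```

For example, `is_listed("sra_cfdb.tar.gz")` returning `False` would suggest a truncated or corrupted download.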