Planned intervention: On Wednesday June 26th 05:30 UTC Zenodo will be unavailable for 10-20 minutes to perform a storage cluster upgrade.
Published November 4, 2017 | Version v1
Dataset Open

Data from: On the (un)predictability of a large intragenic fitness landscape

  • 1. Instituto Gulbenkian de Ciência
  • 2. École Polytechnique Fédérale de Lausanne
  • 3. Eli Lilly and Company*

Description

The study of fitness landscapes, which aims at mapping genotypes to fitness, is receiving ever-increasing attention. Novel experimental approaches combined with next-generation sequencing (NGS) methods enable accurate and extensive studies of the fitness effects of mutations, allowing us to test theoretical predictions and improve our understanding of the shape of the true underlying fitness landscape and its implications for the predictability and repeatability of evolution. Here, we present a uniquely large multiallelic fitness landscape comprising 640 engineered mutants that represent all possible combinations of 13 amino acid-changing mutations at 6 sites in the heat-shock protein Hsp90 in Saccharomyces cerevisiae under elevated salinity. Despite a prevalent pattern of negative epistasis in the landscape, we find that the global fitness peak is reached via four positively epistatic mutations. Combining traditional and extending recently proposed theoretical and statistical approaches, we quantify features of the global multiallelic fitness landscape. Using subsets of the data, we demonstrate that extrapolation beyond a known part of the landscape is difficult owing to both local ruggedness and amino acid-specific epistatic hotspots and that inference is additionally confounded by the nonrandom choice of mutations for experimental fitness landscapes.

Notes

Files

data.csv

Files (173.6 kB)

Name Size Download all
md5:8a6ecd5f2d525d14ff1d8ea3bf0943d6
173.6 kB Preview Download

Additional details

Related works

Is cited by
10.1073/pnas.1612676113 (DOI)