Published November 4, 2017
| Version v1
Dataset
Open
Data from: On the (un)predictability of a large intragenic fitness landscape
- 1. Instituto Gulbenkian de Ciência
- 2. École Polytechnique Fédérale de Lausanne
- 3. Eli Lilly and Company*
Description
The study of fitness landscapes, which aims at mapping genotypes to fitness, is receiving ever-increasing attention. Novel experimental approaches combined with next-generation sequencing (NGS) methods enable accurate and extensive studies of the fitness effects of mutations, allowing us to test theoretical predictions and improve our understanding of the shape of the true underlying fitness landscape and its implications for the predictability and repeatability of evolution. Here, we present a uniquely large multiallelic fitness landscape comprising 640 engineered mutants that represent all possible combinations of 13 amino acid-changing mutations at 6 sites in the heat-shock protein Hsp90 in Saccharomyces cerevisiae under elevated salinity. Despite a prevalent pattern of negative epistasis in the landscape, we find that the global fitness peak is reached via four positively epistatic mutations. Combining traditional and extending recently proposed theoretical and statistical approaches, we quantify features of the global multiallelic fitness landscape. Using subsets of the data, we demonstrate that extrapolation beyond a known part of the landscape is difficult owing to both local ruggedness and amino acid-specific epistatic hotspots and that inference is additionally confounded by the nonrandom choice of mutations for experimental fitness landscapes.
Notes
Files
data.csv
Files
(173.6 kB)
Name | Size | Download all |
---|---|---|
md5:8a6ecd5f2d525d14ff1d8ea3bf0943d6
|
173.6 kB | Preview Download |
Additional details
Related works
- Is cited by
- 10.1073/pnas.1612676113 (DOI)