Published November 2, 2022 | Version v1
Dataset Open

Data for: Testing for fitness epistasis in a transplant experiment identifies a candidate adaptive locus in Timema stick insects

  • 1. Centre d'Ecologie Fonctionnelle et Evolutive
  • 2. Fundação de Apoio à Universidade Federal de São Paulo
  • 3. Utah State University
  • 4. University of Nevada Reno
  • 5. Notre Dame University

Description

Identifying the genetic basis of adaptation is a central goal of evolutionary biology. However, identifying genes and mutations affecting fitness remains challenging because a large number of traits and variants can influence fitness. Selected phenotypes can also be difficult to know a priori, complicating top-down genetic approaches for trait mapping that involve crosses or genome-wide association studies. In such cases, experimental genetic approaches, where one maps fitness directly and attempts to infer the traits involved afterward, can be valuable. Here, we re-analyse data from a transplant experiment involving Timema stick insects, where five physically clustered SNPs associated with cryptic body colouration were shown to interact to affect survival. Our analysis covers a larger genomic region than past work and reveals a locus not previously identified as associated with survival. This locus resides near a gene, Punch (Pu), involved in pteridine pigment production, implying that it could be associated with an unmeasured colouration trait. However, by combining previous and newly obtained phenotypic data, we show that this trait is not eye or body colouration. We discuss the implications of our results for the discovery of traits, genes, and mutations associated with fitness in other systems, as well as for supergene evolution.

Notes

Data

timema_cristinae_1.3c2_braker_interproscan_predgene_and_funcann.wseq.gff3.bz2: annotation file for Timema cristinae reference genome 1.3c2.

mod_g_tchum_1.3c2.lgNA.excluded.geno: genotype file for all the individuals in the transplant experiment (both AC and MM treatments) containing genome-wide information. Used in the body and eye colouration GWA analyses.

mod_g_tchum_AC_clean_LGNA_excluded.MelStripe.dsv: genotype file for individuals in the AC treatment (transplanted from MM to AC) containing information for the MelStripe locus only. Used in the gemma and LT-MAPIT analyses on survival.

mod_g_tchum_AC_clean_LGNA_excluded.geno: genotype file for individuals in the AC treatment (transplanted from MM to AC) containing genome-wide information. Not used in any analysis but provided for readers' convenience.

lm.data.txt: genotype file for LT-MAPIT outlier SNPs 1 and 2, PCA axis 1, and their interactions. Used in the prediction of survival analysis.

pntest_fha2013.txt: genotype file for the linkage disequilibrium analysis.

2019_Tchumash_epis_2022-01-12_Table4paper.xlsx: phenotype data, comprising survival, body and eye colouration, and genomic predictions for body and eye colouration.
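
The annotation file is a bz2-compressed GFF3 file, so it can be read as a stream without unpacking it first. The sketch below is not part of the archive. It assumes the standard nine-column, tab-separated GFF3 layout and an optional trailing ##FASTA section (suggested by "wseq" in the file name, but not confirmed here). The function name iter_gff3_features and the keys of the returned dictionaries are hypothetical.

```python
import bz2


def iter_gff3_features(path, feature_type="gene"):
    """Yield features of one type from a bz2-compressed GFF3 file.

    Assumes standard GFF3: nine tab-separated columns, '#' comment lines,
    and an optional '##FASTA' section after the feature table.
    """
    with bz2.open(path, "rt") as handle:
        for line in handle:
            if line.startswith("##FASTA"):
                break  # sequence section starts here; no more features
            if not line.strip() or line.startswith("#"):
                continue  # skip blank lines, comments, and directives
            cols = line.rstrip("\n").split("\t")
            if len(cols) != 9:
                continue  # not a well-formed feature line
            seqid, _source, ftype, start, end, _score, strand, _phase, attrs = cols
            if ftype != feature_type:
                continue
            # Column 9 holds 'key=value' pairs separated by ';'
            attributes = dict(
                item.split("=", 1) for item in attrs.split(";") if "=" in item
            )
            yield {
                "seqid": seqid,
                "start": int(start),
                "end": int(end),
                "strand": strand,
                "attributes": attributes,
            }
```

For example, list(iter_gff3_features(path)) would collect all gene features, where path points to the downloaded .gff3.bz2 file.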

Functions and scripts

01_gemma_bslmm.pl: function running the gemma BSLMM models

02_gemma_summary.pl: function summarizing the results across MCMC chains.

03_gemma_sparse_formatting.pl: function formatting the output file for easy plotting.

LT-MAPIT-output_formatting.pl: function formatting the output files from LT-MAPIT analyses.

select-MelStripe_geno_file.sh: selects SNPs within the MelStripe region from the file containing genome-wide information.

generating_prediction_input file.R: generates the input file for the survival prediction analysis.

eye-body-colour-phenotypic-correlation_and_plots.R: computes the phenotypic correlations and associated plots.

genomic-correlations-script.R: computes the genomic correlations and associated plots.

gemma_bslmm_array.slurm: runs the gemma BSLMM analyses.

scripts/GWAs/GEMMA-BSLMM/colouration-traits/gemma_bslmm_relatedness-Matrix.sh: generates the relatedness matrix prior to GEMMA BSLMM analyses.

graphics_gemma_colour.R: generates the graphics for gemma BSLMM analyses on colouration traits.

graphics_gemma_survival.R: generates the graphics for gemma BSLMM analyses on survival.

gemma_predictions_bslmm_array.slurm: runs the gemma BSLMM analysis for genomic prediction of the colouration traits.

pheno.subsets.sh: subsets phenotypes prior to gemma prediction analyses.

calcLD.R: runs the linkage disequilibrium analysis in T. cristinae.

graphics_LT-MAPPIT.R: generates the graphics for LT-MAPIT results.

survival-LT-mappit.MelStripe.Zach-filter.R: runs the LT-MAPIT analysis on survival.

survival-LT-mappit.MelStripe.Zach-filter.slurm: Slurm wrapper script that runs the LT-MAPIT R script.

epiAnalysis.R: runs the prediction of survival analysis.
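
As an illustration of the kind of quantity a linkage disequilibrium analysis reports, the sketch below computes r² between two SNPs as the squared Pearson correlation of their genotype values. It is not taken from calcLD.R, which may use a different estimator. It assumes genotypes are coded as allele counts or dosages between 0 and 2, and the function name ld_r2 is hypothetical.

```python
import math


def ld_r2(g1, g2):
    """Squared Pearson correlation between two genotype vectors.

    g1, g2: equal-length sequences of genotype values, one per individual
    (assumed here to be allele counts or dosages between 0 and 2).
    Returns NaN when either SNP shows no variation.
    """
    if len(g1) != len(g2) or len(g1) < 2:
        raise ValueError("need two genotype vectors of equal length >= 2")
    n = len(g1)
    mean1 = sum(g1) / n
    mean2 = sum(g2) / n
    # Sums of cross-products and squares around the means
    cov = sum((a - mean1) * (b - mean2) for a, b in zip(g1, g2))
    var1 = sum((a - mean1) ** 2 for a in g1)
    var2 = sum((b - mean2) ** 2 for b in g2)
    if var1 == 0 or var2 == 0:
        return math.nan  # r² is undefined for a monomorphic SNP
    return (cov * cov) / (var1 * var2)
```

With unphased genotypes this gives a composite measure of LD rather than a haplotype-based one, which is a common simplification when phase is unknown.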

Funding provided by: European Research Council
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100000781
Award Number: 770826

Files (1.2 GB)

md5:2a845472317d348cae45f01288b6d0fc 63.7 kB
md5:9449653c4e6b037958a106912b0cc913 25.0 kB
md5:1840c5527b78f2c4298ffaa89abc670d 31.6 MB
md5:e55aeaf4ec6e6e025b944684c27f7411 16.0 MB
md5:e25d04f0df57047ae204f6377aaedb63 372.4 kB
md5:c9bfb0a9b6057455c7cd685bb9974b1d 847.4 MB
md5:359b76ee6b51e1ed99f1da4e165121a3 13.3 kB
md5:9560228291e55fd90154ded26e802345 271.3 MB
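
The md5 values listed above can be used to check that a download completed without corruption. The following is a minimal sketch using Python's standard hashlib module. Because the listing does not pair each checksum with a file name, the caller must supply the matching checksum. The function names md5_of_file and matches_listed_checksum are hypothetical.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that large files do not need to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected = listed.strip().split(":", 1)[-1].lower()
    return md5_of_file(path) == expected
```

For example, matches_listed_checksum(path, "md5:2a845472317d348cae45f01288b6d0fc") returns True only if the file at path has that digest.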

Additional details

Related works

Is derived from
10.5281/zenodo.5884987 (DOI)