Info: Zenodo’s user support line is staffed on regular business days between Dec 23 and Jan 5. Response times may be slightly longer than normal.

Published October 1, 2020 | Version v1
Dataset Open

A novel method for using RNA-seq data to identify imprinted genes in social Hymenoptera with multiply mated queens

  • 1. University of Oxford
  • 2. University of Copenhagen
  • 3. Beijing Genomics Institute

Description

Genomic imprinting results in parent-of-origin dependent gene expression biased towards either the maternally- or paternally-derived allele at the imprinted locus. The kinship theory of genomic imprinting argues that this unusual expression pattern is a manifestation of intra-genomic conflict between the maternally- and paternally-derived halves of the genome that arises because they are not equally related to the genomes of social partners. The theory thus predicts that imprinting may evolve wherever there are close interactions among asymmetrically related kin. The social Hymenoptera with permanent caste differentiation are suitable candidates for testing the kinship theory because haplodiploid sex determination creates strong relatedness asymmetries and nursing workers interact closely with kin. However, progress in the search for imprinted genes in the social Hymenoptera has been slow, in part because tests for imprinting rely on reciprocal crosses that are impossible in most species. Here, we develop a method to systematically search for imprinting in haplodiploid social insects without crosses, using instead samples of pooled individuals collected from natural colonies. We tested this protocol using data available for the leaf-cutting ant Acromyrmex echinatior, providing the first genome-wide search for imprinting in any ant. While we identified several genes as potentially imprinted, none of the four genes tested could be verified as imprinted using digital droplet PCR, highlighting the need for higher quality genomic assemblies that accurately map duplicated genes.

Notes

See README.txt for description of the data, files and script uploaded. 

In short, there are three data files uploaded in input_data (one for the allele-specific expression: ASE_data_frame.csv, one for the ddPCR allele-specific PCR: compiled_ddPCR_results.csv, and one for the copy number dilution assay: dilution_data.csv). There is one RMarkdown script (Howe_et_al_RScript.Rmd), which conducts all analyses, relying on only basic R packages that must be installed. 

Some intermediate data-values are missing from the ddPCR file (ie. the absolute number of droplets for each channel) due to a recording error. The fraction and confidence intervals are reported however.  

Funding provided by: H2020 European Research Council
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100010663
Award Number: 323085

Files

ASE_data_frame.csv

Files (2.5 MB)

Name Size Download all
md5:1e5382c8344621ef871f4ecb53f9dfe0
205.8 kB Preview Download
md5:b3941a6d503179fa88637adb449b1d47
8.7 kB Preview Download
md5:d2e4b8e0f352399839accd857cd5492a
1.9 kB Preview Download
md5:61db323d1323986255fe0f5d1fa9379d
30.4 kB Preview Download
md5:15c8e6be6c2bb42998e5f56192ffcda5
2.2 MB Download
md5:d943dd8edcff65991a90e649744766fb
48.0 kB Download
md5:25b94f34b27c7e4194a5d011ef235d24
6.7 kB Preview Download
md5:94b0c801c58646502515206da1f9b6aa
2.6 kB Preview Download
md5:aae6f6697f5cc34191096de03c9b600b
7.7 kB Preview Download

Additional details

Related works

Is cited by
10.1038/ncomms5943 (DOI)