Dataset Open Access

Data from: Genome assembly of the ragweed leaf beetle, a step forward to better predict rapid evolution of a weed biocontrol agent to environmental novelties

Bouchemousse, Sarah; Falquet, Laurent; Müller-Schärer, Heinz

DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="" xmlns="" xsi:schemaLocation="">
  <identifier identifierType="URL"></identifier>
      <creatorName>Bouchemousse, Sarah</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="">0000-0001-5283-5112</nameIdentifier>
      <affiliation>University of Fribourg</affiliation>
      <creatorName>Falquet, Laurent</creatorName>
      <affiliation>University of Fribourg</affiliation>
      <creatorName>Müller-Schärer, Heinz</creatorName>
      <affiliation>University of Fribourg</affiliation>
    <title>Data from: Genome assembly of the ragweed leaf beetle, a step forward to better predict rapid evolution of a weed biocontrol agent to environmental novelties</title>
    <subject>Biological control agent</subject>
    <subject>Ophraella communa</subject>
    <subject>Whole genome sequence</subject>
    <subject>SMRT-Cell sequencing</subject>
    <subject>de novo Assembly</subject>
    <date dateType="Issued">2020-05-29</date>
  <resourceType resourceTypeGeneral="Dataset"/>
    <alternateIdentifier alternateIdentifierType="url"></alternateIdentifier>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsSupplementTo">10.1093/gbe/evaa102</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsIdenticalTo">10.5061/dryad.1ns1rn8qt</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf"></relatedIdentifier>
    <rights rightsURI="">Creative Commons Zero v1.0 Universal</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
    <description descriptionType="Abstract">&lt;p&gt;&lt;span&gt;Rapid evolution of weed biological control agents (BCAs) to new biotic and abiotic conditions is poorly understood and so far, only little considered both in pre-release and post-release studies, despite potential major negative or positive implications for risks of non-targeted attacks or for colonizing yet unsuitable habitats, respectively. Provision of genetic resources, such as assembled and annotated genomes, is essential to assess potential adaptive processes by identifying underlying genetic mechanisms. Here, we provide the first sequenced genome of a phytophagous insect used as a BCA, &lt;i&gt;i.e.&lt;/i&gt; the leaf beetle &lt;i&gt;Ophraella communa&lt;/i&gt;, a promising BCA of common ragweed, recently and accidentally introduced into Europe. A total 33.98 Gb of raw DNA sequences, representing c. 43-fold coverage, were obtained using the PacBio SMRT-Cell sequencing approach. Among the five different assemblers tested, the SMARTdenovo assembly displaying the best scores was then corrected with Illumina short reads. A final genome of 774 Mb containing 7,003 scaffolds was obtained. The reliability of the final assembly was then assessed by benchmarking universal single-copy orthologous genes (&amp;gt; 96.0% of the 1,658 expected insect genes) and by remapping tests of Illumina short reads (average of 98.6% ± 0.7% without filtering). The number of protein-coding genes of 75,642, representing 82% of the published antennal transcriptome, and the phylogenetic analyses based on 825 orthologous genes placing &lt;i&gt;O. communa &lt;/i&gt;in the monophyletic group of Chrysomelidae, confirm the relevance of our genome assembly. Overall, the genome provides a valuable resource for studying potential risks and benefits of this BCA facing environmental novelties.&lt;/span&gt;&lt;/p&gt;</description>
    <description descriptionType="Other">&lt;div class="o-metadata__file-usage-entry"&gt;
Proteome and annotation files

&lt;p&gt;The "GenePrediction_Ophraella_communa_Augustus.aa" (fasta format) contains the 75,642 protein-coding genes predicted with Augustus (v3.2.3), using parameters "--gff3=on --species=tribolium2012". The preliminary annotation of each 75,642 predicted proteins, performed with Pannzer2, is described in the "Annotation_Ophraella_communa.out" file (text format). Details of the meaning of columns are provided on the Pannzer2 website ( An interactive html file can be generated using the command line below.&lt;/p&gt;

&lt;pre class="moz-quote-pre"&gt;perl &amp;lt; Annotation_Ophraella_communa.out &amp;gt; Annotation_Ophraella_communa.html
&lt;p&gt;Funding provided by: Swiss National Science Foundation*&lt;br&gt;Crossref Funder Registry ID: &lt;br&gt;Award Number: 31003A_166448 to HMS&lt;/p&gt;</description>
Views 19
Downloads 1
Data volume 23.9 MB
Unique views 17
Unique downloads 1


Cite as