Published November 3, 2023 | Version v2
Dataset Open

Data sets for Polyplax serrata article "Highly-resolved genomes of two closely related lineages of the rodent louse Polyplax serrata with different host specificities"

  • 1. University of South Bohemia in České Budějovice

Description

Supplementary data for Polyplax serrata article 2023

Data included in this repository were generated and used in various genomic and phylogenetic analysis presented by the publication "Highly-resolved genomes of two closely related lineages of the louse Polyplax serrata with different host specificities"

Description of the data and file structure

Data provided for each analyzed taxa include:

-  Annotation table.

-  fasta format files for transcripts (CDS and mRNA).

-  fasta format file for genome.

-  protein fasta file.

-  gbk format file that includes the genome with its corresponding annotations.

Additionally,

- repeat families in fasta format were included for Polyplax serrata S and N lineages.

- rRNA in fasta format were included for Polyplax serrata S and N lineages, Pediculus humanus, Columbicola columbae and Brueelia nebulsa

Sharing/Access information

GenBank accession number of analyzed taxa:

·        Aedes Aegypti (GenBank accession no. GCF_002204515.2).

·        Brueelia nebulsa ( GenBank accession no. GCA_028293925.1).

·        Columbicola columbae (GenBank accession no. GCA_016920875.1).

·        Cimex lectularis (GenBank accession no. GCF_000648675.2).

·        Glossina morsitans (GenBank accession no. GCA_001077435.1).

·        Pediculus humanus (GenBank accession no. GCA_000006295.1).

·        Rhodnius prolixus (GenBank accession no. GCA_000181055.3).

·        Polyplax serrata S lineage (GenBank accession no. JAWJWF000000000).

·        Polyplax serrata N lineage (GenBank accession no. JAWJWE000000000).

 

Note: All the latter genomes except for the two genomes of Polyplax serrata S and N lineages, were acquired from GenBank database and were subjected to the same gene prediction and annotation workflow as P. serrata genomes to maintain methodological consistence in downstream analysis of the annotation results.

Software

- Gene prediction and annotation was performed using Funannotate v1.18.14 (https://github.com/nextgenusfs/funannotate)).

- Repeat were identified in the genomes of P. serrata S and N lineages using RepeatModeler v2.0.3.

Notes

Project funded by Czech Science Foundation, Grant number: 21-02532S

Files

Files (2.9 GB)

Name Size Download all
md5:a152ca835d8f322efacab8e2c660317c
957.4 MB Download
md5:4a1f94f2408636e89ccbc9acfb769796
120.2 MB Download
md5:64e96e559fdb7ec05dbadc3b895ec7bb
444.9 MB Download
md5:8e708defab1f0b31982966551166fde2
198.5 MB Download
md5:96d65293a11ac18d814d42009f48ef82
315.7 MB Download
md5:8b4ea452103b660e6d0da4a38bb39883
108.8 MB Download
md5:b389b4d78004c9b5e01ebc074f37ebef
139.9 MB Download
md5:e6d4db8c5aed38abefea40f79f47aec7
139.9 MB Download
md5:8b8952d7a1a769670fbc783c1e0fe72a
452.6 MB Download