Data sets for Polyplax serrata article "Highly-resolved genomes of two closely related lineages of the rodent louse Polyplax serrata with different host specificities"
Authors/Creators
- 1. University of South Bohemia in České Budějovice
Description
Supplementary data for Polyplax serrata article 2023
Data included in this repository were generated and used in various genomic and phylogenetic analysis presented by the publication "Highly-resolved genomes of two closely related lineages of the louse Polyplax serrata with different host specificities"
Description of the data and file structure
Data provided for each analyzed taxa include:
- Annotation table.
- fasta format files for transcripts (CDS and mRNA).
- fasta format file for genome.
- protein fasta file.
- gbk format file that includes the genome with its corresponding annotations.
Additionally,
- repeat families in fasta format were included for Polyplax serrata S and N lineages.
- rRNA in fasta format were included for Polyplax serrata S and N lineages, Pediculus humanus, Columbicola columbae and Brueelia nebulsa.
Sharing/Access information
GenBank accession number of analyzed taxa:
· Aedes Aegypti (GenBank accession no. GCF_002204515.2).
· Brueelia nebulsa ( GenBank accession no. GCA_028293925.1).
· Columbicola columbae (GenBank accession no. GCA_016920875.1).
· Cimex lectularis (GenBank accession no. GCF_000648675.2).
· Glossina morsitans (GenBank accession no. GCA_001077435.1).
· Pediculus humanus (GenBank accession no. GCA_000006295.1).
· Rhodnius prolixus (GenBank accession no. GCA_000181055.3).
· Polyplax serrata S lineage (GenBank accession no. JAWJWF000000000).
· Polyplax serrata N lineage (GenBank accession no. JAWJWE000000000).
Note: All the latter genomes except for the two genomes of Polyplax serrata S and N lineages, were acquired from GenBank database and were subjected to the same gene prediction and annotation workflow as P. serrata genomes to maintain methodological consistence in downstream analysis of the annotation results.
Software
- Gene prediction and annotation was performed using Funannotate v1.18.14 (https://github.com/nextgenusfs/funannotate)).
- Repeat were identified in the genomes of P. serrata S and N lineages using RepeatModeler v2.0.3.
Notes
Files
Files
(2.9 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:a152ca835d8f322efacab8e2c660317c
|
957.4 MB | Download |
|
md5:4a1f94f2408636e89ccbc9acfb769796
|
120.2 MB | Download |
|
md5:64e96e559fdb7ec05dbadc3b895ec7bb
|
444.9 MB | Download |
|
md5:8e708defab1f0b31982966551166fde2
|
198.5 MB | Download |
|
md5:96d65293a11ac18d814d42009f48ef82
|
315.7 MB | Download |
|
md5:8b4ea452103b660e6d0da4a38bb39883
|
108.8 MB | Download |
|
md5:b389b4d78004c9b5e01ebc074f37ebef
|
139.9 MB | Download |
|
md5:e6d4db8c5aed38abefea40f79f47aec7
|
139.9 MB | Download |
|
md5:8b8952d7a1a769670fbc783c1e0fe72a
|
452.6 MB | Download |