Annotation of the novel humpback whale (<em>Megaptera novaeangliae</em>) reference genome
Description
We present a comprehensive annotation of the humpback whale (Megaptera novaeangliae) genome, representing the most complete and biologically coherent gene-model resource available to date for this species. Leveraging R (v2025.05.1+513) within a fully reproducible analysis pipeline, we evaluated 1,308,486 genomic features—including 21,833 protein-coding genes and 386,765 annotated exons. The structural features of our annotation are consistent with mammalian expectations (median gene length ≈ 4.3 kb; intron lengths peaking in the 1–10 kb range) and show coding‐sequence integrity (99.7 % of transcripts are in-frame). Gene loci are distributed non‐uniformly across scaffolds, with densities ranging from ~7 to ~15 genes per megabase, covering ~25–30 % of the assembled genome sequence. We further identified ~2,000 multi‐copy gene families and numerous tandem duplication clusters (2–12 members). These results demonstrate that the M. novaeangliae genome is both structurally complete and functionally rich, providing a robust foundation for future comparative, physiological, and conservation‐genomics investigations.
Notes
Files
Fig_ContigEdge_Genes_Clean.png
Files
(2.6 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:a59ad1179874ef355a4aab987a017166
|
42.4 kB | Preview Download |
|
md5:9f50dd0993650312dc0e5fc5390e69d4
|
52.6 kB | Preview Download |
|
md5:ff4a32e158410c427553a952d8aed026
|
58.4 kB | Preview Download |
|
md5:8602fa9f93058ed04d44ca53a51f45d3
|
54.8 kB | Preview Download |
|
md5:be124c9ec1b47797152a71a3ea7cf4ca
|
865.9 kB | Preview Download |
|
md5:d527c0c6d8cc1300eac658abc83e019f
|
362.7 kB | Preview Download |
|
md5:89164b3f3082a2787f1f6948800dbe82
|
165.0 kB | Preview Download |
|
md5:9b2ec082d27d1b05dcb2d3c68324bcb6
|
63.4 kB | Preview Download |
|
md5:6ae3ff13573d37e52e734e599faaab12
|
37.4 kB | Preview Download |
|
md5:fd4119107a8c3f033a595939e3c2c678
|
342.5 kB | Preview Download |
|
md5:c1155f003ca45dbca63ea498f45095df
|
62.5 kB | Preview Download |
|
md5:8a5c49bb2470d13ed3cf1dd3fc706f35
|
50.5 kB | Preview Download |
|
md5:88e983c84b8a89cf49eddd331c6b4b59
|
58.4 kB | Preview Download |
|
md5:29f992f89c2b4c300f0b520562544c53
|
43.2 kB | Preview Download |
|
md5:7e59a2b73ad5b5c521a8430dc7a9d8b2
|
111.4 kB | Preview Download |
|
md5:bfb1e9d31f40e26f3cefb38e9dc6897f
|
128.5 kB | Preview Download |
|
md5:60985015cb898588faba30b15f781945
|
82.9 kB | Preview Download |
|
md5:c9e1af09ee646197e34f1fc56f43b300
|
56.8 kB | Preview Download |
Additional details
Related works
- Is cited by
- 10.1038/s41597-024-03922-9 (DOI)
- Is derived from
- 10.5061/dryad.x0k6djj09 (DOI)
- Is supplemented by
- https://www.ncbi.nlm.nih.gov/nuccore/JBGMDX010000001.1 (URL)
- 10.5061/dryad.dv41ns271 (DOI)