Published January 12, 2026 | Version v1
Other Open

Annotation of the novel humpback whale (Megaptera novaeangliae) reference genome

  • 1. University of Colorado Boulder

Description

We present a comprehensive annotation of the humpback whale (Megaptera novaeangliae) genome, representing the most complete and biologically coherent gene-model resource available to date for this species. Using R (RStudio v2025.05.1+513) within a fully reproducible analysis pipeline, we evaluated 1,308,486 genomic features, including 21,833 protein-coding genes and 386,765 annotated exons. The structural features of the annotation are consistent with mammalian expectations (median gene length ≈ 4.3 kb; intron lengths peaking in the 1–10 kb range) and show high coding-sequence integrity (99.7% of transcripts are in-frame). Gene loci are distributed non-uniformly across scaffolds, with densities ranging from ~7 to ~15 genes per megabase, and together cover ~25–30% of the assembled genome sequence. We further identified ~2,000 multi-copy gene families and numerous tandem duplication clusters (2–12 members each). These results indicate that the M. novaeangliae genome is both structurally complete and functionally rich, providing a robust foundation for future comparative, physiological, and conservation-genomics investigations.
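The summary statistics quoted in the description (median gene length, fraction of in-frame transcripts, genes per megabase) can be illustrated with a minimal sketch. This is not the authors' R pipeline: it is written in Python, the feature records and scaffold lengths are hypothetical toy values, and "in-frame" is assumed here to mean that a transcript's total CDS length is a multiple of three, which the record itself does not define.

```python
from collections import Counter, defaultdict
from statistics import median

# Hypothetical toy records: (scaffold, feature_type, start, end, parent_id).
# Coordinates are 1-based and inclusive, as in GFF3.
FEATURES = [
    ("scaffold_1", "gene", 1_001, 5_300, None),
    ("scaffold_1", "CDS", 1_001, 1_300, "t1"),
    ("scaffold_1", "CDS", 4_001, 4_300, "t1"),
    ("scaffold_1", "gene", 20_001, 22_000, None),
    ("scaffold_1", "CDS", 20_001, 20_600, "t2"),
    ("scaffold_2", "gene", 501, 9_500, None),
    ("scaffold_2", "CDS", 501, 1_101, "t3"),
]

# Hypothetical scaffold lengths in base pairs.
SCAFFOLD_LENGTHS = {"scaffold_1": 200_000, "scaffold_2": 125_000}


def gene_lengths(features):
    """Return the length in bp of every 'gene' feature."""
    return [end - start + 1
            for _, ftype, start, end, _ in features
            if ftype == "gene"]


def in_frame_fraction(features):
    """Fraction of transcripts whose summed CDS length is divisible by 3.

    Assumption: this is the working definition of 'in-frame' for the sketch.
    """
    cds_total = defaultdict(int)
    for _, ftype, start, end, parent in features:
        if ftype == "CDS":
            cds_total[parent] += end - start + 1
    if not cds_total:
        return 0.0
    in_frame = sum(1 for total in cds_total.values() if total % 3 == 0)
    return in_frame / len(cds_total)


def genes_per_mb(features, scaffold_lengths):
    """Gene count per megabase for each scaffold."""
    counts = Counter(scaffold
                     for scaffold, ftype, *_ in features
                     if ftype == "gene")
    return {scaffold: counts[scaffold] * 1_000_000 / length
            for scaffold, length in scaffold_lengths.items()}


if __name__ == "__main__":
    print("median gene length (bp):", median(gene_lengths(FEATURES)))
    print("in-frame fraction:", in_frame_fraction(FEATURES))
    print("genes per Mb:", genes_per_mb(FEATURES, SCAFFOLD_LENGTHS))
```

On real data the records would come from the annotation file rather than a literal list, and the same three functions would be applied unchanged.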

Notes

Funding provided by: The Unknown Scientist Institute

Files

Fig_ContigEdge_Genes_Clean.png

Files (2.6 MB)

MD5 checksum                          Size
md5:a59ad1179874ef355a4aab987a017166  42.4 kB
md5:9f50dd0993650312dc0e5fc5390e69d4  52.6 kB
md5:ff4a32e158410c427553a952d8aed026  58.4 kB
md5:8602fa9f93058ed04d44ca53a51f45d3  54.8 kB
md5:be124c9ec1b47797152a71a3ea7cf4ca  865.9 kB
md5:d527c0c6d8cc1300eac658abc83e019f  362.7 kB
md5:89164b3f3082a2787f1f6948800dbe82  165.0 kB
md5:9b2ec082d27d1b05dcb2d3c68324bcb6  63.4 kB
md5:6ae3ff13573d37e52e734e599faaab12  37.4 kB
md5:fd4119107a8c3f033a595939e3c2c678  342.5 kB
md5:c1155f003ca45dbca63ea498f45095df  62.5 kB
md5:8a5c49bb2470d13ed3cf1dd3fc706f35  50.5 kB
md5:88e983c84b8a89cf49eddd331c6b4b59  58.4 kB
md5:29f992f89c2b4c300f0b520562544c53  43.2 kB
md5:7e59a2b73ad5b5c521a8430dc7a9d8b2  111.4 kB
md5:bfb1e9d31f40e26f3cefb38e9dc6897f  128.5 kB
md5:60985015cb898588faba30b15f781945  82.9 kB
md5:c9e1af09ee646197e34f1fc56f43b300  56.8 kB

Additional details