Published February 26, 2026 | Version v1
Dataset Open

Homo sapiens protein-coding sequences

  • 1. ROR icon Centro de Biología Molecular Severo Ochoa

Description

The Fasta file contains 19249 protein-coding sequences (CDS) annotated in the human genome according to the genome GRCh38 (hg38) in the Ensembl database (Release 57). These data were downloaded from: https://ppuigbo.me/programs/CAIcal/human_genes_from_ensembl

Files

Files (35.7 MB)

Name Size Download all
md5:a8159e1b80aea85e8be11488fbfe9544
35.7 MB Download

Additional details

Funding

Agencia Estatal de Investigación
AEI - PID2024-159768OB-I00