Published January 22, 2026 | Version v3

Dataset for "KnowYourCG: Facilitating base-level sparse methylome interpretation" --- HM450

  • 1. ROR icon Children's Hospital of Philadelphia

Description

Knowledgebases for the HumanMethylation450 Array (HM450)

This repository hosts curated knowledgebases for the Infinium HumanMethylation450 (HM450) array. These datasets are specifically formatted as RDS files for the knowYourCG Bioconductor package. They enable researchers to perform functional enrichment analysis on 450k data using updated genomic landmarks, chromatin states, and regulatory features.

1. Technical Metadata & Quality Control

Essential datasets for array-level metadata and data cleaning.

2. Genomic Context & Sequence Features

Knowledgebases describing the physical and evolutionary landscape of the 450k probes.

  • Chromosome — Updated chromosomal assignments.

  • CGI — CpG Island (CGI) associations.

  • nFlankCG — Nucleotide composition of sequences flanking the CpG.

  • Tetranuc2 — Tetranucleotide frequency signatures.

  • rmsk1 & rmsk2 — RepeatMasker repetitive elements.

3. Epigenomic States & Regulatory Elements

Annotations linking 450k sites to functional chromatin and protein binding data.

  • ChromHMM & REMCChromHMM — Chromatin state models and Roadmap Epigenomics updates.

  • HM — Histone modification peak overlaps.

  • TFBSrm — Transcription Factor Binding Sites.

  • CTCFbind — CTCF binding/insulator sites.

  • ABCompartment — Higher-order chromatin structure (A/B compartments).

  • PMD — Partially Methylated Domains.

4. Biological Signatures

Specialized datasets for tissue-specific and developmental biology.

  • ImprintingDMR — Differentially Methylated Regions associated with imprinting.

  • MetagenePC — Principal components of gene-level methylation.

References & Support

Files

Files (152.9 MB)

Name Size Download all
md5:a909819e78beaa5713e78034037a0956
1.6 MB Download
md5:04d570ee071cf4bfdaff109c803d1f29
1.3 kB Download
md5:38fb68bc45b5a423aa241968c15febeb
1.6 MB Download
md5:c73b292e85ae9424deda9c319f9840c0
1.7 MB Download
md5:78d205eae2a5c6f55b57aa9f62a0b27e
1.8 MB Download
md5:5b0d56f43ed743c299309049b56d8821
68.7 kB Download
md5:d122f8ab08621436957137c163b186f4
7.0 MB Download
md5:ef2d3d16a19c52d9e3a286fa6bab0c32
4.7 kB Download
md5:aa88ab2ea5ad941e40338f959c1d740d
1.6 MB Download
md5:99691e77d4cc07032fa6207ab6a47841
1.9 MB Download
md5:668c75c9cf76d30ad43438d77f246a9e
574.3 kB Download
md5:40c7ee998a204771b8e0dff154d2446d
7.7 MB Download
md5:ec513272e867af8b6726285c570c4a1f
1.8 MB Download
md5:a1c11fb6681d6a2a7d2d36bb580bfdd4
995.9 kB Download
md5:fc1ed6689cebf7b962e154859e32d678
1.5 MB Download
md5:3a047fc2bf55e0a301413d3ec31d9195
1.7 MB Download
md5:98f60f22170f5937573f240c9e7916bd
319.7 kB Download
md5:73c3291ca87d34c05fa94fe74c9b941d
336.9 kB Download
md5:cfd668d73ab93800be5908e11708c969
1.6 MB Download
md5:fe2efdad0a34fd1b95d43b51552bdbd8
119.2 MB Download

Additional details

Related works

Is supplement to
Dataset: 10.1126/sciadv.adw3027 (DOI)