Dataset for "KnowYourCG: Facilitating base-level sparse methylome interpretation" --- HM450
Description
Knowledgebases for the HumanMethylation450 Array (HM450)
This repository hosts curated knowledgebases for the Infinium HumanMethylation450 (HM450) array. These datasets are specifically formatted as RDS files for the knowYourCG Bioconductor package. They enable researchers to perform functional enrichment analysis on 450k data using updated genomic landmarks, chromatin states, and regulatory features.
1. Technical Metadata & Quality Control
Essential datasets for array-level metadata and data cleaning.
-
ProbeType — Infinium Type I vs Type II probe design.
-
InfiniumChemistry — Technical chemistry metadata for the 450k platform.
-
Mask (2026 Update) — Latest probe masking for artifacts and SNPs.
-
Mask (Legacy, BioC default) — Previous version of quality masks.
-
Blacklist — Genomic regions prone to mapping interference.
2. Genomic Context & Sequence Features
Knowledgebases describing the physical and evolutionary landscape of the 450k probes.
-
Chromosome — Updated chromosomal assignments.
-
CGI — CpG Island (CGI) associations.
-
nFlankCG — Nucleotide composition of sequences flanking the CpG.
-
Tetranuc2 — Tetranucleotide frequency signatures.
3. Epigenomic States & Regulatory Elements
Annotations linking 450k sites to functional chromatin and protein binding data.
-
ChromHMM & REMCChromHMM — Chromatin state models and Roadmap Epigenomics updates.
-
HM — Histone modification peak overlaps.
-
TFBSrm — Transcription Factor Binding Sites.
-
CTCFbind — CTCF binding/insulator sites.
-
ABCompartment — Higher-order chromatin structure (A/B compartments).
-
PMD — Partially Methylated Domains.
4. Biological Signatures
Specialized datasets for tissue-specific and developmental biology.
-
ImprintingDMR — Differentially Methylated Regions associated with imprinting.
-
MetagenePC — Principal components of gene-level methylation.
References & Support
-
Software: knowYourCG (Bioconductor)
-
Background Paper: Goldberg and Fu et al., Science Advances (2025)
Files
Files
(152.9 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:a909819e78beaa5713e78034037a0956
|
1.6 MB | Download |
|
md5:04d570ee071cf4bfdaff109c803d1f29
|
1.3 kB | Download |
|
md5:38fb68bc45b5a423aa241968c15febeb
|
1.6 MB | Download |
|
md5:c73b292e85ae9424deda9c319f9840c0
|
1.7 MB | Download |
|
md5:78d205eae2a5c6f55b57aa9f62a0b27e
|
1.8 MB | Download |
|
md5:5b0d56f43ed743c299309049b56d8821
|
68.7 kB | Download |
|
md5:d122f8ab08621436957137c163b186f4
|
7.0 MB | Download |
|
md5:ef2d3d16a19c52d9e3a286fa6bab0c32
|
4.7 kB | Download |
|
md5:aa88ab2ea5ad941e40338f959c1d740d
|
1.6 MB | Download |
|
md5:99691e77d4cc07032fa6207ab6a47841
|
1.9 MB | Download |
|
md5:668c75c9cf76d30ad43438d77f246a9e
|
574.3 kB | Download |
|
md5:40c7ee998a204771b8e0dff154d2446d
|
7.7 MB | Download |
|
md5:ec513272e867af8b6726285c570c4a1f
|
1.8 MB | Download |
|
md5:a1c11fb6681d6a2a7d2d36bb580bfdd4
|
995.9 kB | Download |
|
md5:fc1ed6689cebf7b962e154859e32d678
|
1.5 MB | Download |
|
md5:3a047fc2bf55e0a301413d3ec31d9195
|
1.7 MB | Download |
|
md5:98f60f22170f5937573f240c9e7916bd
|
319.7 kB | Download |
|
md5:73c3291ca87d34c05fa94fe74c9b941d
|
336.9 kB | Download |
|
md5:cfd668d73ab93800be5908e11708c969
|
1.6 MB | Download |
|
md5:fe2efdad0a34fd1b95d43b51552bdbd8
|
119.2 MB | Download |
Additional details
Related works
- Is supplement to
- Dataset: 10.1126/sciadv.adw3027 (DOI)