Published January 22, 2026 | Version v3
Dataset | Open

Dataset for "KnowYourCG: Facilitating base-level sparse methylome interpretation" --- EPICv2

  • 1. Children's Hospital of Philadelphia

Description

Knowledgebases for the MethylationEPIC v2.0 Array (EPICv2)

This repository hosts curated knowledgebases for the Infinium MethylationEPIC v2.0 (EPICv2) array. The datasets are provided as RDS files for use with the knowYourCG Bioconductor package, where they support functional enrichment analysis and technical quality control on this second-generation EPIC platform.
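
As a rough illustration of what a set-based enrichment test against one of these knowledgebases involves, the sketch below computes a one-sided hypergeometric overlap p-value in plain Python. It is not knowYourCG's implementation: the function name `overlap_pvalue`, the probe IDs, and the toy sets are all hypothetical, and real analyses should run the package itself on the RDS files.

```python
from math import comb


def overlap_pvalue(query, feature, universe):
    """Return (overlap size, one-sided hypergeometric p-value).

    The p-value is P(X >= k), where X is the overlap obtained when
    len(query) probes are drawn at random from the universe and k is
    the observed overlap between the query and the feature set.
    """
    universe = set(universe)
    query = set(query) & universe      # ignore probes outside the universe
    feature = set(feature) & universe
    n_univ, n_query, n_feat = len(universe), len(query), len(feature)
    k = len(query & feature)

    total = comb(n_univ, n_query)      # all ways to draw the query set
    upper = min(n_query, n_feat)       # largest possible overlap
    tail = sum(
        comb(n_feat, i) * comb(n_univ - n_feat, n_query - i)
        for i in range(k, upper + 1)
    )
    return k, tail / total


# Toy data with made-up probe IDs (not real EPICv2 identifiers).
universe = {f"cg{i:04d}" for i in range(20)}
feature = {f"cg{i:04d}" for i in range(5)}          # e.g. probes in one annotation
query = {"cg0000", "cg0001", "cg0002", "cg0010"}    # e.g. probes of interest

overlap, p_value = overlap_pvalue(query, feature, universe)
print(f"overlap={overlap}, p={p_value:.4f}")
```

The universe argument matters here: restricting it to the probes actually present on the array keeps the background consistent with the platform being tested.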

Technical Metadata & Quality Control

Datasets describing the physical architecture of the EPICv2 array and recommended filtering.

  • ProbeType — Infinium Type I vs Type II probe design for EPICv2.

  • InfiniumChemistry — Technical chemistry metadata.

  • Mask — Probes flagged for technical artifacts, SNPs, or cross-reactivity specific to the v2 design.

Genomic Context & Sequence Features

Basic genomic landmarks and sequence-level characteristics.

  • CGI — CpG Islands.

  • nFlankCG — Flanking sequence composition.

  • Tetranuc2 — Tetranucleotide frequency.

  • rmsk1 — RepeatMasker repetitive elements (Set 1).

  • rmsk2 — RepeatMasker repetitive elements (Set 2).

Epigenomic States & Regulatory Elements

Functional annotations for chromatin states, histone marks, and transcriptional regulation.

  • ChromHMM — Chromatin state models.

  • REMCChromHMM — Roadmap Epigenomics chromatin states.

  • HM — Histone modification peaks.

  • TFBSrm — Transcription Factor Binding Sites.

  • CTCFbind — CTCF binding sites.

  • ABCompartment — A/B chromatin compartments.

  • PMD — Partially Methylated Domains.

Tissue Specificity & Biological Signatures

Specialized methylation phenomena and metagene summaries.

  • ImprintingDMR — Imprinted gene regions (Differentially Methylated Regions).

  • MetagenePC — Principal components of gene-level methylation.

Files (238.9 MB)

Checksum | Size
md5:55edde92cb02542277183c1c52c43a2d | 3.4 MB
md5:361dd2456a81b025c87a4b633b2ef30b | 3.4 MB
md5:519802c6d1d2a8fd2c88e822ce69f422 | 3.6 MB
md5:f4e826b2eae5f0a05f1864ef4a416fe9 | 151.7 kB
md5:63bccb43e32256daeae09c726d5d607b | 10.3 MB
md5:e02cf56e8f245f6acbaaad55c1795f6b | 5.5 kB
md5:ef48cf261f00487770fdcf54cb8dcb24 | 3.2 MB
md5:5c7422f3d25db9643de5cb0b48fb6a5a | 813.8 kB
md5:5015fe0df90bafcdbd18ea2a7d731de0 | 15.3 MB
md5:faa84f8bffb6a89411d112ee8a7e0acf | 3.8 MB
md5:b56b9c95ef652f7ec3956858563eb0be | 2.2 MB
md5:e0242401fe883e638593a031172cd8d4 | 3.2 MB
md5:c87d233a6bf94ec066b09fa11eb3516b | 3.6 MB
md5:81d3e9f5c9d27e82ddcdc25c6bfe9997 | 898.5 kB
md5:f7e3dbca72af536a7917c6d7921aebd8 | 954.0 kB
md5:ab887ba9b330a10fc96093325aeea98f | 3.4 MB
md5:b8eeb9c732725d16bd1db2fba0dc1f0a | 180.5 MB
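
The `md5:` values in the file listing above can be used to confirm that a download arrived intact. A minimal sketch using only the Python standard library follows; the helper names `md5_of_file` and `matches_listing` are illustrative, and the local file name in the usage comment is a placeholder, since the listing does not show file names.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a local file against a listed checksum such as 'md5:55ed...'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected


# Usage (placeholder file name; substitute the path of the downloaded file):
# ok = matches_listing("downloaded_file.rds",
#                      "md5:55edde92cb02542277183c1c52c43a2d")
# print("checksum OK" if ok else "checksum mismatch, download again")
```

Reading in chunks is a deliberate choice: the largest file in the listing is 180.5 MB, so hashing it piecewise avoids loading it into memory at once.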

Additional details

Related works

Is supplement to
Dataset: 10.1126/sciadv.adw3027 (DOI)