Published October 24, 2025 | Version v1 | Dataset | Open
Dataset for "KnowYourCG: Facilitating base-level sparse methylome interpretation" --- mm10
Description
KYCG Knowledgebase Sets (mm10)
Overview
This repository contains comprehensive knowledgebase sets for the KnowYourCG (KYCG) framework, designed for functional DNA methylation analysis at base-level resolution in mouse (mm10). These databases enable rapid enrichment testing and interpretation of diverse methylation datasets, including sparse sequencing data (low-pass, single-cell), 5-hydroxymethylation (5hmC) profiles, spatial methylomes, and array-based datasets.
Citation: Goldberg DC, Fu H, Atkins D, Moyer E, Lee CN, Deng Y, Zhou W. (2025). KnowYourCG: Facilitating base-level sparse methylome interpretation. Science Advances 11(43). DOI: 10.1126/sciadv.adw3027
Reference Coordinates
- Complete reference coordinates for all CpG sites in mm10 (excluding contigs)
- Essential baseline for enrichment testing and coordinate mapping
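To illustrate why a complete set of reference coordinates matters for enrichment testing, the sketch below computes a one-sided hypergeometric p-value with the full CpG set as the background universe. This is a generic set-overlap test written with the Python standard library, not KYCG's own implementation; the function name, its parameters, and the toy counts are all hypothetical.

```python
from math import comb


def hypergeom_enrichment_p(universe: int, feature: int, query: int, overlap: int) -> float:
    """Return P(X >= overlap) for a query of CpGs drawn without replacement.

    universe: number of CpGs in the reference (background) set
    feature:  number of universe CpGs annotated with the knowledgebase feature
    query:    number of CpGs in the query set
    overlap:  number of query CpGs that carry the feature
    """
    if min(universe, feature, query, overlap) < 0:
        raise ValueError("counts must be non-negative")
    if feature > universe or query > universe:
        raise ValueError("feature and query sets must fit inside the universe")
    if overlap > min(feature, query):
        raise ValueError("overlap cannot exceed the feature or query size")

    total = comb(universe, query)
    # Sum the upper tail: every overlap at least as large as the observed one.
    tail = sum(
        comb(feature, k) * comb(universe - feature, query - k)
        for k in range(overlap, min(feature, query) + 1)
    )
    return tail / total


if __name__ == "__main__":
    # Toy numbers only: 10 CpGs in the universe, 4 annotated, 3 queried, 2 overlapping.
    print(hypergeom_enrichment_p(universe=10, feature=4, query=3, overlap=2))
```

In the toy call, 40 of the 120 possible query draws overlap the feature at least twice, so the p-value is 1/3. Shrinking the universe to only the CpGs covered by an experiment changes every term in the sum, which is why the background set has to be chosen deliberately.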
I. Sequence Features
- nFlankCG.20220321.cm - CpG count in flanking regions (standard window)
- nFlankCG50.20231025.cm - CpG count within 50bp flanking regions
- nFlankCG100.20231025.cm - CpG count within 100bp flanking regions
- Tetranuc2.20220321.cm - Four-base sequence context surrounding CpG sites
- CGI.20220904.cm - CpG island annotations
- rmsk1.20220307.cm + .idx - RepeatMasker annotations (class 1)
- rmsk2.20220321.cm + .idx - RepeatMasker annotations (class 2)
II. Genomic Features
- Chromosome.20221129.cm - Basic chromosome annotations
- ChromosomeXY.20230901.cm - Sex chromosome-specific features
- Centromere.20221129.cm - Centromeric regions
- Win100k.20220228.cm - 100kb genomic window annotations
- ABCompartment.20220911.cm - A/B compartment annotations (open/closed chromatin)
- PMD.20220911.cm - Partially Methylated Domains
- CTCFbind.20220911.cm - CTCF binding sites (chromatin loop anchors)
- ChromHMM.20220303.cm - ChromHMM state annotations for mouse tissues
- ChromHMMfullStack.20230515.cm - Comprehensive ChromHMM states across multiple cell types
- HM.20221013.cm + .idx - Comprehensive histone modification marks (H3K4me3, H3K27ac, H3K9me3, H3K27me3, etc.)
- MetagenePC.20220911.cm + .idx - Positional information relative to gene features (promoters, gene bodies, 3'UTRs)
- TFBS.20220921.cm + .idx - Transcription factor binding site (TFBS) collection, part 1
- TFBSrm.20221005.cm + .idx - Roadmap Epigenomics TFBS for mouse
III. Trait Associates
- TiSigMouse.20221209.cm + .idx - Mouse tissue and cell type signatures
- TiSigMouseBrain.20221209.cm + .idx - Mouse brain cell type signatures
- TiSigMouseDevelopment.20221209.cm + .idx - Developmental stage-specific signatures
- ImprintingDMR.20220818.cm - Differentially methylated regions (DMRs) at imprinted mouse loci
- IntermediateMeth.20221121.cm - CpGs with intermediate methylation levels (25-75%)
- IntermediateMethS.20221121.cm - Stable intermediate methylation sites
- XCILinkedWGBS.20221121.cm - X-chromosome inactivation-associated CpGs
- XCILinkedWGBSSorted.20221121.cm - Sorted XCI-linked sites
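As a small illustration of the 25-75% range mentioned for the intermediate methylation sets, the sketch below selects CpGs whose methylation fraction falls in that window. It assumes methylation is expressed as a fraction between 0 and 1 and that both bounds are inclusive; the source does not state how the boundaries are handled, and the function name and CpG identifiers are hypothetical.

```python
from typing import Dict, List, Optional


def intermediate_cpgs(
    betas: Dict[str, Optional[float]],
    low: float = 0.25,
    high: float = 0.75,
) -> List[str]:
    """Return IDs of CpGs whose methylation fraction lies in [low, high].

    CpGs with no measurement (None) are skipped, which is common in sparse data.
    Inclusive bounds are an assumption made for this sketch.
    """
    return [
        cpg_id
        for cpg_id, beta in betas.items()
        if beta is not None and low <= beta <= high
    ]


if __name__ == "__main__":
    # Hypothetical CpG identifiers and methylation fractions.
    example = {"cpg_a": 0.10, "cpg_b": 0.50, "cpg_c": None, "cpg_d": 0.75}
    print(intermediate_cpgs(example))
```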
IV. Technical Associates
- Blacklist.20220304.cm - Problematic genomic regions for filtering (high coverage artifacts, repeats)
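The file names in the lists above follow a visible pattern: a set name, an eight-digit stamp, and a `.cm` extension. The sketch below splits a name into those parts, which can help when sorting or comparing versions. Reading the stamp as a YYYYMMDD date is an assumption based on its appearance, and the helper names are hypothetical. Index files are not handled because the listing only says "+ .idx" without giving their full names.

```python
import re
from datetime import date, datetime
from typing import NamedTuple


class KnowledgebaseFile(NamedTuple):
    name: str         # set name, e.g. "nFlankCG50"
    stamp_date: date  # eight-digit stamp, assumed to be YYYYMMDD


# <name>.<8 digits>.cm, as seen in the file lists.
_CM_PATTERN = re.compile(r"^(?P<name>[A-Za-z0-9]+)\.(?P<stamp>\d{8})\.cm$")


def parse_cm_name(filename: str) -> KnowledgebaseFile:
    """Split a knowledgebase file name into its set name and date stamp."""
    match = _CM_PATTERN.match(filename)
    if match is None:
        raise ValueError(f"not a <name>.<YYYYMMDD>.cm file name: {filename!r}")
    stamp_date = datetime.strptime(match["stamp"], "%Y%m%d").date()
    return KnowledgebaseFile(match["name"], stamp_date)


if __name__ == "__main__":
    for fname in ("nFlankCG50.20231025.cm", "Blacklist.20220304.cm"):
        print(parse_cm_name(fname))
```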
Resources
Funding: NIH/NIGMS 5R35GM146978
Files
Total size: 278.4 MB
| MD5 checksum | Size |
|---|---|
| 00132568ac11fff1f6120bfa65b6b28f | 636 Bytes |
| 9dc83c947b8e4e0cb1f34626ef1761b1 | 122.9 kB |
| ca06426c56115acc2c3a056836755fa8 | 878.2 kB |
| ae38cfa904ef1a5a8c09fbc4f2354d6b | 878.0 kB |
| 2f99ae2926273a48140767fadb4a169d | 6.4 MB |
| 8ab4b4fd913fb1bf7300374985ed76ef | 269 Bytes |
| 5d76574ee4f195c7d2d8f4cd7433c998 | 95 Bytes |
| 202e27f4ad168601d63a9f1073811be6 | 23.8 MB |
| 81b337a04dacecacbde06b4f2e9e2274 | 1.0 MB |
| efa88450646dbb13efc9965d150754e4 | 146 Bytes |
| 8b33cdb1489ed07d4b069d9b4b8f02ac | 115.7 kB |
| b01be1cf12ae1233a0bc6c7159b8e4fa | 20.0 MB |
| eed392355ee313fda2bf3ecab2c12ea0 | 1.5 MB |
| b7e950d0bde019186146dd509788d7a9 | 123 Bytes |
| a630eceec27fbe4557ff4030dfa2af09 | 5.9 MB |
| 173d72f07e14ea76e60ea89f7334a5b3 | 1.6 kB |
| bc0706314e280bafc807dc7546e8dbfb | 52.4 MB |
| 5000a77602b957ebc80c721d421c8ec6 | 2.5 MB |
| 5d9420ce23f652e25d703230b19b64b4 | 709 Bytes |
| 5b6d629ab20ebc6af46e7a5c848369a9 | 7.8 MB |
| 41d673b759840bb05e198d3c7c61d350 | 6.8 MB |
| d5d5e4b47b44c7766ad0479bc86d0e0a | 5.0 MB |
| a061eed474d53256cd44333d46fa7e6b | 16.7 kB |
| 999977d4175d8cb9a980f47d814da21d | 3.3 MB |
| 722e585a623a744b17f7764ac8efdacf | 374 Bytes |
| 69157385a1d38721a2f7305ff3f6553f | 4.6 MB |
| 8cc6eb976920b70354e545635f494260 | 1.2 kB |
| 156e9ae7c14f5da56cd8f289e505fd30 | 6.8 MB |
| 0adfc3712c8a204b89ae8b8efa16aa44 | 13.2 MB |
| 1bba80847bac38cfd798ac320076a27f | 56.7 MB |
| ec45849106d224b1bf6c6ad89f76e59f | 13.7 kB |
| 57ee98a4b9cf91312ee7d29e12351fe7 | 3.9 MB |
| bc63f01560a9b273467c12c981dbec7d | 53.4 MB |
| 06c0f44abc5e3113c0cf5ddc33eade6b | 12.5 kB |
| b4b76999271d85f22c034fe0dba0a7ac | 639.7 kB |
| a5e575009bc63f88c222404032f5fb8f | 154.7 kB |
| 6534a8722a1fd1d0c0c74b0f9d4531b5 | 17.0 kB |
| 6d19164607ca6c74dab1213528296e7e | 513.5 kB |
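The MD5 checksums in the file listing can be used to confirm that a download is intact. The following is a minimal sketch using Python's standard `hashlib`; the helper names are hypothetical, and the local path in the usage comment is a placeholder, since the listing does not pair checksums with file names.

```python
import hashlib
from pathlib import Path
from typing import Union


def md5_of_file(path: Union[str, Path], chunk_size: int = 1 << 20) -> str:
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files are never loaded whole.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_md5(path: Union[str, Path], expected: str) -> bool:
    """Compare a file's MD5 with an expected value such as 'md5:0013...'."""
    # Accept the digest with or without the 'md5:' prefix used in the listing.
    expected_hex = expected.strip().lower().split(":", 1)[-1]
    return md5_of_file(path) == expected_hex


# Usage, with a placeholder path to a downloaded file:
# ok = verify_md5("path/to/downloaded_file", "md5:<checksum from the listing>")
```

A mismatch usually means a truncated or corrupted download, so fetching the file again is the first thing to try.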