POIBM - Poisson batch correction through sample matching
Description
POIBM is a batch factor inference and correction method that is suited for heterogeneous RNA-seq or other count data datasets. It operates by simulataneously inferring the batch factors and a mapping between matching samples. This is advantageous for datasets, which comprise of samples of heterogeneous populations, in which unknown subpopulations match but e.g. the subpopulation fractions vary so the global population statistics cannot be matched.
Major features:
-
Simulatenous batch factor and sample matching inference reveals both the batch correction coefficients and putatively similar phenotypes in the data. The phenotypes need not to be prelabeled, but are learned in the process, as this is often difficult in patient derived samples.
-
Supports sample trimming for datasets that have only very little overlap
-
The model accounts for the discrete nature of RNA-seq data and models both expression and technical noise or the lack of thereof, operates on raw count data, and infers total RNA factors in the process
For the details about the method and validation on cancer cell line and patient data, please refer to our publication on the matter.
Files
anthakki-poibm-b4dad2030578.zip
Files
(572.4 kB)
Name | Size | Download all |
---|---|---|
md5:13c3cd7f9ea0aea188757acc5c0c5fd3
|
572.4 kB | Preview Download |
Additional details
Related works
- Is derived from
- Software: https://bitbucket.org/anthakki/poibm/ (URL)
- Is documented by
- Journal article: 10.1093/bioinformatics/btac124 (DOI)
Funding
- European Commission
- DECIDER - Improved clinical decisions via integrating multiple data levels to overcome chemotherapy resistance in high-grade serous ovarian cancer 965193
- European Commission
- HERCULES - Comprehensive characterization and effective combinatorial targeting of high-grade serous ovarian cancer via single-cell analysis 667403
- European Commission
- RESCUER - RESISTANCE UNDER COMBINATORIAL TREATMENT IN ER+ AND ER- BREAST CANCER. 847912
- Research Council of Finland
- Efficient computational methods to analyze big data in cancer at single-cell resolution 322927