Published September 6, 2021 | Version v0.9-7
Software Open

POIBM - Poisson batch correction through sample matching

  • 1. University of Helsinki

Description

POIBM is a batch factor inference and correction method that is suited for heterogeneous RNA-seq or other count data datasets. It operates by simulataneously inferring the batch factors and a mapping between matching samples. This is advantageous for datasets, which comprise of samples of heterogeneous populations, in which unknown subpopulations match but e.g. the subpopulation fractions vary so the global population statistics cannot be matched.

Major features:

  • Simulatenous batch factor and sample matching inference reveals both the batch correction coefficients and putatively similar phenotypes in the data. The phenotypes need not to be prelabeled, but are learned in the process, as this is often difficult in patient derived samples.

  • Supports sample trimming for datasets that have only very little overlap

  • The model accounts for the discrete nature of RNA-seq data and models both expression and technical noise or the lack of thereof, operates on raw count data, and infers total RNA factors in the process

For the details about the method and validation on cancer cell line and patient data, please refer to our publication on the matter.

Files

anthakki-poibm-b4dad2030578.zip

Files (572.4 kB)

Name Size Download all
md5:13c3cd7f9ea0aea188757acc5c0c5fd3
572.4 kB Preview Download

Additional details

Related works

Is derived from
Software: https://bitbucket.org/anthakki/poibm/ (URL)
Is documented by
Journal article: 10.1093/bioinformatics/btac124 (DOI)

Funding

European Commission
DECIDER - Improved clinical decisions via integrating multiple data levels to overcome chemotherapy resistance in high-grade serous ovarian cancer 965193
European Commission
HERCULES - Comprehensive characterization and effective combinatorial targeting of high-grade serous ovarian cancer via single-cell analysis 667403
European Commission
RESCUER - RESISTANCE UNDER COMBINATORIAL TREATMENT IN ER+ AND ER- BREAST CANCER. 847912
Research Council of Finland
Efficient computational methods to analyze big data in cancer at single-cell resolution 322927