Double-stage discretization approaches for biomarker-based bladder cancer survival modeling
- 1. University of Eastern Piedmont UPO
- 2. Enginsoft SpA
Description
Bioinformatic techniques targeting gene expression data require specific analysis pipelines for studying properties, adaptation, and disease outcomes in a sample population. The present investigation compared the results of four numerical experiments modeling survival rates from bladder cancer genetic profiles. A sequence of two discretization phases produced notably better results than the classic approach, which applies a single discretization to gene expression data. The two-phase analyses either followed a primary discretizer with a refinement step or pre-binned the input values before the main discretization scheme. Among all tests, the best model combined a data transformation to compensate for skewness, a discretization phase using the class-attribute interdependence maximization (CAIM) algorithm, and a final classification by voting feature intervals (VFI), a classifier that also optimizes discrete intervals.
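As a rough illustration of the pre-binning variant described above, the following sketch chains a skewness-compensating transform, an unsupervised pre-binning stage, and a supervised stage that greedily keeps only the cut points that raise a class-attribute interdependence maximization (CAIM) style criterion. It is not the authors' pipeline: the `log1p` transform, the equal-frequency pre-binning, all function names, and the toy data are assumptions made for illustration, and the voting feature intervals classifier is omitted.

```python
import math
from collections import Counter


def prebin_edges(values, n_bins):
    """Stage 1 (assumed): equal-frequency cut points used as candidate boundaries."""
    ordered = sorted(values)
    return sorted({ordered[i * len(ordered) // n_bins] for i in range(1, n_bins)})


def assign(value, cuts):
    """Index of the interval containing `value`, given sorted cut points."""
    return sum(value >= c for c in cuts)


def caim_score(values, labels, cuts):
    """CAIM-style criterion: average over intervals of (largest class count)^2 / interval size."""
    counts = [Counter() for _ in range(len(cuts) + 1)]
    for v, y in zip(values, labels):
        counts[assign(v, cuts)][y] += 1
    total = sum(max(c.values()) ** 2 / sum(c.values()) for c in counts if c)
    return total / len(counts)


def two_stage_discretize(values, labels, n_prebins=10):
    """Stage 2: greedily add the pre-bin edge that most improves the criterion."""
    candidates = prebin_edges(values, n_prebins)
    n_classes = len(set(labels))
    chosen, best = [], 0.0
    while candidates:
        score, cut = max(
            (caim_score(values, labels, sorted(chosen + [c])), c) for c in candidates
        )
        # Stop once there is no improvement and there are at least as many
        # intervals as classes.
        if score <= best and len(chosen) + 1 >= n_classes:
            break
        chosen = sorted(chosen + [cut])
        candidates.remove(cut)
        best = score
    return chosen


# Hypothetical toy data: one gene's expression values and a binary outcome label.
expression = [0.1, 0.2, 0.3, 0.4, 5.0, 6.0, 7.0, 8.0]
outcome = [0, 0, 0, 0, 1, 1, 1, 1]

# Assumed skewness compensation: log(1 + x).
transformed = [math.log1p(x) for x in expression]

cuts = two_stage_discretize(transformed, outcome, n_prebins=4)
codes = [assign(v, cuts) for v in transformed]
print(cuts, codes)
```

On this toy input the pre-binning stage proposes three candidate edges and the supervised stage keeps only the one that separates the two outcome groups. In a real setting each gene would be discretized this way and the resulting interval codes passed to a classifier.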
Files
10.2478_caim-2021-0003.pdf
Total size: 7.7 MB
Size | Checksum |
---|---|
3.2 MB | md5:21f32b0bd946efcfa15ba5944129d9e4 |
4.5 MB | md5:28e4f1e32d083ed59ee9cfc1a2d85959 |