Bayesian Performance Analysis for Algorithm Ranking Comparison

Rojas-Delgado Jairo; Ceberio Josu; Calvo Borja; Lozano Jose A.

doi:10.5281/zenodo.6320599

Published March 1, 2022 | Version 1.1

Software Open

Bayesian Performance Analysis for Algorithm Ranking Comparison

1. Basque Center for Applied Mathematics. Bilbao, Spain.
2. University of the Basque Country (UPV/EHU). Donostia, Spain.
3. Basque Center for Applied Mathematics. Bilbao, Spain / University of the Basque Country (UPV/EHU). Donostia, Spain.

Presentation code for the paper: Bayesian Performance Analysis for Algorithm Ranking Comparison.

This repository contains the presentation code and tools used in the published paper. More recent versions can be found in:

BayesPermus Package: https://github.com/ml-opt/BayesPermus
Presentation code: https://github.com/ml-opt/BayesPermusPresentation

Abstract

In the field of optimization and machine learning, the statistical assessment of results has played a key role in conducting algorithmic performance comparisons. Classically, null hypothesis statistical tests have been used, however, recently, alternatives based on Bayesian statistics have shown great potential in complex scenarios, especially, when dealing with the uncertainty in the comparison.

In this work, we delve deep into the Bayesian statistical assessment of experimental results by proposing a framework for the analysis of several algorithms on several problems/instances. To this end, experimental results for each experiment are transformed to their corresponding rankings of algorithms assuming that these rankings have been generated by a probability distribution (defined on permutation spaces). From the set of rankings, we estimate the posterior distribution of the parameters of the probability models considered, and several inferences concerning the analysis of the results are analysed. Particularly, we study questions related to the probability of having one algorithm in the first position of the ranking or the probability that one algorithm is in the ranking before another. Not limited to that, the assumptions, strengths, and weaknesses of the models in each case are studied. Finally, we provide some guidelines for the authors to avoid the misuse of the presented analysis.

Files

BayesPermusPresentation.zip

Files (6.0 MB)

Name	Size	Download all
BayesPermusPresentation.zip md5:209e1884420c0e0d703eb38d5be767e8	6.0 MB	Preview Download

	All versions	This version
Views	210	138
Downloads	58	21
Data volume	847.5 MB	125.0 MB

Bayesian Performance Analysis for Algorithm Ranking Comparison

Authors/Creators

Description

Files

BayesPermusPresentation.zip

Files (6.0 MB)