Evaluating Real-World Generalizability of Algorithm Selection Models

Anonymous

doi:10.5281/zenodo.18350950

Published January 23, 2026 | Version v1

Peer review Open

Evaluating Real-World Generalizability of Algorithm Selection Models

Anonymous

Evaluating Real-World Generalizability of Algorithm Selection Models

This Zenodo upload contains:

Raw benchmark results (per algorithm, per dimension, per benchmark)
Instance sample files used to compute ELA features
Computed ELA feature tables
Aggregated performance tables used in the algorithm-selection experiments
Crossmatch test CSVs for distribution-comparison analyses

1. Top-level layout

At the top level you will find four benchmark folders, one folder with ELA features, several aggregated performance tables, and CSVs for crossmatch analyses:

BBOB/ Raw results + samples for BBOB benchmark
CEC/ Raw results + samples for CEC benchmark
ROB/ Raw results + samples for ROB benchmark
UAV/ Raw results + samples for UAV benchmark
ela/ Computed ELA feature tables (CSV)

performance_dim_6_noPRS.csv
performance_dim_6_no_agg_noPRS.csv
performance_dim_12_noPRS.csv
performance_dim_12_no_agg_noPRS.csv
performance_dim_18_noPRS.csv
performance_dim_18_no_agg_noPRS.csv
performance_dim_24_noPRS.csv
performance_dim_24_no_agg_noPRS.csv
performance_dim_30_noPRS.csv
performance_dim_30_no_agg_noPRS.csv

crossmatch_test.csv
crossmatch_test_cosine.csv
crossmatch_test_euclidean.csv

.ipynb_checkpoints/ (optional) notebook metadata folder

Top-level files explained

A) performance tables

performance_dim_{d}_noPRS.csv
Aggregated performance table for dimension d (mean aggregated over runs), excluding PRS.
performance_dim_{d}_no_agg_noPRS.csv
Non-aggregated performance table for dimension d (keeps per-run information), excluding PRS.

B) crossmatch tables

crossmatch_test*.csv
CSVs used for/produced by crossmatch-style distribution comparison tests (including cosine and euclidean variants).

2. Benchmark folders (ROB/, BBOB/, UAV/, CEC/)

Each benchmark folder follows the same structure and naming conventions.

Inside each of ROB/, BBOB/, UAV/, CEC/ you typically find:

_readme Short benchmark-specific notes (if present)
.ipynb_checkpoints/ (optional)
res_<BENCH>_dim<d>_samples.csv
res_<BENCH>_dim<d>budget20000<ALGO>.csv (multiple algorithms)

File types inside each benchmark folder

A) Instance samples (input for ELA)

res_<BENCH>_dim<d>_samples.csv
Instance sample data used to compute ELA features for benchmark <BENCH> in dimension <d>.
These files are the inputs to the ELA computation step.

B) Raw algorithm results

res_<BENCH>_dim<d>budget20000<ALGO>.csv
Raw performance results for a specific algorithm <ALGO> on benchmark <BENCH>, dimension <d>,
with evaluation budget 20000.

Placeholders:

<BENCH> is one of: ROB, BBOB, UAV, CEC
<d> is the problem dimension (e.g., 6, 12, 18, 24, 30)
<ALGO> is the algorithm identifier (e.g., DE, PSO, LSHADE, etc.)

3. ELA features folder (ela/)

The ela/ directory contains computed ELA feature tables generated from the res_*_samples.csv files.

Typical naming convention:

<BENCH>dim<d>scale<S>downsample<FLAG>.csv

Where:

<S> indicates the scaling mode used during feature generation (0, 1, or y)
<FLAG> indicates whether downsampling was applied (0/1)

4. AS_results_* folders

The AS_results folders contain the outputs of the algorithm selection experiments (produced by 2_AS.py). Files are saved per experimental setting, where the setting is encoded in the filename.

Each result file name follows the pattern:

TYPE_dim_<d>scale<S>downsample<True|False>.csv

Where:

TYPE indicates what is stored in the file:
- final_results: aggregated evaluation metrics and summary results for the AS experiment
- predictions: per-instance predictions (e.g., selected algorithm / predicted best / scores depending on the run)
- feature_importances: feature importance values from the trained model(s)
d is the problem dimension (e.g., 6, 12, 18, 24, 30)
S is the scaling mode used for the features/targets:
- 0 = no scaling
- 1 = min–max scaling of X and y
- y = min–max scaling of y only
downsample indicates whether downsampling was applied to the instance samples before computing ELA features:
- True = downsampling applied
- False = no downsampling

Typical files inside AS_results

final_results_dim_<d>scale<S>downsample<...>.csv
Summary metrics for the experiment setting (one row per model/strategy or per split, depending on configuration).
predictions_dim_<d>scale<S>downsample<...>.csv
Instance-level outputs for the experiment setting (e.g., chosen algorithm and/or predicted performance).
feature_importances_dim_<d>scale<S>downsample<...>.csv
Feature importance values computed from the trained model(s) for that setting.

5. Relationship between files (high level)

Benchmark sample files:
<BENCH>/res_<BENCH>_dim<d>_samples.csv
are used to compute ELA feature tables in:
ela/<BENCH>dim<d>scale<S>downsample<FLAG>.csv
Benchmark raw algorithm result files:
<BENCH>/res_<BENCH>dim<d>budget20000<ALGO>.csv
are aggregated into the top-level performance tables:
performance_dim{d}noPRS.csv and performance_dim{d}_no_agg_noPRS.csv
crossmatch_test*.csv are used for distribution-comparison / crossmatch analyses across datasets or feature spaces

Files

AlgorithmSelectionRealWorldDataTransferability.zip

Files (15.7 GB)

Name	Size	Download all
AlgorithmSelectionRealWorldDataTransferability.zip md5:e3ef6eccc55c195a81815e1d1ec8caf9	15.7 GB	Preview Download

Additional details

Repository URL: https://anonymous.4open.science/r/Algorithm_Selection_real_world_generalizability-1332
Programming language: Python , R

	All versions	This version
Views	31	31
Downloads	2	2
Data volume	31.4 GB	31.4 GB

Evaluating Real-World Generalizability of Algorithm Selection Models

Authors/Creators

Description

Evaluating Real-World Generalizability of Algorithm Selection Models

1. Top-level layout

Top-level files explained

2. Benchmark folders (ROB/, BBOB/, UAV/, CEC/)

3. ELA features folder (ela/)

4. AS_results_* folders

5. Relationship between files (high level)

Files

AlgorithmSelectionRealWorldDataTransferability.zip

Files (15.7 GB)

Additional details

Software