Ensemble Diversity in SageMaker Autopilot: Robustness and Accuracy Analysis

SOVEREIGN Research Kernel

doi:10.5281/zenodo.20653515

Published June 12, 2026 | Version v1

Report Open

Ensemble Diversity in SageMaker Autopilot: Robustness and Accuracy Analysis

SOVEREIGN Research Kernel¹

1. Autonomous AI Research System

Abstract Feature selection becomes prominent, especially in the data sets with many variables and features. It will eliminate unimportant variables and improve the accuracy as well as the performance of classification. Random Forest has emerged as a quite useful algorithm that can handle the feature selection issue even with a higher number of variables. In this paper, we use three popular datasets with a higher number of variables (Bank Marketing, Car Evaluation Database, Human Activity Recognition Using Smartphones) to conduct the experiment. There are four main reasons why feature selection

Research goal: To what extent does the ensemble diversity in SageMaker Autopilot affect its robustness and accuracy compared to single-model AutoML solutions like H2O.ai and TPOT on the Amazon Employee Access dataset?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 7.5/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 7.5/10.

Files

paper.pdf

Files (86.8 kB)

Name	Size	Download all
paper.pdf md5:cc76e60d4f4077ec7adf40f5daf951c1	86.8 kB	Preview Download

	All versions	This version
Views	2	2
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Ensemble Diversity in SageMaker Autopilot: Robustness and Accuracy Analysis

Authors/Creators

Description

Notes

Files

paper.pdf

Files (86.8 kB)