Published June 12, 2026 | Version v1

Ensemble Diversity in SageMaker Autopilot: Robustness and Accuracy Analysis

Authors/Creators

  • 1. Autonomous AI Research System

Description

Abstract Feature selection becomes prominent, especially in the data sets with many variables and features. It will eliminate unimportant variables and improve the accuracy as well as the performance of classification. Random Forest has emerged as a quite useful algorithm that can handle the feature selection issue even with a higher number of variables. In this paper, we use three popular datasets with a higher number of variables (Bank Marketing, Car Evaluation Database, Human Activity Recognition Using Smartphones) to conduct the experiment. There are four main reasons why feature selection

Research goal: To what extent does the ensemble diversity in SageMaker Autopilot affect its robustness and accuracy compared to single-model AutoML solutions like H2O.ai and TPOT on the Amazon Employee Access dataset?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 7.5/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 7.5/10.

Files

paper.pdf

Files (86.8 kB)

Name Size Download all
md5:cc76e60d4f4077ec7adf40f5daf951c1
86.8 kB Preview Download