MLTE: A process and tool for test and evaluation of machine learning models

Derr, Alex

doi:10.5281/zenodo.18334025

Published January 15, 2026 | Version v1

Presentation Open

MLTE: A process and tool for test and evaluation of machine learning models

Derr, Alex

In January 2025, Alex Derr from the Software Engineering Institute at Carnegie Mellon University joined the HiRSE Seminar Series to talk about “MLTE: A process and tool for test and evaluation of machine learning models”

Abstract:
Test and Evaluation (T&E) of ML models largely focuses on model performance (e.g., accuracy) and often does not consider system aspects, which leads to models that fail in production. We will present MLTE, a semi-automated process and tool that enables negotiation, specification, and testing of ML model functional and non-functional requirements. A Negotiation Card records results of stakeholder discussions, which drive model development decisions and relevant test cases. MLTE automates test case execution and stores results that can be shared with stakeholders to provide evidence of testing that guides future iterations and system-level decisions.

The presentation recording is available on the HiRSE YouTube Channel: https://www.youtube.com/watch?v=ivkwKwCx_mQ

Learn more about the HiRSE Seminar Series: https://www.helmholtz-hirse.de/series.html

Files

Files (19.1 MB)

Name	Size	Download all
20260115 - HiRSE - MLTE Demo.pptx md5:2a2e87f56ed15b53bca2b7e599df44dd	19.1 MB	Download

	All versions	This version
Views	67	67
Downloads	4	4
Data volume	76.5 MB	76.5 MB

MLTE: A process and tool for test and evaluation of machine learning models

Authors/Creators

Description

Files

Files (19.1 MB)