Published February 19, 2026 | Version v1
Software Open

CARAT Artifact

  • 1. ROR icon University of Delaware
  • 2. Pacific Northwest National Laboratory
  • 3. EDMO icon Iowa State University

Contributors

Project members:

  • 1. Meta

Description

CARAT: Client-Side Adaptive RPC and Cache Co-Tuning for Parallel File Systems

Tuning parallel file system in High-Performance Computing (HPC) systems remains challenging due to the complex I/O paths, diverse I/O patterns, and dynamic system conditions. While existing autotuning frameworks have shown promising results in tuning PFS parameters based on applications’ I/O patterns, they lack scalability, adaptivity, and the ability to operate online. In this work, focusing on scalable online tuning, we present CARAT, an ML-guided framework to co-tune client-side RPC and caching parameters of PFS, leveraging only locally observable metrics. Unlike global or pattern-dependent approaches, CARAT enables each client to make independent and intelligent tuning decisions online, responding to real-time changes in both application I/O behaviors and system states. We then prototyped CARAT using Lustre and evaluated it extensively across dynamic I/O patterns, real-world HPC workloads, and multi-client deployments. The results demonstrated that CARAT can achieve up to 3× performance improvement over the default or static configurations, validating the effectiveness and generality of our approach. Due to its scalability and lightweight, we believe CARAT has the potential to be widely deployed into existing PFS and benefit various data-intensive applications.

Files

CARAT.zip

Files (1.6 GB)

Name Size Download all
md5:d9705236cf18f3686551602816a5d6a4
1.6 GB Preview Download

Additional details

Software