Published October 10, 2023 | Version v1
Conference paper Open

Is Performance of Object Storage Predictable for Serverless I/O Workloads? A Comparative Study

  • 1. ROR icon Universidad Rovira i Virgili
  • 2. Universitat Rovira i Virgili

Description

Serverless architectures abstract resource provisioning away from the user. However, this property may be at odds with performance. One example of this is Function as a Service (FaaS), where the lack of network addressability compels developers to resort to serverless storage services such as AWS S3 to share (intermediate) data between the functions. For IO-bound workflows, the literature has shown that the performance of parallel reads and writes highly depends on the level of parallelism. Simply put, both an excess or a deficiency in the number of functions may lead to longer IO times. The good news is that the provisioning of functions is fast. Consequently, it is feasible to auto-provision the serverless functions to the optimal number to minimize IO latency. For this, the performance of object storage must be predictable and consistent. We confirmed this in the past for IBM COS. And in this paper, we show that the same occurs to AWS S3. Concretely, we prove that the optimal level of parallelism for parallel reads and writes can be approximated analytically for AWS S3.

Files

CEC_23.pdf

Files (737.6 kB)

Name Size Download all
md5:03ecf7bf083bf35bc39723fee559e259
737.6 kB Preview Download

Additional details

Funding

European Commission
NEARDATA - Extreme Near-Data Processing Platform 101092644
European Commission
CloudSkin - Adaptive virtualization for AI-enabled Cloud-edge Continuum 101092646
European Commission
EXTRACT - A distributed data-mining software platform for extreme data across the compute continuum 101093110
Ministerio de Ciencia, Innovación y Universidades
Plataforma sin servidor de alto rendimiento para sistemas híbridos nube‐periferia PID2019-106774RB-C22
Ministerio de Ciencia, Innovación y Universidades
FPU21/00630 FPU21/00630