Is Performance of Object Storage Predictable for Serverless I/O Workloads? A Comparative Study
Authors/Creators
Description
Serverless architectures abstract resource provisioning away from the user. However, this property may be at odds with performance. One example of this is Function as a Service (FaaS), where the lack of network addressability compels developers to resort to serverless storage services such as AWS S3 to share (intermediate) data between the functions. For IO-bound workflows, the literature has shown that the performance of parallel reads and writes highly depends on the level of parallelism. Simply put, both an excess or a deficiency in the number of functions may lead to longer IO times. The good news is that the provisioning of functions is fast. Consequently, it is feasible to auto-provision the serverless functions to the optimal number to minimize IO latency. For this, the performance of object storage must be predictable and consistent. We confirmed this in the past for IBM COS. And in this paper, we show that the same occurs to AWS S3. Concretely, we prove that the optimal level of parallelism for parallel reads and writes can be approximated analytically for AWS S3.
Files
CEC_23.pdf
Files
(737.6 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:03ecf7bf083bf35bc39723fee559e259
|
737.6 kB | Preview Download |
Additional details
Funding
- European Commission
- NEARDATA - Extreme Near-Data Processing Platform 101092644
- European Commission
- CloudSkin - Adaptive virtualization for AI-enabled Cloud-edge Continuum 101092646
- European Commission
- EXTRACT - A distributed data-mining software platform for extreme data across the compute continuum 101093110
- Ministerio de Ciencia, Innovación y Universidades
- Plataforma sin servidor de alto rendimiento para sistemas híbridos nube‐periferia PID2019-106774RB-C22
- Ministerio de Ciencia, Innovación y Universidades
- FPU21/00630 FPU21/00630