Troubleshooting Resource Exhaustion during Pod Scaling in Kubernetes
Authors/Creators
Description
In Kubernetes, scaling applications horizontally by adding new pods is a fundamental practice for handling varying workloads. However, when resource exhaustion occurs, Kubernetes may fail to scale pods effectively, resulting in unscheduled or evicted pods and system instability. Resource exhaustion typically arises when nodes in the cluster do not have enough CPU, memory, or storage to accommodate additional pods. This article provides a comprehensive guide for troubleshooting this common issue. It covers key diagnostic steps, including checking node resource utilization, inspecting pod resource requests and limits, investigating pending pod status, and ensuring proper configuration of auto scalers and node affinity rules. Following the outlined troubleshooting approach, Kubernetes administrators can quickly identify the root causes of resource exhaustion and apply targeted solutions, ensuring more reliable and efficient pod scaling in production environments. The article also offers best practices for avoiding resource-related scaling issues in the future.
Files
EJAET-9-10-91-95.pdf
Files
(174.6 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:3c79dbcca2791d524875152946f8a292
|
174.6 kB | Preview Download |
Additional details
References
- [1]. B. Burns, J. Beda, K. Hightower and L. Evenson, Kubernetes Up and Running: Dive into the Future of Infrastructure, O'Reilly, 02 Aug 2022.
- [2]. Z. Wei-guo, M. Xi-lin and Z. Jin-zhong, "Research on Kubernetes' Resource Scheduling Scheme," in ICCNS '18: Proceedings of the 8th International Conference on Communication and Network Security, 02 Nov 2018.
- [3]. Q.-M. Nguyen, L.-A. Phan and T. Kim, "Load-Balancing of Kubernetes-Based Edge Computing Infrastructure Using Resource Adaptive Proxy," vol. 22, no. 8, 22 Feb 2022.
- [4]. A. P. Ferreira and R. Sinnott, "A Performance Evaluation of Containers Running on Managed Kubernetes Services," in 2019 IEEE International Conference on Cloud Computing Technology and Science (CloudCom), Sydney, NSW, Australia, 28 Jan 2020.
- [5]. M. Fogli, T. Kudla, B. Musters, G. Pingen, C. V. d. Broek and H. Bastiaansen, "Performance Evaluation of Kubernetes Distributions (K8s, K3s, KubeEdge) in an Adaptive and Federated Cloud Infrastructure for Disadvantaged Tactical Networks," in 2021 International Conference on Military Communication and Information Systems (ICMCIS), The Hague, Netherlands, 20 July 2021.
- [6]. K. Lehtinen, "Scaling a Kubernetes Cluster," 20 Apr 2022.