Published October 17, 2022 | Version v1
Report Open

IMPROVING KUBERNETES SERVICE AVAILABILITY THROUGH CHAOS

Description

Chaos Engineering is the process of testing a distributed computing system to ensure that it can withstand unexpected disruptions. It relies on concepts underlying chaos theory, which focus on random and unpredictable behavior. The goal of chaos engineering is to identify weakness in a system through controlled experiments that introduce random and unpredictable behavior. There are many tools available, with different level of maturity.

Tools like:

• Chaos Mesh

• Litmus Chaos

• Chaos Toolkit

• Pumba

This project aims at selecting and integrating one of these tools into the CERN Kubernetes offering, with the goal of giving service managers a tool that will dramatically increase confidence on their own service availability.

Files

Improving_Kubernetes_Service_Availability_Through_Chaos.pdf

Files (1.7 MB)