1 / 9

Chaos Engineering to Establish Software Reliability

Chaos Engineering is the practice of <br>intentionally introducing controlled and <br>measurable failures into software systems <br>to build resilience and confidence in their <br>ability to withstand unexpected conditions.<br>

Gleecus
Download Presentation

Chaos Engineering to Establish Software Reliability

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. ChaosEngineeringto EstablishSoftwareReliability #techtuesdays

  2. Fallacies of Distributed Systems The network is reliable Latency is zero Bandwidth is infinite The network is secure Topology doesn't change There is one administrator Transport cost is zero The network is homogeneous #techtuesdays

  3. What is Chaos Engineering? Chaos Engineering is the practice of intentionally introducing controlled and measurable failures into software systems to build resilience and confidence in their ability to withstand unexpected conditions. #techtuesdays

  4. Principles of Chaos Engineering 01 Build a hypothesis around a steady-state 02 Simulate real-world events 03 Run experiments in production 04 Automate experiments and run them continuously 05 Minimize blast radius #techtuesdays

  5. Types of Chaos Engineering Experiment Game Days A game day is a simulation of a disaster scenario, designed to test the system's resiliency. Latency Injection Latency injection is the deliberate introduction of a delay in system response times to understand the impact of degraded performance. Volume Testing Volume testing is the process of increasing the volume of traffic to the system to assess its response to high levels of traffic. Infrastructure Failure Simulating infrastructure failures, such as server crashes or network outages, can help to identify weaknesses in the system's failover processes. #techtuesdays

  6. Common Implementation Areas for Chaos Engineering Blind spots of monitoring software Performance bottlenecks Hidden bugs #techtuesdays

  7. Benefits of Using Chaos Engineering for Software Testing Improved System Resilience Chaos Engineering can identify and resolve vulnerabilities in a system, making it more resilient to unexpected conditions. Increase Customer Confidence By improving system reliability and reducing downtime, Chaos Engineering can increase customer confidence in your product or service. Improved Team Culture Chaos Engineering requires engineers to work together and learn from their mistakes, which can lead to an improved team culture and a stronger sense of collaboration. #techtuesdays

  8. Common Tools of Chaos Engineering Chaos Kong Disables entire AWS availability zones. Chaos Monkey Randomly disables production environment instances to cause a system failure. Latency Introduces latency to simulate network outages and degradation. Gremlin A chaos engineering program that works with AWS and Kubernetes #techtuesdays

  9. Thank you for your time. Contact us: hello@gleecus.com #techtuesdays www.Gleecus.com

More Related