Engineering the Future of Software
29–31 Oct 2018: Tutorials & Conference
31 Oct–1 Nov 2018: Training
London, UK

Introduction to chaos architecture: Gaining from learning loops and system weaknesses

Russ Miles (ChaosIQ)
10:4512:15 Monday, 29 October 2018
Location: Buckingham Room - Palace Suite
Average rating: ****.
(4.57, 7 ratings)

Who is this presentation for?

  • Software developers, architects, SREs, DevOps engineers, and system administrators

Prerequisite knowledge

  • Familiarity with platforms such as Kubernetes (useful but not required)

What you'll learn

  • Understand the goal of chaos engineering and the harmonious relationship between the chaos engineering and resilience engineering disciplines
  • Learn how to do chaos engineering on your systems, design game days, automate your first chaos experiments and justify them with contribution models, and share your experiments and their findings
  • Discover pitfalls to avoid


Chaos engineering helps you gain trust and confidence in your system of software development and delivery, but it is often misunderstood to be only about breaking things, and worse only about breaking infrastructure. Russ Miles debunks those limitations and demonstrates how chaos engineering can be a full part of your resilience engineering capability. Going way beyond merely infrastructure weaknesses, Russ explains how to discover, explore, and overcome weaknesses that will affect the resilience of your system in everything from your infrastructure, your platforms, your applications, and, yes, even you (people, practices, and processes).

Photo of Russ Miles

Russ Miles


Russ Miles is CEO of, where he and his team build commercial and open source products and provide services to companies applying chaos engineering to build confidence in the resilience of their production systems. Russ is an international consultant, trainer, speaker, and author. His most recent book, Antifragile Software: Building Adaptable Software with Microservices, explores how to apply chaos engineering to construct and manage complex, distributed systems in production with confidence.