Build Systems that Drive Business
Sep 30–Oct 1, 2018: Training
Oct 1–3, 2018: Tutorials & Conference
New York, NY

Troubleshooting Kubernetes applications

1:30pm–2:10pm Wednesday, October 3, 2018
Location: Beekman/Sutton North Level: Intermediate
Secondary topics:  Resilient, Performant & Secure Distributed Systems
Average rating: ***..
(3.50, 4 ratings)

Prerequisite knowledge

  • A basic understanding of containerized setups (Docker, etc.) and Kubernetes (useful but not required)

What you'll learn

  • Learn how to troubleshoot applications running in Kubernetes


Kubernetes makes it easy to run cloud-native applications in a resilient way. But what if something fails? How do you figure what caused a CrashLoopBackOff error message? What can be done about a service that is not reachable? How do you debug a containerized application or do fault injection in a microservices setup?

Michael Hausenblas addresses these questions and more, including:

  • Common reasons an app does not deploy and how to resolve them
  • Comprehensive debugging (from exec to leveraging logs)
  • Distributed tracing of microservices using Jaeger
  • Network-level troubleshooting (from IPtables to application-level issues)
  • Failure scenarios around storage (networked filesystems, databases)
  • Using chaos engineering to improve resiliency

Along the way, Michael shares concrete examples for each of the above topics, backed by a GitHub repo, and demonstrates some of them live.

Photo of Michael Hausenblas

Michael Hausenblas


Michael Hausenblas is a developer advocate at AWS, part of the container service team, focusing on container security. Michael shares his experience around cloud native infrastructure and apps through demos, blog posts, books, and public speaking engagements as well as contributes to open source software. Previously, was at Red Hat, Mesosphere, MapR, and in two research institutions in Ireland and Austria.