Engineer for the future of Cloud

June 10-13, 2019
San Jose, CA

Add to Your Schedule

Chaos engineering: When the network breaks

Kolton Andrus (Gremlin)

2:10pm–2:55pm Thursday, June 13, 2019

Overcoming Obstacles: Lessons in Resilience
Location: Expo Session

Average rating:

(4.50, 2 ratings)

Who is this presentation for?

Site reliability engineers, network engineers, system admins, and network admins

Level

Beginner

Prerequisite knowledge

A basic understanding of production environments and the infrastructure required to run systems
Experience with Linux, cloud infrastructure, hardware, networking, and systems troubleshooting
Familiarity with chaos engineering (read “Rx onError Guidelines” for an overview)

What you'll learn

Gain a better understanding of determining how and when your network breaks
Deepen your understanding of how network chaos engineering attacks can be used to improve the resiliency of your cloud infrastructure
Learn about different types of network chaos engineering attacks, including packet loss, packet corruption, latency, and black hole

Description

Chaos engineering is a disciplined approach to identifying failures before they become outages. By proactively testing how a system responds under stress, you can identify and fix failures before they end up in the news. Chaos engineering lets you compare what you think will happen to what actually happens in your systems. You literally break things on purpose to learn how to build more resilient systems.

Tammy Butow leads a walk-through of network chaos engineering, covering the tools and practices you need to implement chaos engineering in your organization. Even if you’re already using chaos engineering, you’ll learn to identify new ways to use it to improve the resilience of your network and services. You’ll also discover how other companies are using chaos engineering and the positive results the companies have had using chaos to create reliable distributed systems.

Tammy begins by explaining chaos engineering and its principles. She then asks why many engineering teams (including Netflix, Gremlin, Dropbox, National Australia Bank, Under Armour, Twilio, and more) use chaos engineering and how every engineering team can use it to create reliable systems. You’ll learn how to get started using chaos engineering with your own team as you explore the tools to measure success and the chaos tools and new chaos features built into cloud services. You’ll also discover how to use wargame environments to learn about chaos engineering and how to practice chaos engineering on AWS DocumentDB, AWS DynamoDB, AWS RDS, and AWS S3. Other topics include how to use monitoring tools combined with chaos engineering to help you create reliable distributed systems, where you can learn more, and how to join the chaos community.

Kolton Andrus

Gremlin

Kolton is the founder of Gremlin – helping companies build more robust services. He was a Chaos Engineer at Netflix, focused on the resilience of the Edge services. He designed and built FIT: Netflix’s failure injection service. Prior he improved the performance and reliability of the Amazon Retail website. At both companies he has served as a ‘Call Leader’, managing the resolution of company-wide incidents. Kolton is a father of 5. He is passionate about building resilient systems, primarily as it lets him break things for fun and profit.

Premier Diamond Sponsor

Elite Sponsors

Gold Sponsors

Silver Sponsors

Exhibitors

Innovators

Sponsorship Opportunities

For exhibition and sponsorship opportunities, email velocity@oreilly.com

Partner Opportunities

For information on trade opportunities with O'Reilly conferences, email partners@oreilly.com

Contact Us

View a complete list of Velocity contacts

Information
About
Contact Us
Systems Engineering & Operations Newsletter
More Velocity Events
Diversity
Code of Conduct
Privacy Policy

©2019, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com