Organizations successfully adapt to change by learning from incidents: developing ways to prepare, examine, discuss, and imagine them.
Ryan Kitchens, Lorin Hochstein, and Nora Jones go beyond the traditional ways of responding to and learning from incidents to explore more effective approaches and techniques that help you build the capacity to encounter failure and manage the consequences of failure successfully.
You’ll then have the opportunity to role-play different incident scenarios.
Set context about what’s going to happen in the tutorial
Examples of things we can learn
Questions should reveal (from Etsy Debrief Facilitation Guide)
Traps to avoid
Put your new skills to use during incident role-playing scenarios, where you will have the opportunity to play someone involved in the incident or an incident investigator. TAs and presenters will be there to guide the exercises.
Ryan Kitchens is a site reliability engineer on the CORE team at Netflix, where he works on building capacity across the organization to ensure its availability and reliability. Previously, Ryan was a founding member of the SRE team at Blizzard Entertainment.
Lorin Hochstein is a senior software engineer on the cloud operations and reliability engineering (CORE) team at Netflix, where he works on ensuring that Netflix remains available. Previously, he was the senior software engineer at SendGrid, lead architect for cloud services at Nimbis Services, computer scientist at the University of Southern California’s Information Sciences Institute, and assistant professor in the Department of Computer Science and Engineering at the University of Nebraska-Lincoln. Lorin holds a BEng in computer engineering from McGill University, an MS in electrical engineering from Boston University, and a PhD in computer science from the University of Maryland.
Nora Jones practices chaos engineering and human factors at Slack and is a student of human factors and systems safety at Lund University. She’s passionate about resilient software, people, and the intersection of those two worlds. She cowrote the book on chaos engineering with her teammates while working at Netflix and keynoted at AWS re:Invent in 2017 to an audience of over 40,000 people about the technical benefits and business case behind implementing chaos engineering.
Comments on this page are now closed.
©2019, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org