Build Systems that Drive Business
June 11–12, 2018: Training
June 12–14, 2018: Tutorials & Conference
San Jose, CA

Reliability from the ground up: Designing for five nines

Astrid Atkinson (Google)
9:05am–9:25am Wednesday, June 13, 2018
Location: Grand Ballroom 220
Average rating: 4.83 (24 ratings)

We often think of reliability in terms of improving individual components like machines, servers, and operational processes. But in a distributed system, something is always failing, and the more parts you add, the more vectors there are for failure. While tools and optimizations can help, reaching the highest levels of reliability requires rethinking the basic design of how components behave and interact.

Astrid Atkinson discusses techniques for building systems that are resilient by design.
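The abstract's point that more parts mean more ways to fail can be illustrated with some standard availability arithmetic. The sketch below is not taken from the talk. It assumes that components fail independently, which real systems rarely satisfy exactly, and all function names are hypothetical, chosen for illustration.

```python
# Illustrative availability arithmetic; assumes independent failures.
# None of these names or numbers come from the talk itself.

MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes, ignoring leap years


def downtime_minutes_per_year(availability: float) -> float:
    """Expected minutes of downtime per year at a given availability."""
    return (1.0 - availability) * MINUTES_PER_YEAR


def serial_availability(component_availability: float, count: int) -> float:
    """Availability of a request path that needs ALL `count` components up."""
    return component_availability ** count


def redundant_availability(replica_availability: float, replicas: int) -> float:
    """Availability of a service that needs ANY ONE of `replicas` up."""
    return 1.0 - (1.0 - replica_availability) ** replicas


if __name__ == "__main__":
    # Five nines leaves a budget of a little over five minutes per year.
    print(f"five nines: {downtime_minutes_per_year(0.99999):.2f} min/year")

    # Chaining ten "three nines" components in series loses about one nine.
    print(f"10 x 99.9% in series: {serial_availability(0.999, 10):.5f}")

    # Three independent 99% replicas in parallel recover several nines.
    print(f"3 x 99% in parallel: {redundant_availability(0.99, 3):.6f}")
```

Under these assumptions, a path through ten components that are each 99.9% available is only about 99.0% available, while three redundant 99% replicas reach roughly six nines. That is one way to read the abstract's claim that the highest levels of reliability depend on how components interact rather than on any single component.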

Astrid Atkinson

Google

Astrid Atkinson is director of software engineering at Google, where she leads development frameworks. During her 10+ years at Google, Astrid has built infrastructure, managed a variety of engineering teams, and spent more than five years on call for Google.com. She has led teams across the infrastructure map, from the team responsible for building and running Google’s web-serving layer to App Engine and cloud systems to core search.

Comments

Audra Carter | SENIOR SPEAKER MANAGER
07/09/2018 7:38am PDT

@Yasmeen Frishman – you can now find the slides at the top of this page.

Yasmeen Frishman | SENIOR SOFTWARE ENGINEER
07/09/2018 7:29am PDT

I also think that this was a great keynote! Will the slides also be available aside from the recording of this session?

Eric Bach | SOLUTION ENGINEER
06/18/2018 5:23am PDT

This was a great keynote! Will the content be available for viewing shortly?