Build Systems that Drive Business
Sep 30–Oct 1, 2018: Training
Oct 1–3, 2018: Tutorials & Conference
New York, NY

Monitoring, Observability, and Performance sessions

The larger your applications get, the harder it is to understand their performance and troubleshoot problems. Increased complexity in applications and services requires new methods for monitoring and improved observability. In this track, you’ll learn best practices for
monitoring large-scale, complex, dynamic and distributed systems built on emerging architectures like microservices and serverless

Track host

Greg PoirierGreg Poirier (Sensu, Inc.) is the SVP of Engineering, where he leads the team building a scalable monitoring product for modern infrastructure. Previously, he was a systems engineer at Stripe, Oracle, Los Alamos National Laboratory, and Earthlink. He has a background in monitoring, systems, and infosec and holds a BS in Computer Science from Auburn University.

9:00am–12:30pm Monday, October 1, 2018
Location: Sutton South/Regent Parlor Level: Intermediate
Secondary topics:  Systems Monitoring & Orchestration
Yuri Shkuro (Uber Technologies), Prithvi Raj (Uber), Won Jun Jang (Uber)
Average rating: *****
(5.00, 2 ratings)
Priyanka Sharma and Yuri Shkuro demonstrate how distributed tracing works and how to employ it in the development and operations of your applications in the programming language of your choice: Java, Go, Python, Node.js, C#, or C++. Read more.
11:35am–12:15pm Tuesday, October 2, 2018
Location: Beekman/Sutton North Level: Beginner
Victoria Nguyen (Fastly)
Victoria Nguyen explains how Fastly overhauled the monitoring and data collection of its globally distributed network without its caches noticing. Read more.
1:30pm–2:10pm Tuesday, October 2, 2018
Location: Beekman/Sutton North Level: Intermediate
Secondary topics:  Systems Monitoring & Orchestration
Jamie Wilkinson (Google)
Average rating: ***..
(3.00, 1 rating)
Jamie Wilkinson offers a brief overview of SLOs, shares a practical guide to implementing sustainable SLO-based alerting for systems of any size, and outlines the tooling required to supplement the system in the absence of cause-based alerting. Read more.
2:25pm–3:05pm Tuesday, October 2, 2018
Location: Beekman/Sutton North Level: Intermediate
Secondary topics:  Resilient, Performant & Secure Distributed Systems
Naoman Abbas (Pinterest)
Average rating: *****
(5.00, 1 rating)
Naoman Abbas offers an overview of tools Pinterest built to process trace data and the use cases they’ve enabled and shares some real-world examples. Join in to learn how to apply these techniques to your own challenges. Read more.
3:50pm–4:30pm Tuesday, October 2, 2018
Location: Beekman/Sutton North Level: Beginner
Secondary topics:  Resilient, Performant & Secure Distributed Systems
Amy Nguyen (Stripe), Cory Watson (Stripe)
Average rating: ***..
(3.00, 1 rating)
You're unsatisfied with one of your monitoring providers. You've considered finding a new solution, but the thought of migrating your data off their platform sounds extremely painful. Amy Nguyen and Cory Watson explain how to make a deadline for an infrastructure-critical software migration while ensuring that everyone's requirements are met and no data has been lost. Read more.
4:45pm–5:25pm Tuesday, October 2, 2018
Location: Beekman/Sutton North Level: Intermediate
Secondary topics:  Systems Monitoring & Orchestration
Bridget Lane (Gannett | USA Today), Kris Vincent (Gannett | USA Today)
Average rating: ***..
(3.50, 4 ratings)
Three years ago, technical teams at USA TODAY NETWORK were completely siloed, making improvements and troubleshooting difficult and often blind to the rest of the technical organization. Bridget Lane and Kris Vincent explain how drastically the teams' tool belts, thought processes, and goals have changed as the company moved from silos to a single pane of glass. Read more.
4:45pm–5:25pm Tuesday, October 2, 2018
Location: Sutton South/Regent Parlor Level: Intermediate
Secondary topics:  Systems Monitoring & Orchestration
Idit Levine (solo.io)
Average rating: *****
(5.00, 1 rating)
Idit Levine demonstrates common debugging techniques and offers an overview of Squash, a new tool and methodology that brings the power of modern popular debuggers to developers of microservices apps that run on container orchestration platforms. Read more.
2:25pm–3:05pm Wednesday, October 3, 2018
Location: Murray Hill Level: Intermediate
Hooman Beheshti (Fastly)
Average rating: ****.
(4.50, 2 ratings)
Now that adoption is ramped up and HTTP/2 is being regularly used on the internet, it's a good time to revisit the protocol and its deployment. Hooman Beheshti reviews protocol basics and digs into core features such as interaction with TCP, server push, priorities and dependencies, and HPACK. Read more.