Build Systems that Drive Business
Sep 30–Oct 1, 2018: Training
Oct 1–3, 2018: Tutorials & Conference
New York, NY

Systems Monitoring & Orchestration
Build and interact with real-world systems

The larger your applications get, the harder it is to understand their performance and troubleshoot problems. This increased complexity in applications and services is driving a stronger interest in monitoring and observability.

Explore how to monitor large-scale, complex, dynamic, and distributed systems built on emerging architectures like microservices and serverless. In these sessions, you'll learn about containers, microservices, and services architectures, including technologies like Docker, CoreOS, and Kubernetes.

We'll help you solve your toughest challenges with real-world advice from leaders in the field who have grappled with the same problems you're facing today. Like how to:

  • Diagnose complex issues in production environments
  • Instrument systems for maximum possible observability
  • Monitor applications being run in containers
  • Monitor system calls, garbage collection, and other interesting events in the Java Virtual Machine
  • Plan and deploy monitoring for your own custom applications in containers
9:00am–12:30pm Monday, October 1, 2018
Location: Sutton South/Regent Parlor Level: Intermediate
Secondary topics:  Systems Monitoring & Orchestration
Yuri Shkuro (Uber Technologies), Prithvi Raj (Uber), Won Jun Jang (Uber)
Average rating: *****
(5.00, 2 ratings)
Priyanka Sharma and Yuri Shkuro demonstrate how distributed tracing works and how to employ it in the development and operations of your applications in the programming language of your choice: Java, Go, Python, Node.js, C#, or C++. Read more.
9:00am–12:30pm Monday, October 1, 2018
Location: Nassau Level: Beginner
Secondary topics:  Systems Monitoring & Orchestration
James Meickle (Quantopian)
Average rating: ****.
(4.00, 1 rating)
Ansible is a "batteries included" automation, configuration management, and orchestration tool that's fast to learn and flexible enough for any architecture. Join James Meickle to get started with Ansible, with an eye toward sustainable development in cloud environments. Read more.
1:30pm–5:00pm Monday, October 1, 2018
Location: Nassau Level: Intermediate
Secondary topics:  Systems Monitoring & Orchestration
Michael Kehoe (LinkedIn)
Average rating: *....
(1.33, 3 ratings)
Michael Kehoe walks you through building a small monitoring utility for cgroup containers to illustrate best practices in container monitoring. You'll explore various cgroup constraints and learn how to specifically monitor for each of them to ensure that your application is behaving as expected. Along the way, Michael shares tricks and tips about monitoring containerized applications. Read more.
11:35am–12:15pm Tuesday, October 2, 2018
Location: Murray Hill Level: Beginner
Secondary topics:  Systems Monitoring & Orchestration
Liz Fong-Jones (Honeycomb), Dave Rensin (Google)
Average rating: ****.
(4.25, 4 ratings)
Implementing site reliability (SRE) engineering doesn't have to be intimidating, and it isn't only for cloud-native organizations. Liz Fong-Jones and Dave Rensin share eight key lessons Google's customer reliability engineering team learned helping large enterprises adopt SRE as an operations engineering model. Read more.
1:30pm–2:10pm Tuesday, October 2, 2018
Location: Beekman/Sutton North Level: Intermediate
Secondary topics:  Systems Monitoring & Orchestration
Jamie Wilkinson (Google)
Average rating: ***..
(3.00, 1 rating)
Jamie Wilkinson offers a brief overview of SLOs, shares a practical guide to implementing sustainable SLO-based alerting for systems of any size, and outlines the tooling required to supplement the system in the absence of cause-based alerting. Read more.
4:45pm–5:25pm Tuesday, October 2, 2018
Location: Sutton South/Regent Parlor Level: Intermediate
Secondary topics:  Systems Monitoring & Orchestration
Idit Levine (solo.io)
Average rating: *****
(5.00, 1 rating)
Idit Levine demonstrates common debugging techniques and offers an overview of Squash, a new tool and methodology that brings the power of modern popular debuggers to developers of microservices apps that run on container orchestration platforms. Read more.
4:45pm–5:25pm Tuesday, October 2, 2018
Location: Beekman/Sutton North Level: Intermediate
Secondary topics:  Systems Monitoring & Orchestration
Bridget Lane (Gannett | USA Today), Kris Vincent (Gannett | USA Today)
Average rating: ***..
(3.50, 4 ratings)
Three years ago, technical teams at USA TODAY NETWORK were completely siloed, making improvements and troubleshooting difficult and often blind to the rest of the technical organization. Bridget Lane and Kris Vincent explain how drastically the teams' tool belts, thought processes, and goals have changed as the company moved from silos to a single pane of glass. Read more.
2:25pm–3:05pm Wednesday, October 3, 2018
Location: Beekman/Sutton North Level: Beginner
Secondary topics:  Systems Monitoring & Orchestration
Jason Yee (Datadog)
Average rating: ****.
(4.67, 3 ratings)
Jason Yee shows how you can more easily test code in production while isolating the effect of potential issues using container orchestration and services meshes. Read more.