Build Systems that Drive Business
30–31 Oct 2018: Training
31 Oct–2 Nov 2018: Tutorials & Conference
London, UK

Schedule: Monitoring, Observability, and Performance sessions

The larger your applications get, the harder it is to understand their performance and troubleshoot problems. Increased complexity in applications and services requires new methods for monitoring and improved observability. In this track, you’ll learn best practices for monitoring large-scale, complex, dynamic and distributed systems built on emerging architectures like microservices and serverless.

Track host

Marcus Barczak (Fastly Inc.) is a Sr. Principal Engineer at Fastly, where he works on the Production Engineering team. From first cutting his teeth on MRTG back in the day to exploring new ways of drawing insight out of millions of metrics at Etsy, Marcus loves helping people better understand how their software runs wild in production.

14:00–17:30 Wednesday, 31 October 2018
Location: King's Suite - Sandringham
Secondary topics:  Resilient, Performant & Secure Distributed Systems
Lee Calcote (Layer5), Girish Ranganathan (Layer5)
Average rating: **...
(2.67, 6 ratings)
Lee Calcote and Girish Ranganathan walk you through building observable, resilient, and secure microservices with Istio and Kubernetes.
11:20–12:00 Thursday, 1 November 2018
Location: King's Suite - Balmoral
Secondary topics:  Systems Architecture & Infrastructure
Marcus Barczak (Fastly)
Average rating: ****.
(4.62, 8 ratings)
How might your organization navigate a move from traditional push-based monitoring to a pull-based system? Marcus Barczak explains how Fastly migrated to Prometheus for its infrastructure and application monitoring.
13:15–13:55 Thursday, 1 November 2018
Location: King's Suite - Balmoral
Secondary topics:  Systems Monitoring & Orchestration
Maxime Petazzoni (SignalFx)
Average rating: ***..
(3.25, 8 ratings)
Maxime Petazzoni explains why monitoring custom application metrics is essential for visibility into the internal workings of a system and shares a framework for properly instrumenting them, along with a number of relevant use cases.
14:10–14:50 Thursday, 1 November 2018
Location: King's Suite - Balmoral
Average rating: ***..
(3.20, 5 ratings)
Constance Caramanolis simulates a production incident and walks you through a page, from the dreaded PagerDuty notification to resolution, demonstrating how engineers at Lyft use Envoy’s extensive metrics to identify the root cause of the incident and then remedy the situation.
15:40–16:20 Thursday, 1 November 2018
Location: King's Suite - Balmoral
Secondary topics:  Systems Monitoring & Orchestration
Adrian McMichael (Rightmove)
Average rating: ****.
(4.30, 10 ratings)
Adrian McMichael explores property portal Rightmove's structured approach to logging and monitoring across more than 50 microservices, showing you how to get to the bottom of production issues and helping you drive improvement and a sense of ownership in your projects.
16:35–17:15 Thursday, 1 November 2018
Location: King's Suite - Balmoral
Secondary topics:  Systems Monitoring & Orchestration
Average rating: ***..
(3.38, 8 ratings)
Looking at a service in isolation in a multiservice architecture simply does not give you enough information. Distributed tracing tools shine a light on the relationship between components. José Carlos Chávez explains how distributed tracing works, what you can use it for, and how tools like Zipkin can help.