17–19 October 2016: Conference & Tutorials
19–20 October 2016: Training
London, UK

Monitoring 101: Finding signal in the noise

Ilan Rabinovitch (Datadog)
10:50–11:30 Tuesday, 18/10/2016
Location: Sandringham
Level: Intermediate
Average rating: 4.00 (2 ratings)

Prerequisite knowledge

  • Basic systems knowledge

What you'll learn

  • Strategies and frameworks for identifying and fixing issues that you can implement in your environment today, regardless of the platforms and tools you use


Monitoring even a few machines and applications is enough to make identifying and fixing issues in your environment very complicated. Throw in the dynamic infrastructure provided by cloud providers and container orchestration, and your static monitoring strategies will most likely not scale. Knowing which metrics to watch and how to troubleshoot based on those metrics will help you solve problems more quickly.

Ilan Rabinovitch outlines a framework for your metrics and explains how to use it to find solutions to the issues that come up. Ilan covers the three types of monitoring data, what to collect, what should trigger an alert (avoiding an alert storm and pager fatigue), and how to follow the resources to find the root causes of problems. Ilan’s talk is not tool-specific, so you’ll leave with strategies and frameworks you can implement in your environment today, regardless of the platforms and tools you use.

Ilan Rabinovitch


Ilan Rabinovitch is vice president of product and community at Datadog, where he spends his days diving into container monitoring metrics, collaborating with Datadog’s open source community, and evangelizing observability best practices. Previously, Ilan spent a number of years leading infrastructure and reliability engineering teams at organizations such as Ooyala and Edmunds.com. He’s active in the open source and DevOps communities, where he is a co-organizer of events such as SCALE and Texas Linux Fest as well as a number of devopsdays events.