Build Systems that Drive Business
Sep 30–Oct 1, 2018: Training
Oct 1–3, 2018: Tutorials & Conference
New York, NY

Using distributed trace data to solve performance and operational challenges

Naoman Abbas (Pinterest)
2:25pm–3:05pm Tuesday, October 2, 2018
Monitoring, Observability, and Performance
Location: Beekman/Sutton North Level: Intermediate
Secondary topics:  Resilient, Performant & Secure Distributed Systems
Average rating: *****
(5.00, 1 rating)

Prerequisite knowledge

  • A basic understanding of distributed systems, microservice architecture, and distributed tracing

What you'll learn

  • Explore the tools Pinterest built to process trace data and the use cases they’ve enabled

Description

Like most modern large-scale applications, Pinterest is built on a microservices architecture. In this scheme, a number of services work together to serve a single user request. Debugging performance and architectural problems in this environment can be challenging. Distributed tracing has emerged as the indispensable tool and solution to address these challenges.

Pinterest recently deployed Pintrace, a Zipkin-based distributed tracing system, to record end-to-end performance data across the execution path of requests, from mobile applications to backend services. Pintrace has evolved over time as its users find new data and as new subsystems integrate with the company’s tracing systems. Pinterest has also built tools for visualization, feature extraction, aggregation, and analysis of trace data, which help enable use cases that wouldn’t have been possible with traditional tooling, such as root-cause analysis, latency analysis, and regression analysis.

Naoman Abbas offers an overview of tools Pinterest built to process trace data and the use cases they’ve enabled and shares some real-world examples. Join in to learn how to apply these techniques to your own challenges.

Photo of Naoman Abbas

Naoman Abbas

Pinterest

Naoman Abbas is an engineering manager for the visibility team at Pinterest, which is responsible for building and maintaining monitoring tools like the company’s metrics system, logsearch, and distributed tracing. Previously, Naoman was a software engineer building cloud platform components at Netflix and Microsoft.