Skills for diagnosing failures in complex, interacting systems are critically important but rarely taught. Even those with experience and expertise can struggle to articulate how they do what they do in order to pass the knowledge on. Drawing on a combination of his own experience carrying the pager and time spent observing and teaching others, Terran Melconian has distilled an explicit, teachable process for efficiently isolating faults.
Starting with the observed symptom (for example, a page), Terran demonstrates how to draw a diagram of possible causes, which bifurcate the search space, and how to collect new observations to decide which path to follow at each fork. He also shares a very common anti-pattern—observe a symptom, hypothesize a cause for the fault, and write and deploy code to address this cause—explains why this process often fails to produce effective results, and outlines what to do instead.
Terran Melconian is a data science consultant and trainer at Air Network Simulation and Analysis. Terran has worked in the consumer web space for the last decade in roles that included software development, operations, data warehousing, and data science. Previously, he built up and managed teams at TripAdvisor and Jobcase, where he focused on hiring generalists and teaching them about the specifics. Terran is passionate about continuing education for high-performing professionals.
©2017, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org