Presented By O'Reilly and Cloudera
Make Data Work
Sept 29–Oct 1, 2015 • New York, NY

Failing fast and falling often is no way to run a cluster!

Michael Segel (Segel & Associates.)
2:55pm–3:35pm Wednesday, 09/30/2015
Production Ready Hadoop
Location: 3D 05/08 Level: Intermediate
Average rating: 3.62 (8 ratings)

In the beginning, there was one single point of failure that mattered most: the NameNode. Lose the NameNode, and you lose your cluster.

Today, there are multiple points of failure, depending on which vendor’s solution you choose to implement.

Hive, HCatalog, and Ranger, among other components, have become critical. The failure of any one of them can cripple a cluster and cause significant downtime, data loss, or both.

By identifying where failures can occur, one can either mitigate the risk or consider alternative designs.

Michael Segel

Segel & Associates.

Michael Segel has been working with Hadoop since 2009 as a solution architect at various companies, solving tough challenges. He is currently globe-trotting as a principal architect with Segel & Associates, looking for the next challenging problem to solve. Michael spends his free time thinking about solutions as he walks his dogs around the River North neighborhood in Chicago. The founder of CHUG (Chicago area Hadoop User Group), Michael is also in the process of starting a Big Data Anonymous work group for recovering big data-holics.