Presented By O’Reilly and Cloudera
Make Data Work
21–22 May 2018: Training
22–24 May 2018: Tutorials & Conference
London, UK
Stuart Pook

Stuart Pook
Senior DevOps Engineer, Criteo

Website

Stuart loves storage (208 PB at Criteo) and is part of Criteo’s Lake team that runs some small and two rather large Hadoop clusters. He also loves automation with Chef because configuring more than 3000 Hadoop
nodes by hand is just too slow. Before discovering Hadoop he developed
user interfaces and databases for biotech companies.

Stuart has presented at ACM CHI 2000, Devoxx 2016, NABD 2016, Hadoop Summit Tokyo 2016, Apache Big Data Europe 2016, Big Data Tech Warsaw 2017, and Apache Big Data North America 2017.

Sessions

11:1511:55 Wednesday, 23 May 2018
Stuart Pook (Criteo)
Criteo has a production cluster of 2000 nodes running over 300000 jobs/day and a backup cluster of 1200 nodes. These clusters are in our own data centres as the cloud is more expensive. They were meant to provide a redundant solution to Criteo's storage and compute needs. We will explain our project, what went wrong, and our progress in building another cluster to survive the loss of a full DC. Read more.