Presented By O'Reilly and Cloudera
Make Data Work
March 28–29, 2016: Training
March 29–31, 2016: Conference
San Jose, CA

Hadoop Internals & Development conference sessions

Tuesday, March 29

1:30pm–5:00pm Tuesday, 03/29/2016
Location: LL20 D
Jonathan Seidman (Cloudera), Ted Malaska (Capital One), Gwen Shapira (Confluent), Mark Grover (Lyft)
Average rating: ****.
(4.48, 23 ratings)
Jonathan Seidman, Ted Malaska, Gwen Shapira, and Mark Grover walk participants through building a fraud-detection system, using an end-to-end case study to provide a concrete example of how to architect and implement real-time systems via Apache Hadoop components like Kafka, HBase, Impala, and Spark. Read more.

Wednesday, March 30

11:00am–11:40am Wednesday, 03/30/2016
Location: 230 C
Ben Lorica (O'Reilly Media), Doug Cutting (Cloudera), Mike Cafarella (University of Michigan)
Average rating: **...
(2.87, 15 ratings)
Ben Lorica hosts a conversation with Doug Cutting and Mike Cafarella, the cofounders of Apache Hadoop. Read more.
1:50pm–2:30pm Wednesday, 03/30/2016
Location: 230 C
Tags: real-time
Todd Lipcon (Cloudera)
Average rating: ****.
(4.68, 19 ratings)
Todd Lipcon explores the tradeoffs between real-time transactional access and fast analytic performance from the perspective of storage-engine internals. Todd also outlines Kudu, the new addition to the open source Hadoop ecosystem that complements HDFS and HBase to provide a new option for achieving fast scans and fast random access from a single API. Read more.

Thursday, March 31

1:50pm–2:30pm Thursday, 03/31/2016
Location: 230 A
Silvia Oliveros (Silicon Valley Data Science), Stephen O'Sullivan (Data Whisperers)
Average rating: ***..
(3.58, 12 ratings)
You have your Hadoop cluster, and you are ready to fill it up with data. But wait! Which format should you use to store your data? Should you store it in plain text, SequenceFile, Avro, or Parquet? (And should you compress it?) Silvia Oliveros and Stephen O'Sullivan cover the hows, whys, and whens of choosing one format over another and take a closer look at some of the tradeoffs each offers. Read more.