Presented By O’Reilly and Cloudera
Make Data Work
September 11, 2018: Training & Tutorials
September 12–13, 2018: Keynotes & Sessions
New York, NY
Ryan Blue

Ryan Blue
Senior Software Engineer, Netflix

Ryan Blue is an engineer on Netflix’s big data platform team. Previously, Ryan was responsible for the Avro and Parquet file formats at Cloudera. He is the author of the Analytic Data Storage in Hadoop series of screencasts from O’Reilly.

Sessions

1:15pm–1:55pm Wednesday, 09/12/2018
Location: 1A 10 Level: Intermediate
Secondary topics:  Data Platforms
Ryan Blue (Netflix), Daniel Weeks (Netflix)
Average rating: *****
(5.00, 3 ratings)
In the last few years, Netflix's data warehouse has grown to more than 100 PB in S3. Ryan Blue and Daniel Weeks share lessons learned, the tools Netflix currently uses and those it has retired, and the improvements it is rolling out, including Iceberg, a new table format for S3. Read more.
5:25pm–6:05pm Wednesday, 09/12/2018
Location: 1E 09 Level: Intermediate
Owen O'Malley (Hortonworks), Ryan Blue (Netflix)
Average rating: ****.
(4.33, 3 ratings)
Owen O'Malley and Ryan Blue offer an overview of Iceberg, a new open source project that defines a new table layout with properties specifically designed for cloud object stores, such as S3. It provides a common set of capabilities such as partition pruning, schema evolution, atomic additions, removal, or replacements of files regardless of whether the data is stored in Avro, ORC, or Parquet. Read more.