Presented By O'Reilly and Cloudera
Make Data Work
December 1–3, 2015 • Singapore

Skye Wanderman-Milne
Software Engineer, Cloudera

Skye Wanderman-Milne is an engineer on the Impala team at Cloudera.

Sessions

1:30pm–2:10pm Wednesday, 12/02/2015
Data Science and Advanced Analytics
Location: 321-322
Level: Intermediate
Marcel Kornacker (Cloudera), Skye Wanderman-Milne (Cloudera)
Average rating: 3.90 (10 ratings)
In this talk, we explain how data scientists use nested data structures to increase analytic productivity. Using two well-known relational schemas, TPC-H and Twitter, we demonstrate how nested schemas simplify data science workloads. We also outline best practices for converting flat relational schemas into nested ones and give examples of data science-style analysis.