Presented By O'Reilly and Cloudera
Make Data Work
22–23 May 2017: Training
23–25 May 2017: Tutorials & Conference
London, UK

Schedule: Hadoop platform and applications sessions

Add to your personal schedule
9:0012:30 Tuesday, 23 May 2017
Location: Capital Suite 2/3
Level: Intermediate
Mark Donsky (Cloudera), Andre Araujo (Cloudera), Mubashir Kazia (Cloudera), Syed Rafice (Cloudera)
Average rating: ***..
(3.50, 4 ratings)
Mark Donsky, André Araujo, Syed Rafice, and Mubashir Kazia walk you through securing a Hadoop cluster. You’ll start with a cluster with no security and then add security features related to authentication, authorization, encryption of data at rest, encryption of data in transit, and complete data governance. Read more.
Add to your personal schedule
13:3017:00 Tuesday, 23 May 2017
Location: Capital Suite 8
Level: Advanced
Jonathan Seidman (Cloudera), Mark Grover (Lyft), Ted Malaska (Blizzard Entertainment)
Average rating: *****
(5.00, 6 ratings)
Using Entity 360 as an example, Jonathan Seidman, Ted Malaska, and Mark Grover explain how to architect a modern, real-time big data platform leveraging recent advancements in the open source software world, using components like Kafka, Impala, Kudu, Spark Streaming, and Spark SQL with Hadoop to enable new forms of data processing and analytics. Read more.
Add to your personal schedule
11:1511:55 Wednesday, 24 May 2017
Location: Capital Suite 13
Level: Beginner
Marcel Kornacker (Cloudera)
Average rating: ****.
(4.12, 8 ratings)
Marcel Kornacker offers an introduction to using Impala and Kudu to power your real-time data-centric applications for use cases like time series analysis (fraud detection, stream market data), machine data analytics, and online reporting. Read more.
Add to your personal schedule
12:0512:45 Wednesday, 24 May 2017
Location: Capital Suite 13
Level: Intermediate
Luke Han (Kyligence)
Average rating: *****
(5.00, 2 ratings)
Apache Kylin is rapidly being adopted over the world—especially in China. Luke Han explores how various industries use Apache Kylin, sharing why these companies choose Apache Kylin (a technology comparison), how they use Apache Kylin (their production deployment pattern), and most importantly, the resulting business impact. Read more.
Add to your personal schedule
14:5515:35 Wednesday, 24 May 2017
Location: Capital Suite 13
Level: Intermediate
Marcel Kornacker (Cloudera), Mostafa Mokhtar (Cloudera)
Average rating: *****
(5.00, 2 ratings)
Marcel Kornacker and Mostafa Mokhtar help simplify the process of making good SQL on Hadoop decisions and cover top performance optimizations for Apache Impala (incubating), from schema design and memory optimization to query tuning. Read more.
Add to your personal schedule
11:1511:55 Thursday, 25 May 2017
Location: Capital Suite 13
Level: Beginner
This session explores Gaffer's history, architecture, data model, features, and functionality and outlines some future goals for the project. Read more.
Add to your personal schedule
16:3517:15 Thursday, 25 May 2017
Location: Capital Suite 13
Level: Intermediate
Arturo Bayo (Synergic Partners), Alvaro Fernandez Velando (Santander Spain)
Average rating: ****.
(4.50, 6 ratings)
Arturo Bayo and Alvaro Fernandez Velando explain how a data hub strategy helps clarify data sharing and governance in an organization and share one way to implement a data hub architecture using big data technology and resources that are already established in the enterprise. Read more.