Presented By O'Reilly and Cloudera
Make Data Work
September 26–27, 2016: Training
September 27–29, 2016: Tutorials & Conference
New York, NY

Schedule: Hadoop internals & development sessions

1:15pm–1:55pm Wednesday, 09/28/2016
Location: 3D 10 Level: Intermediate
Marcel Kornacker (Cloudera), Mostafa Mokhtar (Cloudera)
Average rating: ****.
(4.80, 5 ratings)
Performance tuning your SQL-on-Hadoop deployment may seem overwhelming at times, especially for BI workloads that need interactive response times with high concurrency. Marcel Kornacker and Mostafa Mokhtar simplify the process and cover top performance optimizations for Apache Impala (incubating), from schema design and memory optimization to query tuning. Read more.
2:05pm–2:45pm Wednesday, 09/28/2016
Location: 3D 10 Level: Intermediate
Adam Bordelon (Mesosphere), Mohit Soni (Mesosphere)
Average rating: ****.
(4.33, 3 ratings)
Adam Bordelon and Mohit Soni demonstrate how projects like Apache Myriad (incubating) can install Hadoop on Mesosphere DC/OS alongside other data center-scale applications, enabling efficient resource sharing and isolation across a variety of distributed applications while sharing the same cluster resources and hence breaking silos. Read more.
2:55pm–3:35pm Wednesday, 09/28/2016
Location: 3D 10 Level: Intermediate
Zhe Zhang (LinkedIn), Uma Maheswara Rao G (Intel)
Average rating: **...
(2.67, 3 ratings)
The new erasure coding feature in Apache Hadoop (HDFS-EC) reduces the storage cost by ~50% compared with 3x replication. Zhe Zhang and Uma Maheswara Rao G present the first-ever performance study of HDFS-EC and share insights on when and how to use the feature. Read more.
4:35pm–5:15pm Thursday, 09/29/2016
Location: River Pavilion Level: Intermediate
Todd Lipcon (Cloudera)
Apache Kudu was first announced as a public beta release at Strata NYC 2015 and recently reached 1.0. This conference marks its one year anniversary as a public open source project. Todd Lipcon offers a very brief refresher on the goals and feature set of the Kudu storage engine, covering the development that has taken place over the last year. Read more.
4:35pm–5:15pm Thursday, 09/29/2016
Location: 1 C04 / 1 C05 Level: Beginner
Vinayak Borkar (X15 Software)
Average rating: ***..
(3.50, 2 ratings)
Starting from first principles, Vinayak Borkar defines the requirements for a modern operational data store and explores some possible architectures to support those requirements. Read more.