Brought to you by NumFOCUS Foundation and O’Reilly Media Inc.
The official Jupyter Conference
August 22-23, 2017: Training
August 23-25, 2017: Tutorials & Conference
New York, NY

Building Analytics Platform with Apache Toree and Apache Spark

Moderated by: Luciano Resende & Jakob Odersky

Who is this presentation for?

Data Engineers, System Engineers, Dev Ops

Prerequisite knowledge

Some understanding of Jupyter Notebooks, Kernels and Spark is desirable but not required.

What you'll learn

After the talk, the audience will have a deep understand of the components that are necessary to create an analytical platform capable of hosting a large number of consumers.

Description

Data Scientists are becoming a necessity of every company in the data centric world of today, and with them comes the requirement to make available a flexible and interactive analytics platform. This session will describe our experience and best practices putting together an Analytical platform based on Jupyter Notebooks, Apache Toree and Apache Spark. We will also dive into some of the technical characteristics and challenges of each of these components, describing its challenges and how we fixed or overcome them.