Brought to you by NumFOCUS Foundation and O’Reilly Media
The official Jupyter Conference
Aug 21-22, 2018: Training
Aug 22-24, 2018: Tutorials & Conference
New York, NY

Using the MapD kernel for the Jupyter Notebook

Randy Zwitch (MapD)
2:40pm–3:20pm Friday, August 24, 2018
Extensions and customization, Kernels, Usage and application
Location: Murray Hill
Level: Intermediate

Who is this presentation for?

  • Data science engineers and architects

Prerequisite knowledge

  • Intermediate familiarity with data science and Python

What you'll learn

  • Explore MapD and its Jupyter kernel extension
  • Learn how to use the kernel's magic function (%%sql) to execute commands on the MapD Core SQL engine
  • Learn how to integrate MapD's columnar database and GPU DataFrame into a typical machine learning workflow to boost performance

Description

MapD Core is an open source analytical SQL engine designed from the ground up to harness the parallelism inherent in GPUs, which enables queries on billions of rows of data in milliseconds. MapD Core also supports the GPU DataFrame (GDF) from GOAI (the GPU Open Analytics Initiative). The GDF is based on Apache Arrow and is designed for passing data between processes while keeping it all in GPU memory. To give data scientists a seamless experience, MapD created a Jupyter Notebook kernel extension that can be installed from a MapD-managed Conda channel.

Randy Zwitch offers an overview of the MapD kernel extension for the Jupyter Notebook and explains how to use it in a typical machine learning workflow. You’ll learn how to deploy a Jupyter notebook with the MapD kernel extension, see how the kernel connects to a MapD server backend, and discover how its magic function (%%sql) executes commands on the MapD Core SQL engine. The results of these SQL queries are returned as a GPU-resident DataFrame using the PyGDF library. The machine learning framework then accesses that DataFrame directly in GPU memory to train, test, and make predictions.
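The %%sql step described above can be illustrated with a minimal, CPU-only sketch. This is not MapD's implementation: Python's built-in sqlite3 module stands in for the MapD Core backend, a plain dict of column lists stands in for the GPU DataFrame, and the names sql_magic and flights are hypothetical. The function takes (line, cell) the way an IPython cell magic does, so the same shape could in principle be registered as a magic in a real notebook.

```python
import sqlite3


def sql_magic(line, cell, connection):
    """Run the SQL text in `cell` and return the result in columnar form.

    `line` is the text that follows `%%sql` on the first line of a cell
    (unused in this sketch); `cell` is the cell body. IPython passes these
    two arguments to cell magics; `connection` is an extra argument added
    here so the sketch stays self-contained.
    """
    cursor = connection.execute(cell)
    names = [description[0] for description in cursor.description]
    rows = cursor.fetchall()
    # Pivot row tuples into one list per column, loosely mirroring the
    # columnar layout of a data frame.
    return {name: [row[i] for row in rows] for i, name in enumerate(names)}


# Stand-in backend: an in-memory SQLite database, NOT MapD Core.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE flights (carrier TEXT, dep_delay REAL)")
conn.executemany(
    "INSERT INTO flights VALUES (?, ?)",
    [("AA", 5.0), ("AA", 15.0), ("UA", 30.0)],
)

# Equivalent in spirit to a notebook cell that starts with `%%sql`.
frame = sql_magic(
    "",
    "SELECT carrier, AVG(dep_delay) AS avg_delay "
    "FROM flights GROUP BY carrier ORDER BY carrier",
    conn,
)
print(frame)
```

In the workflow the session describes, the query would run on the MapD server and the result would stay in GPU memory rather than being copied into Python lists, which is where the performance benefit comes from.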

Randy Zwitch

MapD

Randy Zwitch is a Senior Developer Advocate at MapD, where he helps customers and community users alike use MapD to its fullest potential. With broad industry experience in energy, digital analytics, banking, telecommunications, and media, Randy brings a wealth of knowledge across verticals as well as in-depth knowledge of open source tools for analytics.