Presented By O'Reilly and Cloudera
Make Data Work
September 25–26, 2017: Training
September 26–28, 2017: Tutorials & Conference
New York, NY

Accelerate your analytics with a GPU Data Frame (sponsored by MapD)

Todd Mostak (MapD)
1:15pm1:55pm Wednesday, September 27, 2017
Location: 1E 17

What you'll learn

  • Explore the GPU Open Analytics Initiative's first project, the GPU Data Frame (GDF), and learn how it improves performance


Moving data is the biggest problem in computing. Many companies try to do all computation from the memory of one set of devices due to the cost of bandwidth, latency, and energy, as well as the act of running machine learning algorithms against one database instead of multiple ones. And if the company is still operating off the lagging speed of CPU-driven architectures, moving data could take anywhere from hours to days.

Enter the GPU Open Analytics Initiative (GOAI) and its first project, the GPU Data Frame (GDF). Enabled by NVIDIA’s hardware innovation providing 100x more processing cores and 20x greater memory bandwidth, the GDF is an Apache Arrow-based API that enables end-to-end GPU analytics by allowing for seamless passing of data between processes running on the same GPUs. This efficient data interchange improves performance, encouraging development of more sophisticated and interactive GPU-based applications.

Todd Mostak debuts the GDF, showcases the advanced speed and efficiency of GPUs, and highlights the importance of the open source community to enable efficient intra-GPU communication between different processes running on the GPUs. Todd explains in detail how the integration allows developers, data scientists, and researchers to build new functions to cluster or perform analysis on queries and seamless workflows that combine data processing, machine learning, and visualization.

This session is sponsored by MapD.

Photo of Todd Mostak

Todd Mostak


Todd Mostak is the founder of MapD. He is a graduate of Harvard’s Kennedy School of Government.