Mar 15–18, 2020

The power of GPUs for data virtualization in Tableau and PowerBI and beyond

Claudiu Barbura (Blueprint)
11:50am12:30pm Wednesday, March 18, 2020
Location: LL21A

Who is this presentation for?

Data engineers, data architects, developers

Level

Beginner

Description

If you have an investment in Spark infrastructure, you’re heavily using SparkSQL for various workloads, and you’re ready to take on a cutting-edge new SQL engine that runs on GPUs for at least one order of magnitude increase in performance with minimum effort…you’re about to find out how you can do that. Claudiu Barbura explains how to shield your existing consumer applications built on Spark (reporting from Tableau and PowerBI, data science workloads in notebooks, etc.) from any replacement or enhancement of your engine.

You’ll discover the lessons Blueprint learned with Spark (CPU), BlazingSQL and Rapids.ai (GPU), and Apache Arrow in its quest to exponentially increase the performance of its data virtualizer that enables real-time access to disparate data sources across different cloud providers and on-premises databases and APIs when a native query translation to the data source isn’t possible.

You’ll learn how you can leverage the performance of this GPU-based SQL engine’s performance (BlazingSQL) in your favorite tools via a unified interface, especially if you’re a BI analyst or data scientist.

Prerequisite knowledge

  • A basic understanding of Spark (useful but not required)
  • Familiarity with cloud storage services (useful but not required)

What you'll learn

  • Learn about BlazingSQL, Rapids.ai, Spark, data virtualization, Apache Arrow, GPU, PowerBI, and Tableau
Photo of Claudiu Barbura

Claudiu Barbura

Blueprint

Claudiu Barbura is a director of engineering at Blueprint, and he oversees product engineering, where he builds large-scale advanced analytics pipelines, IoT, and data science applications for customers in oil and gas, energy, and retail industries. Previously, he was the vice president of engineering at UBIX.AI, automating data science at scale, and senior director of engineering, xPatterns platform services at Atigeo, building several advanced analytics platforms and applications in healthcare and financial industries. Claudiu is a hands-on architect, dev manager, and executive with 20+ years of experience in open source, big data science and Microsoft technology stacks and a frequent speaker at data conferences.

Leave a Comment or Question

Help us make this conference the best it can be for you. Have questions you'd like this speaker to address? Suggestions for issues that deserve extra attention? Feedback that you'd like to share with the speaker and other attendees?

Join the conversation here (requires login)

Contact us

confreg@oreilly.com

For conference registration information and customer service

partners@oreilly.com

For more information on community discounts and trade opportunities with O’Reilly conferences

Become a sponsor

For information on exhibiting or sponsoring a conference

pr@oreilly.com

For media/analyst press inquires