The power of GPUs for data virtualization in Tableau and PowerBI and beyond
Who is this presentation for?Data engineers, data architects, developers
If you have an investment in Spark infrastructure, you’re heavily using SparkSQL for various workloads, and you’re ready to take on a cutting-edge new SQL engine that runs on GPUs for at least one order of magnitude increase in performance with minimum effort…you’re about to find out how you can do that. Claudiu Barbura explains how to shield your existing consumer applications built on Spark (reporting from Tableau and PowerBI, data science workloads in notebooks, etc.) from any replacement or enhancement of your engine.
You’ll discover the lessons Blueprint learned with Spark (CPU), BlazingSQL and Rapids.ai (GPU), and Apache Arrow in its quest to exponentially increase the performance of its data virtualizer that enables real-time access to disparate data sources across different cloud providers and on-premises databases and APIs when a native query translation to the data source isn’t possible.
You’ll learn how you can leverage the performance of this GPU-based SQL engine’s performance (BlazingSQL) in your favorite tools via a unified interface, especially if you’re a BI analyst or data scientist.
- A basic understanding of Spark (useful but not required)
- Familiarity with cloud storage services (useful but not required)
What you'll learn
- Learn about BlazingSQL, Rapids.ai, Spark, data virtualization, Apache Arrow, GPU, PowerBI, and Tableau
Claudiu Barbura is a director of engineering at Blueprint, and he oversees product engineering, where he builds large-scale advanced analytics pipelines, IoT, and data science applications for customers in oil and gas, energy, and retail industries. Previously, he was the vice president of engineering at UBIX.AI, automating data science at scale, and senior director of engineering, xPatterns platform services at Atigeo, building several advanced analytics platforms and applications in healthcare and financial industries. Claudiu is a hands-on architect, dev manager, and executive with 20+ years of experience in open source, big data science and Microsoft technology stacks and a frequent speaker at data conferences.
Leave a Comment or Question
Help us make this conference the best it can be for you. Have questions you'd like this speaker to address? Suggestions for issues that deserve extra attention? Feedback that you'd like to share with the speaker and other attendees?
Join the conversation here (requires login)
Premier Diamond Sponsors
Premier Exhibitor Plus
For conference registration information and customer service
For more information on community discounts and trade opportunities with O’Reilly conferences
For information on exhibiting or sponsoring a conference
For media/analyst press inquires