Spark on Kubernetes for data science
Who is this presentation for?
- Data scientists, data engineers, and analytics managers
Level
Description
Data science has benefitted greatly from advances in big data and containerization technologies. Spark is the leading platform for data engineering and data science at scale. Kubernetes is the leading container orchestration service. Spark on Kubernetes is a winning combination for data science that stitches together a flexible platform harnessing the best of both worlds. Although still very experimental and young, Spark on Kubernetes shows tremendous promise and should be something all data science organizations are aware of.
Jordan Volz gives a brief overview of Spark and Kubernetes, explaining the history of each and why they are so crucial to the modern data scientist. He explores the Spark on Kubernetes project and why it’s an ideal fit for data scientists who may have been dissatisfied with other iterations of Spark in the past. He also dives into Spark on Kubernetes as the go-to platform in cloud native architectures as organizations begin to modernize their older on-premises architectures and ready them for cloud deployments. He shows some concrete examples to whet your appetite and get you excited to go home and start experimenting with Spark on Kubernetes for yourself.
Prerequisite knowledge
- Familiarity with big data and containerization ideas (useful but not required)
What you'll learn
- Learn how Spark and Kubernetes combine forces to create the next go-to platform for data science on cloud native architectures
Jordan Volz
Dataiku
Jordan Volz is a senior data scientist at Dataiku, where he helps customers design and implement ML applications. Previously, Jordan specialized in big data technologies as a systems engineer at Cloudera and enterprise search technology as a technical consultant at Autonomy, frequently working with large financial organizations in the US and Canada. He holds degrees from Bard College and the University of Amherst, and he’s academically trained in pure mathematics.
Presented by
Elite Sponsors
Strategic Sponsors
Zettabyte Sponsors
Contributing Sponsors
Exabyte Sponsors
Content Sponsor
Impact Sponsors
Supporting Sponsor
Non Profit
Contact us
confreg@oreilly.com
For conference registration information and customer service
partners@oreilly.com
For more information on community discounts and trade opportunities with O’Reilly conferences
strataconf@oreilly.com
For information on exhibiting or sponsoring a conference
pr@oreilly.com
For media/analyst press inquires