Presented By O'Reilly and Cloudera
Make Data Work
22–23 May 2017: Training
23–25 May 2017: Tutorials & Conference
London, UK

Lessons learned working with Spark and Cassandra

Matthias Niehoff (codecentric AG)
14:5515:35 Thursday, 25 May 2017
Level: Beginner
Average rating: ****.
(4.00, 4 ratings)

Who is this presentation for?

  • Architects and developers

Prerequisite knowledge

  • A basic understanding of Cassandra and Spark/Spark Streaming

What you'll learn

  • Understand how to integrate Spark and Cassandra
  • Learn best practices for building applications based on both technologies and common mistakes you want to avoid


Matthias Niehoff shares lessons learned working with Spark, Cassandra, and the Spark-Cassandra connector and best practices drawn from his work on multiple big and fast data projects, as well as challenges encountered along the way.

Topics include:

  • Cassandra bucketing
  • Spark partitioning
  • Efficient queries
  • Spark join with Cassandra table
  • Spark data locality
Photo of Matthias Niehoff

Matthias Niehoff

codecentric AG

Matthias Niehoff works as Data Architect and Head of Data & AI for codecentric and supports customers in the design and implementation of data architectures. His focus is not so much on the ML model, but rather on the necessary infrastructure and organization to help data science projects succeed.

Comments on this page are now closed.


Picture of Matthias Niehoff
Matthias Niehoff | HEAD OF DATA & AI
29/05/2017 21:24 BST

I just uploaded slides to the portal.

26/05/2017 18:29 BST


Where can I get the slides for this presentation?

Kind regards