Presented By O'Reilly and Cloudera
Make Data Work
22–23 May 2017: Training
23–25 May 2017: Tutorials & Conference
London, UK

Lessons learned working with Spark and Cassandra

Matthias Niehoff (codecentric AG)
14:5515:35 Thursday, 25 May 2017
Level: Beginner
Average rating: ****.
(4.00, 4 ratings)

Who is this presentation for?

  • Architects and developers

Prerequisite knowledge

  • A basic understanding of Cassandra and Spark/Spark Streaming

What you'll learn

  • Understand how to integrate Spark and Cassandra
  • Learn best practices for building applications based on both technologies and common mistakes you want to avoid


Matthias Niehoff shares lessons learned working with Spark, Cassandra, and the Spark-Cassandra connector and best practices drawn from his work on multiple big and fast data projects, as well as challenges encountered along the way.

Topics include:

  • Cassandra bucketing
  • Spark partitioning
  • Efficient queries
  • Spark join with Cassandra table
  • Spark data locality
Photo of Matthias Niehoff

Matthias Niehoff

codecentric AG

Matthias Niehoff is an IT consultant at codecentric AG in Germany, where he focuses on big data and streaming applications with Apache Cassandra and Apache Sparkā€”as well as other tools in the area of big data. Matthias shares his experience at conferences, meetups, and user groups.

Comments on this page are now closed.


Picture of Matthias Niehoff
Matthias Niehoff | IT CONSULTANT
29/05/2017 21:24 BST

I just uploaded slides to the portal.

26/05/2017 18:29 BST


Where can I get the slides for this presentation?

Kind regards