Presented By O'Reilly and Cloudera
Make Data Work
22–23 May 2017: Training
23–25 May 2017: Tutorials & Conference
London, UK

Lessons learned working with Spark and Cassandra

Matthias Niehoff (codecentric AG)
14:5515:35 Thursday, 25 May 2017
Level: Beginner
Average rating: ****.
(4.00, 4 ratings)

Who is this presentation for?

  • Architects and developers

Prerequisite knowledge

  • A basic understanding of Cassandra and Spark/Spark Streaming

What you'll learn

  • Understand how to integrate Spark and Cassandra
  • Learn best practices for building applications based on both technologies and common mistakes you want to avoid

Description

Matthias Niehoff shares lessons learned working with Spark, Cassandra, and the Spark-Cassandra connector and best practices drawn from his work on multiple big and fast data projects, as well as challenges encountered along the way.

Topics include:

  • Cassandra bucketing
  • Spark partitioning
  • Efficient queries
  • Spark join with Cassandra table
  • Spark data locality
Photo of Matthias Niehoff

Matthias Niehoff

codecentric AG

Matthias Niehoff is an IT consultant at codecentric AG in Germany, where he focuses on big data and streaming applications with Apache Cassandra and Apache Sparkā€”as well as other tools in the area of big data. Matthias shares his experience at conferences, meetups, and user groups.

Leave a Comment or Question

Help us make this conference the best it can be for you. Have questions you'd like this speaker to address? Suggestions for issues that deserve extra attention? Feedback that you'd like to share with the speaker and other attendees?

Join the conversation here (requires login)

Comments

Picture of Matthias Niehoff
Matthias Niehoff | IT CONSULTANT
29/05/2017 21:24 BST

I just uploaded slides to the portal.

Aitezaz Sheikh | SENIOR DEVELOPER
26/05/2017 18:29 BST

Hi,

Where can I get the slides for this presentation?

Kind regards