Presented By O'Reilly and Cloudera
Make Data Work
September 26–27, 2016: Training
September 27–29, 2016: Tutorials & Conference
New York, NY

The state of Spark and what's next after Spark 2.0

Ram Sriharsha (Databricks)
11:20am–12:00pm Wednesday, 09/28/2016
Spark & beyond
Location: Hall 1B
Level: Beginner
Average rating: 2.88 (17 ratings)

Prerequisite knowledge

  • A basic understanding of Spark

What you'll learn

  • Explore recent developments in Apache Spark
  • Understand where the Spark project is headed

Description

    Ram Sriharsha reviews major developments in Apache Spark 2.0 and discusses future directions for the project to make Spark faster and easier to use for a wider array of workloads, with an emphasis on API evolution, single-node performance (Project Tungsten Phase 3), and Structured Streaming.

    Ram Sriharsha


    Ram Sriharsha is the product manager for Apache Spark at Databricks and an Apache Spark committer and PMC member. Previously, Ram was architect of Spark and data science at Hortonworks and principal research scientist at Yahoo Labs, where he worked on scalable machine learning and data science. He holds a PhD in theoretical physics from the University of Maryland and a BTech in electronics from the Indian Institute of Technology, Madras.

    Comments on this page are now closed.


    10/06/2016 7:25am EDT

    Forget that, I have found them.

    10/06/2016 7:24am EDT

    Where are the recordings for these sessions?

    10/03/2016 11:50am EDT

    Poor audio in the conference call. Are the slides posted somewhere?

    09/29/2016 7:16am EDT

    Are the slides available? Thanks.

    09/28/2016 6:25pm EDT

    Poor audio in the conference hall. Can I get the notebook that was used for the demo?

    dino vitale
    09/28/2016 6:19pm EDT

    Will the deck be posted?