Presented By O'Reilly and Cloudera
Make Data Work
September 26–27, 2016: Training
September 27–29, 2016: Tutorials & Conference
New York, NY
Ram Sriharsha

Ram Sriharsha
Product Manager, Apache Spark, Databricks


Ram Sriharsha is the product manager for Apache Spark at Databricks and an Apache Spark committer and PMC member. Previously, Ram was architect of Spark and data science at Hortonworks and principal research scientist at Yahoo Labs, where he worked on scalable machine learning and data science. He holds a PhD in theoretical physics from the University of Maryland and a BTech in electronics from the Indian Institute of Technology, Madras.


11:20am–12:00pm Wednesday, 09/28/2016
Spark & beyond
Location: Hall 1B Level: Beginner
Ram Sriharsha (Databricks)
Average rating: **...
(2.88, 17 ratings)
Ram Sriharsha reviews major developments in Apache Spark 2.0 and discusses future directions for the project to make Spark faster and easier to use for a wider array of workloads, with an emphasis on API evolution, single-node performance (Project Tungsten Phase 3), and Structured Streaming. Read more.
11:20am–12:00pm Thursday, 09/29/2016
Spark & beyond
Location: Hall 1B Level: Beginner
Tags: real-time
Ram Sriharsha (Databricks)
Average rating: ***..
(3.25, 8 ratings)
Structured Streaming is a new effort in Apache Spark to make stream processing simple without the need to learn a new programming paradigm or system. Ram Sriharsha offers an overview of Structured Streaming, discussing its support for event-time, out-of-order/delayed data, sessionization, and integration with the batch data stack to show how it simplifies building powerful continuous applications. Read more.
2:55pm–3:35pm Thursday, 09/29/2016
Location: 1 C03
Ram Sriharsha (Databricks), Xiangrui Meng (Databricks)
Join Xiangrui Meng and Ram Sriharsha to discuss the state of Spark. Read more.