
Unifying Your Data Management Platform with Hadoop: Batch and Real-time Machine Data Ingest, Alerts, and Analytics

Jayant Shekhar (Sparkflows Inc.)
Hadoop Platform Grand Ballroom East
Average rating: 4.25 (8 ratings)
Slides:   1-PDF 

Hadoop has evolved significantly in recent years, today serving as a unified platform for near-real-time (NRT) and batch workflows, such as querying, analysis and alerting for logs and machine data.

In this session, we’ll dive into the details of using SolrCloud and Cloudera Impala together to serve search queries, with Flume streaming events into Solr, Impala and HBase. Such a system supports the definition and extraction of data fields that can be incorporated into the user experience at scale. It also allows the definition of complex alerts over large datasets, again scaling with the right technologies underneath. Finally, we will explain in detail the front-end layer that gives users a consistent view across data in Solr, Impala and HBase. The beauty of such a system lies in its scalability, low total cost of ownership, and flexibility to allow the same data to be used for building out other features and products.
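As a rough illustration of what "defining and extracting data fields" and "defining alerts" can mean for machine data, here is a minimal, self-contained Python sketch. It is not taken from the session: the log format, the `FIELD_PATTERN` regex, and the `extract_fields` and `hosts_over_threshold` helpers are all hypothetical, and it runs in memory on a few sample lines instead of against Solr, Impala or HBase.

```python
import re
from collections import Counter

# Hypothetical field definition: a named-group regex applied to raw log lines.
# The "timestamp host LEVEL message" layout is an assumption for this sketch.
FIELD_PATTERN = re.compile(
    r"(?P<timestamp>\S+) (?P<host>\S+) (?P<level>[A-Z]+) (?P<message>.*)"
)


def extract_fields(line):
    """Return a dict of extracted fields, or None if the line does not match."""
    match = FIELD_PATTERN.match(line)
    return match.groupdict() if match else None


def hosts_over_threshold(events, level="ERROR", threshold=2):
    """Return the sorted hosts with at least `threshold` events at `level`."""
    counts = Counter(e["host"] for e in events if e["level"] == level)
    return sorted(host for host, n in counts.items() if n >= threshold)


# Made-up sample input, including one line that does not parse.
sample_lines = [
    "2013-10-30T16:32:00 web01 ERROR disk full",
    "2013-10-30T16:32:05 web01 ERROR disk full",
    "2013-10-30T16:32:07 web02 INFO request served",
    "not a log line",
]

# Drop lines that fail extraction, then evaluate the alert rule.
events = [f for f in map(extract_fields, sample_lines) if f is not None]
alerts = hosts_over_threshold(events)
print(alerts)
```

In a deployment like the one described, the extraction step would more plausibly run in the ingest path and the alert rule would be expressed as a query over the indexed or stored data; the sketch only shows the shape of the two definitions.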


Jayant Shekhar

Sparkflows Inc.

Jayant is a Solutions Architect at Cloudera, where he works with large and small companies across various verticals to build out their big data platforms, architecture, and algorithms. Before Cloudera, Jayant worked at Yahoo, where he was instrumental in building out the large-scale Content/Listings Platform using Hadoop and big data technologies, working with various Yahoo properties such as Real Estate, Autos, Local, News, and Movies. Before Yahoo, Jayant worked at eBay, building out a new Shopping Platform (K2) using Nutch/Hadoop and a Search Intelligence Platform, among others. Jayant has a Bachelor’s degree in Computer Science from IIT Kharagpur and a Master’s degree in Computer Engineering from San Jose State University.

Comments on this page are now closed.


Jayant Shekhar
10/30/2013 11:29pm EDT

Hi Marek, I’m in the process of posting the slides. They should soon be online.

Marek K Kolodziej
10/30/2013 4:32pm EDT

Would it be possible to post the slides here, like the other speakers have?

