Skip to main content

Running On-premise Hadoop as a Business

Sumeet Singh (Yahoo)
Enterprise Data Grand Ballroom West
Average rating: ***..
(3.33, 3 ratings)

Cloud-based architectures of Hadoop have made it attractive for public cloud service providers to offer hosted Hadoop services and charge customers on a pay-for-what-you-use basis. For enterprises that have already adopted Hadoop, the data infrastructure has long been seen as a cost element in their budgets. As a result, enterprises thinking of adopting Hadoop are increasingly debating between on-premise and cloud-based models for their data processing needs.

We lay out a set of criteria and methodical approaches to help enterprises that have not yet adopted Hadoop evaluate their options, and discuss the pros and cons of both models. For enterprises that have already made significant investments or have plans to build a Hadoop-based infrastructure, we present an approach to manage Hadoop as a Service with a P&L, transparency in costs, and metering & billing provisions.

As we discuss these approaches, we will share insights gathered from the exercise conducted on one of the largest Hadoop footprints in the world. We will illustrate how to organize cluster resources, compile data required and typical sources, develop TCO models tailored for individual situations, derive unit costs for usage, measure the resource usage for services, optimize for higher utilization, and benchmark costs.

Photo of Sumeet Singh

Sumeet Singh


Sumeet Singh is Head of Products for Cloud Services and Hadoop at Yahoo! responsible for Product Management, Customer Engagements, Evangelism and Community Development, and Program Management for the Cloud Engineering Group. In this role, he also leads the Hadoop products team responsible for both Apache open source contributions and Yahoo! projects. Sumeet has over 14 years of Product Management, Product Development, and Strategy Consulting experience in the technology industry. Sumeet earned his MBA from UCLA Anderson School of Management and MS from Rensselaer Polytechnic Institute, NY.


Sponsorship Opportunities

For exhibition and sponsorship opportunities, contact Susan Stewart at

Media Partner Opportunities

For information on trade opportunities with O'Reilly conferences email mediapartners

Press & Media

For media-related inquiries, contact Maureen Jennings at

Contact Us

View a complete list of Strata + Hadoop World 2013 contacts