Benchmarks represent a way for computing systems and databases to show how they stack up on a common workload. Often, benchmarking becomes a “space race” where vendors work to push the envelope of their joint systems with high-powered hardware and optimized databases. For companies looking to identify solutions based on benchmarks, high performance dreams are often tempered by high performance prices.
The rise of Hadoop, NoSQL, and other distributed computing platforms that rely on commodity hardware has led to a cultural shift away from “big iron” to systems that can scale out by adding lower cost commodity servers. While Hadoop has been able to do many things in the enterprise when it comes to big data, analytics, and applications, one hurdle it had not cleared was the TPC-C benchmark…until now.
In this session, we’ll review how a transactional Hadoop database was able to complete the TPC-C benchmark using HBase and SQL. We’ll also discuss the deep empirical analysis that we performed on the TPC-C benchmark to understand how it translates to Hadoop. Finally, we’ll share our results with the public for the first time, and show how transactional Hadoop can change the status quo for running operational applications.
The goal of this session is to help expand the footprint of Hadoop in the enterprise by educating attendees about the possibilities for Hadoop to replace traditional databases for OLTP and OLAP workloads.
Monte Zweben is the CEO and co-founder of Splice Machine, provider of the Hadoop RDBMS. A SQL-on-Hadoop solution, Splice Machine has helped many companies scale real-time applications using commodity hardware without application rewrites. A technology industry veteran, Monte’s early career was spent with the NASA Ames Research Center as the deputy chief of the artificial intelligence branch, where he won the prestigious Space Act Award for his work on the Space Shuttle program. Monte then founded and was the chairman and CEO of Red Pepper Software, a leading supply chain optimization company. In 1996 it merged with PeopleSoft, where he was VP and general manager, Manufacturing Business Unit.
In 1998, Monte founded Blue Martini Software, the leader in e-commerce and multi-channel systems for retailers, and served as its CEO. Blue Martini went public on NASDAQ in one of the most successful IPOs of 2000, and is now part of JDA. Monte is also the co-author of Intelligent Scheduling, and has published articles in the Harvard Business Review and various computer science journals and conference proceedings. He currently serves on the Board of Directors of Rocket Fuel Inc. as well as the Dean’s Advisory Board for Carnegie Mellon’s School of Computer Science.
John Leach led the development of Splice Machine, receiving several patents in distributed transaction processing and focusing on the development of Splice Machine’s dual-engine architecture. Prior to Splice Machine, John founded Incite Retail in June 2008 and led the company’s strategy and development efforts. At Incite Retail, he built custom big data systems (leveraging HBase and Hadoop) for Fortune 500 companies.
Prior to Incite Retail, he ran the business intelligence practice at Blue Martini Software and built strategic partnerships with integration partners. John was a key subject matter expert for Blue Martini Software in many strategic implementations across the world. His focus at Blue Martini was helping clients incorporate decision support knowledge into their current business processes using advanced algorithms and machine learning. John received dual bachelor’s degrees in biomedical and mechanical engineering from Washington University in Saint Louis. He is currently the organizer of the Saint Louis Hadoop Users Group and is active in the Washington University Eliot Society.
©2015, O'Reilly Media, Inc. • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners.
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.