HBase is an online database so response latency is critical. This talk will examine sources of latency in HBase, detailing steps along the read and write paths. We’ll examine the entire request lifecycle, from client to server and back again. We’ll also look at the different factors that impact latency, including GC, cache misses, and system failures. Finally, the talk will highlight some of the work done in 0.96+ to improve the reliability of HBase.
Nick stumbled into HBase in 2008 when his nightly ETL jobs started taking 20+ hours to complete. Since then, he has applied Hadoop and HBase to projects in social media, social gaming, click-stream analysis, climatology, and geographic data. Nick also helped establish Seattle’s Scalability Meetup and tried his hand at entrepreneurship. Most recently, he is an author on HBase in Action, available now. His has a passion for scalable access to scientific data.
Nicolas has stayed focused on the software architecture business at various positions including Head of Architecture at Thomson Reuters for the Risk Management product line. He has been deeply part of the Big Data arena for more than two years, working especially with Hortonworks on HBase MTTR. He combines traditional software and enterprise architecture skills with a deep knowledge of Big Data architecture. Nicolas is PMC member for the HBase project. He is also cofounder of Scaled Risk, a company that provides a Big Data solution on top of Hadoop and HBase.
Comments on this page are now closed.