Presented By O'Reilly and Cloudera
Make Data Work
31 May–1 June 2016: Training
1 June–3 June 2016: Conference
London, UK
Bikas Saha

Bikas Saha
Engineer, Hortonworks Inc

@bikassaha

Bikas Saha has been working in the Apache Hadoop ecosystem since 2011, focusing on YARN and the Hadoop compute stack, and is a committer/PMC member of the Apache Hadoop and Tez projects. Bikas is currently working on Apache Tez, a new framework to build high-performance data processing applications natively on YARN. He has been a key contributor in making Hadoop run natively on Windows. Prior to Hadoop, he worked extensively on the Dryad distributed data processing framework that runs on some of the world’s largest clusters as part of Microsoft’s Bing infrastructure.

Sessions

14:05–14:45 Thursday, 2/06/2016
Hadoop internals & development
Location: Capital Suite 13 Level: Advanced
Bikas Saha (Hortonworks Inc)
Average rating: ***..
(3.00, 6 ratings)
Hadoop is used to run large-scale jobs over hundreds of machines. Considering the complexity of Hadoop jobs, it's no wonder that Hadoop jobs running slower than expected remains a perennial source of grief for developers. Bikas Saha draws on his experience debugging and analyzing Hadoop jobs to describe the approaches and tools that can solve this difficult problem. Read more.