Presented By O'Reilly and Cloudera
Make Data Work
December 1–3, 2015 • Singapore
Jun Liu

Senior Software Engineer, Intel

Jun Liu is a senior performance engineer in Intel’s Software and Services Group, where he works on big data performance modeling and simulation, especially for SQL-on-Hadoop systems. Before Intel, Jun was a postdoctoral researcher and senior member of the Database Performance and Migration group (DPMG) at Dublin City University. His primary research focus is data migration and database performance optimization. Jun also worked as a software engineer at Ericsson, where he took part in developing projects in real-time complex event processing and big data analysis. Jun holds a PhD in computing from Dublin City University, an MSc in advanced software engineering from University College Dublin, and a BSc in computer science from Dublin Institute of Technology.

Sessions

11:50am–12:30pm Wednesday, 12/02/2015
Hadoop Platform
Location: 334-335 Level: Intermediate
Jun Liu (Intel), Zhaojuan Bian (Intel)
Average rating: 3.86 (7 ratings)
In our experience, designing an Impala cluster for production poses many challenges, such as table schema design, data placement, file format selection, hardware selection, and software stack parameter tuning. We will walk through a real-world case study in the banking and financial services sector to illustrate how we use our simulator-based approach to design an Impala cluster.