July 20–24, 2015
Portland, OR
Jacky Li

engineer, Huawei technology

Jacky Li joined Huawei in 2004 and has since worked on research and development in telecommunications protocols, network service systems, network data analysis, and visualization. In recent years, he has focused on finding opportunities for innovation in network data analysis using open source big data processing and analytics technologies such as Apache Hadoop, Spark, and Tachyon.

Sessions

1:30pm–5:00pm Tuesday, 07/21/2015
Paco Nathan (derwen.ai), Haichuan Wang (Huawei), Jacky Li (Huawei technology), Vimal Das Kammath V (Huawei)
This tutorial provides a hands-on introduction to Apache Spark, with coding exercises for Spark apps in Python, Scala, R, and SQL. We will review the Spark core API, show how to build a pipeline with SQL + DataFrames, and look through the broader Spark ecosystem: Tungsten, Streaming, MLlib, and GraphX.
4:10pm–5:40pm Thursday, 07/23/2015
Sponsored E 143/144
Paco Nathan (derwen.ai), Jacky Li (Huawei technology)
This session provides an introduction to Apache Spark, starting with a brief overview of how and why it evolved. It then covers the Spark core API with examples in Python and Scala, shows how to build a pipeline with SQL + DataFrames, and looks through the broader Spark ecosystem: Tungsten, Streaming, MLlib, GraphX, Packages, etc. It also links out to many case studies of production use at scale for Spark.