Skip to main content
Haoyuan Li

Haoyuan Li
Founder & CTO, Alluxio

Website | @haoyuan

Haoyuan Li is a Computer Science PhD student in the AMP Lab at UC Berkeley, working with Scott Shenker and Ion Stoica on computer systems and cloud computing. He is the lead developer of Tachyon distributed file system. Before Berkeley, he studied at Cornell University and Peking University, and worked at Conviva and Google.


Hadoop & Beyond Rhinelander South
Tutorial Please note: to attend, your registration must include Tutorials on Monday.
Tathagata Das (Databricks), Haoyuan Li (Alluxio), Ion Stoica (UC Berkeley), Reynold Xin (Databricks), Sameer Agarwal (UC Berkeley)
Average rating: ****.
(4.80, 10 ratings)
An introduction to the open-source Berkeley Data Analytics Stack (BDAS). Spark is a high-speed cluster computing engine that supports rich analytics (e.g. machine learning) and lower-latency processing (e.g. streaming). Tachyon provides in-memory storage, letting Spark and Hadoop jobs share data efficiently. Shark and GraphX provide high-speed Hive SQL queries and graph processing on top of Spark. Read more.
Office Hour Table B
Haoyuan Li (Alluxio)
We will answer questions about various aspects of the Berkeley Data Analytics Stack (BDAS), including: * Spark: an open source cluster computing system that aims to make data analytics fast - both fast to run and fast to program. * Shark: a fast SQL query engine built on top of Spark that is compatible with Hive. * Tachyon: a high throughput, distributed in-memory storage system. Read more.


Sponsorship Opportunities

For exhibition and sponsorship opportunities, contact Susan Stewart at

Media Partner Opportunities

For information on trade opportunities with O'Reilly conferences email mediapartners

Press & Media

For media-related inquiries, contact Maureen Jennings at

Contact Us

View a complete list of Strata + Hadoop World 2013 contacts