Make Data Work
Oct 15–17, 2014 • New York, NY
Haoyuan Li
Founder & CTO, Alluxio

Website | @haoyuan

Haoyuan Li is a Computer Science Ph.D. candidate in the AMPLab at UC Berkeley, where he works with Prof. Scott Shenker and Prof. Ion Stoica on big data and cloud computing. He leads Tachyon, an open source, memory-centric distributed file system that enables reliable file sharing at memory speed across cluster frameworks. He is a founding committer of Apache Spark and a co-creator of Spark Streaming. Before Berkeley, he worked at Conviva and Google, where he co-created the PFPGrowth algorithm, which is included in Apache Mahout. Haoyuan has an M.S. from Cornell University and a B.S. from Peking University, both in Computer Science.


4:15pm–4:55pm Thursday, 10/16/2014
Hadoop & Beyond
Location: 1 E20/1 E21
Haoyuan Li (Alluxio)
Average rating: 4.36 (11 ratings)
An introduction to Tachyon, a memory-centric storage system that originated at UC Berkeley. It enables different frameworks to share data at memory speed, and it is a major component of the Berkeley Data Analytics Stack (BDAS). The project is open source and is deployed at multiple companies. It has more than 30 contributors from over 10 institutions, including Yahoo, Intel, Red Hat, and Alibaba.
5:05pm–5:45pm Thursday, 10/16/2014
Office Hour
Location: Table C
Haoyuan Li (Alluxio)
Haoyuan is here to answer your questions about Tachyon, a memory-centric, fault-tolerant distributed file system that enables reliable file sharing at memory speed across cluster frameworks such as Spark and MapReduce. He’ll also share Tachyon use cases and the Tachyon roadmap, including upcoming features from AMPLab and industry partners.