Presented By O'Reilly and Cloudera
Make Data Work
September 26–27, 2016: Training
September 27–29, 2016: Tutorials & Conference
New York, NY

Alluxio (formerly Tachyon): The journey thus far and the road ahead

Haoyuan Li (Alluxio)
2:55pm–3:35pm Thursday, 09/29/2016
Data innovations
Location: 1 C04 / 1 C05
Level: Beginner

Prerequisite knowledge

  • A general understanding of distributed storage and distributed data processing

What you'll learn

  • Understand the new key features in Alluxio
  • Explore a roadmap of the system as well as a number of use cases

Description

    Haoyuan Li offers an overview of Alluxio (formerly Tachyon), a memory-speed virtual distributed storage system. The Alluxio open source community is one of the fastest-growing open source communities in big data history, with more than 250 developers from over 50 organizations around the world. The Alluxio system has been deployed at a number of companies, including Alibaba, Baidu, Barclays, Intel, Huawei, and Qunar. In some of these deployments, Alluxio has been running in production for over a year, managing petabytes of data.

    In the past year, the Alluxio project saw tremendous improvements in performance and scalability and was extended with key new features, including tiered storage, transparent naming, and unified namespace. At the same time, the Alluxio ecosystem has expanded to support more computation frameworks and more under storage systems, the latter including Amazon S3, Google Cloud Storage, Gluster, Ceph, HDFS, NFS, and OpenStack Swift. These integrations make it possible to leverage Alluxio in many different environments.

    This year, the goal is to make Alluxio accessible to an even wider set of users through a focus on security, new language bindings, and further stability improvements. In addition, the team is working on new APIs that allow applications to access data more efficiently and to manage data across different under storage systems.

    Haoyuan Li


    Haoyuan (H.Y.) Li is the founder, chairman, and CTO of Alluxio. He holds a PhD in computer science from UC Berkeley’s AMPLab, where he created the Alluxio (formerly Tachyon) open source data orchestration system, cocreated Apache Spark Streaming, and became an Apache Spark founding committer. He also holds an MS from Cornell University and a BS from Peking University, both in computer science.