Presented By O'Reilly and Cloudera
Make Data Work
Sept 29–Oct 1, 2015 • New York, NY
Siwei Zhu

Siwei Zhu
Sr Data Scientist , Scribd

Siwei Zhu is a data scientist at Scribd focused on understanding how users engage with the product. Previously, he has worked as a data scientist at Facebook.

Sessions

2:05pm–2:45pm Wednesday, 09/30/2015
Production Ready Hadoop
Location: 3D 05/08 Level: Intermediate
Siwei Zhu (Scribd), Kevin Perko (Scribd)
Average rating: ***..
(3.17, 12 ratings)
With the explosion of big data open source technologies, companies can now build a powerful data warehouse. But as they reach scale, they’ll find that patching together numerous projects requires building their own tools to manage the data pipeline. In this presentation we will talk about the tools you’ll likely need to build in-house to make your data infrastructure manageable. Read more.