Build & maintain complex distributed systems
October 1–2, 2017: Training
October 2–4, 2017: Tutorials & Conference
New York, NY

Schedule: Distributed Data & Databases sessions

Storage systems and how your systems deal with state management.

Track Host: Leif Walsh

11:35am12:15pm Tuesday, October 3, 2017
Location: Regent
Joseph Breuer (Netflix), Robert Reta (Netflix)
Average rating: *****
(5.00, 1 rating)
The Netflix download feature allows users to download content for offline playback. Implementing this feature required a new persistence architecture to maintain the state of user devices and content licenses. Joseph Breuer and Robert Reta explore the technical decisions behind the choice of a Cassandra event sourcing data store. Read more.
1:30pm2:10pm Tuesday, October 3, 2017
Location: Regent
Ben Linsay (Bumpers)
Average rating: **...
(2.50, 2 ratings)
Machine learning is as accessible as it has ever been, but it’s not always obvious how to go from a cool paper to serving production traffic. Ben Linsay helps you get started putting your paper into production, sharing lessons learned solving real problems with machine learning at Kickstarter. Read more.
2:25pm3:05pm Tuesday, October 3, 2017
Location: Regent
Leif Walsh (Two Sigma)
Average rating: **...
(2.00, 1 rating)
Leif Walsh offers an overview of Flint, Two Sigma's open source time series extension to Spark, explains how it fits in with the Spark programming model, and lays out the roadmap for the future of pandas, PySpark, and Flint. Read more.
3:50pm4:30pm Tuesday, October 3, 2017
Location: Regent
Rob Dickinson (resurface.io)
On the surface, adapting software to use persistent memory seems obvious. After all, persistent memory is simply fast memory that maintains state when the power goes out, like an SSD. But unlike SSDs, persistent memory challenges long-held ideas and conventions about how software works. Rob Dickinson outlines four key ideas that will help focus your persistent memory strategy. Read more.
1:30pm2:10pm Wednesday, October 4, 2017
Location: Grand Ballroom West
Zhenzhong Xu (Netflix)
Average rating: *****
(5.00, 2 ratings)
Keystone, a critical piece of Netflix's backend data infrastructure, ensures massive data movements and real-time event processing. Zhenzhong Xu leads a deep dive into Keystone's architecture and underlying stream processing engines, sharing insights and proven paths on how the company achieves multitenancy, scalability, and resilience in a complex cloud-native distributed system environment. Read more.
2:25pm3:05pm Wednesday, October 4, 2017
Location: Grand Ballroom West
Swaminathan Sundaramurthy (Salesforce Inc), Mark Cho (Pinterest)
Pinterest has to support real-time decision making while operating on petabyte-scale data. Swaminathan Sundaramurthy and Mark Cho offer an overview of Pinterest's real-time data pipeline (modeled on quasi-Kappa architecture), its impact on the company's systems, and tools and processes used and demonstrate how Pinterest models real-time ads analytics on the platform. Read more.
4:45pm5:25pm Wednesday, October 4, 2017
Location: Nassau
Oleksandr Petrov (Independent)
In the world of big and fast data, it's important to be fluent in storage and know the right tools for each job. Alex Petrov shares techniques for picking the right database and indexes, understanding the trade-offs different types of storage bring, scaling out your data and planning its growth, and finding the best resources on the subject. Read more.