Presented By O’Reilly and Cloudera
Make Data Work
September 11, 2018: Training & Tutorials
September 12–13, 2018: Keynotes & Sessions
New York, NY

Executive Briefing: What you need to know about fast data

Dean Wampler (Anyscale)
2:00pm–2:40pm Thursday, 09/13/2018

What you'll learn

  • Understand the business motivations for fast data applications, the organizational challenges required when moving to streaming architectures, and how your organization can meet those challenges, drawing on what it already knows about microservices

Description

Big data started with an emphasis on batch-oriented architectures, where data is captured in large, scalable stores, then processed using batch jobs. To reduce the gap between data arrival and information extraction, these architectures are now evolving to be stream oriented, where data is processed as it arrives. Fast data is the new buzzword. It’s also a new opportunity for innovation in how your organization leverages data.

However, fast data architectures introduce new challenges for your organization. Whereas a batch job might run for hours, a stream processing application might run for weeks or months. This raises the bar for making these systems resilient against traffic spikes, hardware and network failures, and so forth. The microservice world has faced these challenge for a while. Therefore, your data teams will need to evolve to resemble the teams you already have for your microservice-based systems. In fact, you’ll probably merge these teams over time, as your microservices do more data processing and your data systems leverage your microservices.

Join Dean Wampler to learn everything you need to exploit fast data successfully and explore some real-world success stories.

Photo of Dean Wampler

Dean Wampler

Anyscale

Dean Wampler is an expert in streaming data systems, focusing on applications of machine learning and artificial intelligence (ML/AI). He’s head of developer relations at Anyscale, which is developing Ray for distributed Python, primarily for ML/AI. Previously, he was an engineering VP at Lightbend, where he led the development of Lightbend CloudFlow, an integrated system for building and running streaming data applications with Akka Streams, Apache Spark, Apache Flink, and Apache Kafka. Dean is the author of Fast Data Architectures for Streaming Applications, Programming Scala, and Functional Programming for Java Developers, and he’s the coauthor of Programming Hive, all from O’Reilly. He’s a contributor to several open source projects. A frequent conference speaker and tutorial teacher, he’s also the co-organizer of several conferences around the world and several user groups in Chicago. He earned his PhD in physics from the University of Washington.