Our best understanding comes when conclusions fit evidence and when evidence and analysis is a good fit to the way life happens. That is in part why people are increasingly looking to work with data streams.
Telecommunications companies handle large volume streaming data and need to gain insights from anomaly detection and predictive modeling to understand their networks and their users. Web-based retail companies, IoT-based industries, and healthcare companies all have uses for streaming data as well.
Ellen Friedman demonstrates the advantages of a stream-based approach, exploring real-world situations in which companies in a variety of sectors are using stream processing, including in production, as she dives deeper into streaming issues such as low latency, windowing, and maintaining state—in essence different aspects of correctness. Examples will focus on best practices for streaming architecture, the importance of stream transport capabilities of tools like Apache Kafka, and how the new stream processing engine Apache Flink provides real-time or batch-based processing.
Ellen Friedman is a data technologist with a Ph.D. in biochemistry. She is a committer for Apache Drill and Apache Mahout projects and co-author of books including AI & Analytics in Production, Machine Learning Logistics, Streaming Architecture, the Practical Machine Learning series, and Introduction to Apache Flink, all published by O’Reilly Media. Ellen has been a keynote speaker at JFokus in Stockholm, Big Data London and NoSQL Matters Barcelona and an invited speaker at Strata Data conferences, Berlin Buzzwords, Nike Tech Talks, and the University of Sheffield Methods Institute.
Comments on this page are now closed.
©2017, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.