Our best understanding comes when conclusions fit evidence and when evidence and analysis is a good fit to the way life happens. That is in part why people are increasingly looking to work with data streams.
Telecommunications companies handle large volume streaming data and need to gain insights from anomaly detection and predictive modeling to understand their networks and their users. Web-based retail companies, IoT-based industries, and healthcare companies all have uses for streaming data as well.
Ellen Friedman demonstrates the advantages of a stream-based approach, exploring real-world situations in which companies in a variety of sectors are using stream processing, including in production, as she dives deeper into streaming issues such as low latency, windowing, and maintaining state—in essence different aspects of correctness. Examples will focus on best practices for streaming architecture, the importance of stream transport capabilities of tools like Apache Kafka, and how the new stream processing engine Apache Flink provides real-time or batch-based processing.
Ellen Friedman is principal technologist at MapR Technologies. Ellen is a committer on the Apache Drill and Apache Mahout projects and coauthor of a number of books on computer science, including Machine Learning Logistics, Streaming Architecture, the Practical Machine Learning series, and Introduction to Apache Flink. Ellen has been an invited speaker at Strata Data conferences, Big Data London, Big Data Paris, Berlin Buzzwords, Nike Tech Talks, the University of Sheffield Methods Institute in the UK, and NoSQL Matters Barcelona. She holds a PhD in biochemistry.
Comments on this page are now closed.
©2017, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.