Sep 23–26, 2019
Please log in

Online machine learning in streaming applications

Stavros Kontopoulos (Lightbend), Debasish Ghosh (Lightbend)
11:20am12:00pm Thursday, September 26, 2019
Location: 1A 21/22
Average rating: ****.
(4.00, 2 ratings)

Who is this presentation for?

  • Machine learning (ML) engineers, data scientists, data engineers, data architects, and IoT experts

Level

Intermediate

Description

Applications such as smart homes, smart monitoring of industrial environments, augmented reality in retail, and autoconnected cars are driving a new era in online ML, where ML algorithms have been moved to the edge instead of the cloud. These applications are constrained in terms of resources like power, CPU, memory, etc. and responsiveness. Data flows in the system and the application needs to interact with the surrounding environment in a given time window.

Stavros Kontopoulos and Debasish Ghosh explore the foundations of the algorithmic aspects (Hoeffding Adaptive Trees, classic sketch data structures, and drift detection algorithms) of these applications and dive into the details of how they can be implemented and deployed efficiently in production. They evaluate production concerns like performance (latency, memory footprint, etc.), techniques for updating models being served in a running pipeline and future trends like feature space representation and sampling, and tools to use for the actual implementation and deployment of these algorithms.

The concepts they detail can also be applied in a cloud setting, so many of the practical aspects they cover are universal and will benefit any practitioner of ML. The main focus is cutting-edge applications and technologies, and you don’t want to miss a glance in the future.

Prerequisite knowledge

  • A basic understanding of ML, streaming, tools for writing streaming applications like Apache Spark, Apache Flink, and tools for application deployment and orchestration like Kubernetes

What you'll learn

  • Gain an understanding of online ML, combining ML with streaming, and tools and best practices for deploying online ML-based streaming applications
Photo of Stavros Kontopoulos

Stavros Kontopoulos

Lightbend

Stavros Kontopoulos is a senior engineer on the data systems team at Lightbend. He implemented Lightbend’s fast data strategy. Previously, he built software solutions that scale in different verticals like telecoms and marketing. His interests include distributed system design, streaming technologies, and NoSQL databases.

Photo of Debasish Ghosh

Debasish Ghosh

Lightbend

Debasish Ghosh is a principal software engineer at Lightbend. He’s passionate about technology and open source, loves functional programming, and has been trying to learn math and machine learning. Debasish is an occasional speaker in technology conferences worldwide, including the likes of QCon, Philly ETE, Code Mesh, Scala World, Functional Conf, and GOTO. He’s the author of DSLs In Action and Functional & Reactive Domain Modeling. Debasish is a senior member of ACM. He’s also a father, husband, avid reader, and Seinfeld fan, who loves spending time with his beautiful family.

  • Cloudera
  • O'Reilly
  • Google Cloud
  • IBM
  • Cisco
  • Dataiku
  • Intel
  • Io-Tahoe
  • MemSQL
  • Microsoft Azure
  • Oracle Cloud Infrastructure
  • SAS
  • Arcadia Data
  • BMC Software
  • Hazelcast
  • SAP
  • Amazon Web Services
  • Anaconda
  • Esri
  • Infoworks.io, Inc.
  • Kyligence
  • Pitney Bowes
  • Talend
  • Google Cloud
  • Confluent
  • DataStax
  • Dremio
  • Immuta
  • Impetus Technologies Inc.
  • Keyence
  • Kyvos Insights
  • StreamSets
  • Striim
  • Syncsort
  • SK holdings C&C

    Contact us

    confreg@oreilly.com

    For conference registration information and customer service

    partners@oreilly.com

    For more information on community discounts and trade opportunities with O’Reilly conferences

    strataconf@oreilly.com

    For information on exhibiting or sponsoring a conference

    pr@oreilly.com

    For media/analyst press inquires