Presented By O'Reilly and Cloudera
Make Data Work
September 25–26, 2017: Training
September 26–28, 2017: Tutorials & Conference
New York, NY

Streaming visual analytics: What's possible today and what's coming tomorrow

Shant Hovsepian (Arcadia Data)
4:35pm5:15pm Thursday, September 28, 2017
Stream processing and analytics
Location: 1E 07/08 Level: Intermediate
Average rating: ****.
(4.00, 1 rating)

Who is this presentation for?

  • Data analysts and architects and security and compliance officers

Prerequisite knowledge

  • A basic understanding of data visualization techniques and streaming or message queues
  • General familiarity with BI tools

What you'll learn

  • Understand streaming visual analytics and the limits of current approaches to visualizing streams
  • Learn how to react with and visualize data directly from streams in Kafka, Spark, and Flink

Description

As big data is shifting from a world at rest to a world in motion, we need visualization applications that have been architected to handle streaming systems for use cases from cybersecurity to the IoT to financial services. After all, time is money, and the sooner you deliver insights the greater the impact.

However, all of our existing business intelligence and visualization tools have been designed to work with legacy batch-oriented systems. Streaming visual analytics is a new technique for visualizing and interacting with streaming data in near real time. Shant Hovsepian explains how lambda- and polling-based architectures are being disrupted by reactive visualization systems, as streaming engines embrace the CQRS pattern, and offers analysis of visualizing streams from Apache Kafka, Apache Flink, and Apache Spark. Shant explores current approaches to doing visual analysis on streaming data, along with some of their shortcomings, and details what’s possible with the next generation for streaming analysis systems. You’ll learn how streaming visual analytics can be implemented atop Spark Structured Streaming, Kafka Streams, and Flink and Queryable State.

Topics include:

  • Reactive web applications
  • The differences between polling, SSE, and WebSockets
  • How CQRS fundamentally changes the game
  • The need for schema registries as an alternative to SQL catalogs
  • Issues with staging/querying in key-value stores
  • Visualizing windows
Photo of Shant Hovsepian

Shant Hovsepian

Arcadia Data

Shant Hovsepian is a cofounder and CTO of Arcadia Data, where he is responsible for the company’s long-term innovation and technical direction. Previously, Shant was an early member of the engineering team at Teradata, which he joined through the acquisition of Aster Data. Shant interned at Google, where he worked on optimizing the AdWords database, and was a graduate student in computer science at UCLA. He is the coauthor of publications in the areas of modular database design and high-performance storage systems.