Fueling innovative software
July 15-18, 2019
Portland, OR

Schedule: Building Data-Intensive Applications sessions

Add to your personal schedule
9:00am12:30pm Tuesday, July 16, 2019
Location: C123-124
Secondary topics:  Data Driven
Tim Berglund (Confluent), Brandon Bird (Confluent)
Average rating: ****.
(4.54, 13 ratings)
Join Tim Berglund to learn how to produce and consume a Kafka topic, integrate Kafka with a database using Kafka Connect, and perform real-time stream processing on Kafka data. Read more.
Add to your personal schedule
11:00am11:40am Wednesday, July 17, 2019
Location: D138-140
Secondary topics:  Data Driven
Grishma Jena (IBM)
Average rating: ****.
(4.25, 12 ratings)
Today’s world generates different kinds of data at unbelievably rapid rates. Grishma Jena explains data science, the data science pipeline, and algorithms using real-life examples. Read more.
Add to your personal schedule
11:50am12:30pm Wednesday, July 17, 2019
Location: D138-140
Secondary topics:  Data Driven
Michael Enescu (Project EAN), Peter Enescu (University of California San Diego)
Average rating: ****.
(4.60, 5 ratings)
Fires caused by electric grid failures are increasing at an alarming rate. Michael Enescu and Peter Enescu examine how the energy adaptive networks technology built on open source and used to monitor and control power grids forms a planetary skin that can be used to predict and avoid such disasters as the Napa and Paradise Fires. Read more.
Add to your personal schedule
1:45pm2:25pm Wednesday, July 17, 2019
Location: D138-140
Secondary topics:  Data Driven
Christophe Pettus (PostgreSQL Experts, Inc.)
Average rating: ****.
(4.91, 11 ratings)
Applications from social media to healthcare to media are increasingly focused on humans and their relationships. But people do not lend themselves easily to being reduced to small slots. Christophe Pettus draws on his experience in data modeling and application design to examine how to successfully approach modeling humans and their relationships and legal compliance issues in a GDPR world. Read more.
Add to your personal schedule
2:35pm3:15pm Wednesday, July 17, 2019
Location: D138-140
Secondary topics:  Data Driven
Bas Geerdink (Aizonic)
Average rating: ****.
(4.50, 2 ratings)
Streaming analytics is a popular subject in enterprise organizations because customers want real-time experiences, such as notifications and advice based on online behavior and other users’ actions. Bas Geerdink details an open source reference solution for streaming analytics that covers many use cases that follow a "pipes and filters" pattern, built with Scala, Flink, Kafka, and Cassandra. Read more.
Add to your personal schedule
4:15pm4:55pm Wednesday, July 17, 2019
Location: D138-140
Secondary topics:  Data Driven
Wenbo Zhu (Google)
Average rating: **...
(2.67, 9 ratings)
When designing APIs such as the new GCP Firestore real-time database and Google Assistant, how did Google decide which trade-offs to make? Wenbo Zhu dives deep into the challenges faced while deploying a real-time streaming API designed for clients from data centers to the internet and details the trade-offs API developers need be aware of when designing such an API. Read more.
Add to your personal schedule
5:05pm5:45pm Wednesday, July 17, 2019
Location: D138-140
Secondary topics:  Data Driven
Shradha Ambekar (Intuit)
Average rating: ****.
(4.50, 2 ratings)
Cassandra is one of the most popular datastores in big data and ML applications. Data analysis at scale with fast query response is critical for business needs, and while Cassandra with Spark integration allows running an analytical workload, it can be slow. Shradha Ambekar dives into the challenges faced at Intuit and the solutions her team implemented to improve performance by 100x. Read more.
Add to your personal schedule
11:00am11:40am Thursday, July 18, 2019
Location: E143/144
Secondary topics:  Data Driven
Amy Hodler (Neo4j), William Lyon (Neo4j)
Average rating: ****.
(4.70, 10 ratings)
Graphs provide a method to store and analyze the relationships within the data. Algorithms deepen our understanding of data through aggregation and perspectives to help developers make valuable business decisions for the future based on existing scenarios. Amy Hodler and Mark Needham lead you through a crash course in how to use graph algorithms as part of your big data toolkit. Read more.
Add to your personal schedule
11:50am12:30pm Thursday, July 18, 2019
Location: E143/144
Secondary topics:  Data Driven
Average rating: ***..
(3.00, 5 ratings)
Managing large stateful applications is tough. Matt Schallert outlines the challenges of automating stateful systems at scale and details how embracing a declarative approach can ease operation and automation of these systems on orchestrators such as Kubernetes. He then demonstrates how to apply this methodology to different types of stateful workloads. Read more.
Add to your personal schedule
1:45pm2:25pm Thursday, July 18, 2019
Location: E143/144
Secondary topics:  Data Driven
Jiaqi Liu (Button)
Average rating: ****.
(4.33, 3 ratings)
Data-intensive applications, with many layers of transformations and movement from different data sources, can often be challenging to maintain and iterate even after they are initially built and validated. Jiaqi Liu explores how to factor in monitoring, alerting, and tracing data lineage when building data applications that move and transform data across multiple dependencies. Read more.
Add to your personal schedule
2:35pm3:15pm Thursday, July 18, 2019
Location: E143/144
Secondary topics:  Data Driven
Alex Silva (Pluralsight)
Average rating: ***..
(3.67, 3 ratings)
Alex Silva outlines lessons learned, common pitfalls, and design traps when designing your streaming data infrastructure, and he shares 21 best practices and standards used at Pluralsight. Read more.
Add to your personal schedule
4:15pm4:55pm Thursday, July 18, 2019
Location: E143/144
Secondary topics:  Data Driven
Vicențiu Ciorbaru (MariaDB Foundation)
Average rating: *****
(5.00, 1 rating)
With so many moving parts, it's hard for the average database administrator (DBA) or database developer to come up with a good explanation for why the optimizer chooses certain query plans. Vicențiu Ciorbaru dives deep into how a modern database query optimizer works to optimize your queries and how you can help it work for you. Read more.