Presented By O’Reilly and Cloudera

San Francisco • London • New York

Make Data Work

21–22 May 2018: Training
22–24 May 2018: Tutorials & Conference
London, UK

Expo Hall sessions

Wednesday, 23 May & Thursday, 24 May 2018

11:15–11:55 Wednesday, 23 May 2018

Revolutionizing the newsroom with artificial intelligence

Location: Expo Hall Level: Beginner

Secondary topics: Media, Advertising, Entertainment

Dan Gilbert (News UK), Jonathan Leslie (Pivigo)

Average rating:

(3.75, 4 ratings)

In the era of 24-hour news and online newspapers, editors in the newsroom must quickly and efficiently make sense of the enormous amounts of data that they encounter and make decisions about their content. Daniel Gilbert and Jonathan Leslie discuss an ongoing partnership between News UK and Pivigo in which a team of data science trainees helped develop an AI platform to assist in this task. Read more.

12:05–12:45 Wednesday, 23 May 2018

Interpretable AI: Can we trust machine learning?

Location: Expo Hall Level: Intermediate

Konstantinos Georgatzis (QuantumBlack), Martha Imprialou (QuantumBlack)

Konstantinos Georgatzis and Martha Imprialou explain how to interpret the predictions given by your black-box model and how machine learning is helping to drive decision making today. Read more.

14:05–14:45 Wednesday, 23 May 2018

Time for a new relation: Going from RDBMS to a graph database

Location: Expo Hall Level: Intermediate

Secondary topics: Time Series and Graphs

Tags:

Patrick McFadin (DataStax)

Average rating:

(5.00, 2 ratings)

Graph databases are becoming mainstream. Patrick McFadin explains how to use the knowledge you have gained from your years of working with relational databases in this brave new world. There are many similarities but also some significant differences that can open up completely new use cases. If you're deciding whether to take the plunge into graph databases, this is the talk for you. Read more.

14:55–15:35 Wednesday, 23 May 2018

Machine-learned model quality monitoring in fast data and streaming applications

Location: Expo Hall Level: Intermediate

Secondary topics: Managing and Deploying Machine Learning

Emre Velipasaoglu (Lightbend)

Average rating:

(3.67, 3 ratings)

Most machine learning algorithms are designed to work on stationary data, but real-life streaming data is rarely stationary. Models lose prediction accuracy over time if they are not retrained. Without model quality monitoring, retraining decisions are suboptimal and costly. Emre Velipasaoglu reviews monitoring methods, focusing on their applicability in fast data and streaming applications. Read more.

16:35–17:15 Wednesday, 23 May 2018

Data-driven ecosystems in the automotive industry

Location: Expo Hall Level: Intermediate

Tobias Burger (BMW Group), Joshua Goerner (BMW AG)

Average rating:

(5.00, 1 rating)

The BMW Group IT team drives the usage of data-driven technologies and forms the nucleus of a data-centric culture inside of the organization. Tobias Bürger and Joshua Görner discuss the E-to-E relationship of data and models and share best practices for scaling applications in real-world environments. Read more.

11:15–11:55 Thursday, 24 May 2018

Modeling time series in R

Location: Expo Hall Level: Beginner

Secondary topics: Time Series and Graphs

Jared Lander (Lander Analytics)

Average rating:

(4.00, 2 ratings)

Temporal data is being produced in ever-greater quantity, but fortunately our time series capabilities are keeping pace. Jared Lander explores techniques for modeling time series, from traditional methods such as ARMA to more modern tools such as Prophet and machine learning models like XGBoost and neural nets. Along the way, Jared shares theory and code for training these models. Read more.

12:05–12:45 Thursday, 24 May 2018

A heretical monitoring view: Using PostgreSQL to store Prometheus metrics and visualizing them in Grafana

Location: Expo Hall Level: Intermediate

Secondary topics: Time Series and Graphs

Erik Nordström (Timescale)

Erik Nordström explains how and why to use PostgreSQL as a Prometheus backend to support complex questions (and get a proper SQL interface), offers an overview of pg_prometheus, a custom Prometheus datatype, and prometheus-postgresql-adapter, a remote storage adaptor for PostgreSQL, and shares his experience with TimescaleDB, which enables PostgreSQL to scale for classic monitoring volumes. Read more.

14:05–14:45 Thursday, 24 May 2018

Spark NLP in action: Intelligent, high-accuracy fact extraction from long financial documents

Location: Expo Hall Level: Intermediate

Secondary topics: Financial Services, Text and Language processing and analysis

David Talby (Pacific AI), Saif Addin Ellafi (John Snow Labs), Paul Parau (UiPath)

Average rating:

(4.50, 4 ratings)

Spark NLP natively extends Spark ML to provide natural language understanding capabilities with performance and scale that was not possible to date. David Talby, Saif Addin Ellafi, and Paul Parau explain how Spark NLP was used to augment the Recognos smart data extraction platform in order to automatically infer fuzzy, implied, and complex facts from long financial documents. Read more.

14:55–15:35 Thursday, 24 May 2018

Big data meets renewable energy: Building a real-time asset management platform for renewable energy

Location: Expo Hall Level: Beginner

Secondary topics: Data Integration and Data Pipelines sessions, Data Platforms

Stamatis Stefanakos (D ONE AG)

Average rating:

(4.33, 3 ratings)

Switzerland-based startup WinJi capitalizes on two current megatrends: big data and renewable energy. Stamatis Stefanakos offers an overview of WinJi's TruePower Asset Management Platform, covering the overall architecture and the motivation behind it, the physics behind the data, and the business case. Read more.

Presented by

Elite Sponsors

Exabyte Sponsor

Impact Sponsors

Supporting Sponsor

Sponsorship Opportunities

For exhibition and sponsorship opportunities, email strataconf@oreilly.com

Partner Opportunities

For information on trade opportunities with O'Reilly conferences, email partners@oreilly.com

Contact Us

View a complete list of Strata Data Conference contacts

©2018, O’Reilly UK Ltd • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com