Presented By O’Reilly and Intel AI
Put AI to work
8-9 Oct 2018: Training
9-11 Oct 2018: Tutorials & Conference
London, UK

Executive Briefing: How to augment sparse training sets with synthetic data

Daeil Kim (AI.Reverie)
16:00–16:40 Wednesday, 10 October 2018
AI Business Summit
Location: Blenheim Room - Palace Suite
Secondary topics: Computer Vision, Data Networks and Data Markets

What you'll learn

  • Understand the advantages of synthetic data


Synthetic data promises massive sets of perfectly generated training data for a fraction of the cost of manually sourced annotated data. But doubt remains about the efficacy of using synthetic datasets to train machine learning models.

Daeil Kim delineates the advantages of synthetic data and explains how to avoid traps that lead to dead zones and false positives. He also reviews work on simulations for synthetic data in application verticals in which it is traditionally difficult to manually acquire significant datasets. If you have problems with sparse datasets for training, this is the talk for you.

Daeil Kim


Driven by a passion to create a better world with AI, Daeil Kim created AI.Reverie, a simulation platform to train AI to understand the world and make it better. Daeil believes that we can create a future where issues related to food, shelter, and health can be efficiently met with the help of AI. Daeil grew up in New York City. He holds a liberal arts degree from Sarah Lawrence College, focusing on literature. An interest in medicine led him to New Mexico to research schizophrenia and to understand mental illness through artificial intelligence. He then pursued a PhD in computer science at Brown University, focusing on the development of scalable machine learning algorithms. Afterward, his interests in developing tools for investigative journalism led him to pursue a career as a data scientist at the New York Times.