Presented By O'Reilly and Cloudera
December 5-6, 2016: Training
December 6–8, 2016: Tutorials & Conference

Using big data technology to solve data connectivity in a disconnected world

Imron Zuhri (Mediatrac)
12:05pm–12:45pm Wednesday, December 7, 2016
Hadoop use cases
Location: 334/335 Level: Non-technical
Average rating: ***..
(3.33, 3 ratings)

Prerequisite Knowledge

  • Basic knowledge of Apache Spark

What you'll learn

  • Learn how knowledge graph and big data can be implemented in various industries to tackle data problems and improve business performance
  • Explore Mediatrac's big data technology platform focused on data connectivity, object profiling, and knowledge discovery and understand use cases


Many Asian countries, including Indonesia, are still living in an offline world. Most data are still in an analog form and are disconnected due to the lack of a single identity that could link them together. The two combinations made it very hard to analyze data for business or economic development. Most people don’t realize that, even today, most of data preparation process is still done manually and requires a lot of effort from a large number of data scientists to do it, which can be quite impractical.

Big data technology allows disconnected offline data to go digital and connect with one another, enabling companies and government to get a better view of individuals and other entities, contextually gain insights on what matters for them, and validate and optimize their spending and investments.

Mediatrac has been developing intelligent data preparation platform on top of Hadoop infrastructure using Apache Spark that combines Knowledge Graph and machine learning to automate the whole process and more quickly complete the process. Imron Zuhri shares several data connectivity use cases in the areas of marketing, sales and distribution, finance, telecommunication, healthcare, agriculture, and legal (in both the private sector and government) and explains how to leverage distributed computing to tackle massive entity recognition and resolution problems.

Topics include:

  • How Mediatrac collects and prepares data
  • The process of data cleansing and data connectivity in Indonesia
  • How Mediatrac purpose-built big data architecture to convince legacy teams to take a big data approach
  • Supporting tools using also the Knowledge Graph and machine learning that automate data scanning and the text-recognition process
  • How Mediatrac integrates data and contextually interconnects the data at the cell level
  • How Mediatrac utilizes Apache Spark and Apache Accumulo databases to contextually match data from one source to another
Photo of Imron Zuhri

Imron Zuhri


Imron Zuhri is the founder and chief technical director at Mediatrac, where he is responsible for herding the pack of nerds, the data scientist, and data engineers in the company. Together with his wife, Imron also established Erudio School of Art, the only democratic school of the arts high school in Indonesia. He has a wide interest in math, physics, astronomy, movies, music, photography, and literature, but first and foremost, he is obsessed with understanding human behavior, perhaps to compensate for his lack of social interaction.