Presented By O'Reilly and Cloudera
Make Data Work
March 28–29, 2016: Training
March 29–31, 2016: Conference
San Jose, CA
Mike Cafarella

Mike Cafarella
Assistant Professor, University of Michigan

Website | @MikeCafarella

Mike Cafarella is one of the cofounders of the Apache Hadoop and Nutch open source projects. Mike is also an assistant professor of computer science and engineering at the University of Michigan. His research interests include databases, information extraction, data integration, and data mining. Recently, he cofounded Lattice Data (, a company that aims to transform “dark data,” such as unstructured text documents and reports, into high quality structured databases.


4:30pm–5:00pm Tuesday, 03/29/2016
Hardcore Data Science
Location: 210 C/G
Tags: ai
Mike Cafarella (University of Michigan)
Average rating: ****.
(4.60, 10 ratings)
Dark data is the great mass of data buried in text, tables, figures, and images that lacks structure and so is essentially unprocessable by existing software. DeepDive is a system that extracts value from dark data. Mike Cafarella offers an introduction to DeepDive, exploring the key technical innovations that enable DeepDive to produce statistical inference at massive scale. Read more.
11:00am–11:40am Wednesday, 03/30/2016
Ben Lorica (O'Reilly), Doug Cutting (Cloudera), Mike Cafarella (University of Michigan)
Average rating: **...
(2.87, 15 ratings)
Ben Lorica hosts a conversation with Doug Cutting and Mike Cafarella, the cofounders of Apache Hadoop. Read more.