Skip to main content
Make Data Work
Oct 15–17, 2014 • New York, NY

Big Data Anti-Patterns

Douglas Moore (Think Big Analytics)
11:50am–12:30pm Friday, 10/17/2014
Data Science
Location: 1D
Average rating: **...
(2.92, 12 ratings)
Slides:   external link

Big Data Anti-Patterns: Lessons from the Front Lines

The genesis of these popular anti-patterns come from three myths:

  1. “Oh, it’s a little bit in-efficient, just throw more machines at it” – Ruby vs. Java for ETL
  2. “It’s supported by the Hadoop vendor, let’s use that” – Flume vs. Spring Batch for bulk loads
  3. “It’s GA, let’s use that to replace our MPP database” – Impala vs. HBase for fast query

We will explore these myths and where they come from, the results and pain we’ve seen in real world projects, and what we did to correct the situation.

Photo of Douglas Moore

Douglas Moore

Think Big Analytics

Mr. Moore is Principal Big Data Architect for Think Big Analytics and architect of the system described in this presentation. Mr. Moore’s experience spans 25 years of integrating data collection, analysis, OLTP, OLAP, Data Warehouse and graphics systems. Mr. Moore has a Bachelors and Masters in Electrical Engineering and has worked for IBM, Sapient, SmartEnergy and Accius.

Comments on this page are now closed.


Picture of Young-Sik Chung
Young-Sik Chung
10/22/2014 3:50pm EDT

Thank you so much!
I appreciate for your kind help.

Picture of Douglas Moore
Douglas Moore
10/22/2014 8:37am EDT

The slides are posted to:

Picture of Young-Sik Chung
Young-Sik Chung
10/21/2014 10:10pm EDT

I want your presentation slides. Can you posting that material ASAP?