How to Avoid the Three Stooges of Data Warehousing: Practical Lessons for Open Source BI Implementations

Data Warehousing and Business Intelligence, Tutorial
Location: Ballroom E Level: Novice
Average rating: ***..
(3.40, 5 ratings)

Generally, Data Warehouses or Data Marts are made up of three basic components:

  1. Extract/Transform/Load
  2. Database
  3. Business Intelligence / Reporting / Data Surfacing

Each key area has lots of benefits if implemented correctly but, like the Three Stooges, without good planning, communication, and a coordinated approach, bedlam can ensue!

With Open Source offerings reaching a level of maturity where they can be implemented into corporate BI solutions, emphasis is rapidly moving towards strong design and implementation fundamentals as many Open Source practitioners begin to embrace Business Intelligence solutions.

The presentation will focus on how to design and build a data warehouse, with emphasis on common pitfalls (pratfalls?) encountered by implementers and how to avoid them. We’ll also spend some time discussing why Data Marts are different from transactional solutions and how to ‘unlearn’ some of those fundamentals when designing and implementing a Data Mart or Data Warehouse, along with some real life examples using Open Source technologies.

My proposed agenda:

  • Data Warehousing – What is it? What isn’t it?
  • The three key components (ETL, Data Store, BI):

* What they do
* Why they are important
* What you should do in design
* What you should avoid.
* Overview of the open source vendor landscape.

  • Tool overload – and how to avoid it.
  • The disposable data mart:

* Should a data mart have a 3 year or less lifespan?

  • Trends in Open Source Data Warehousing and Business Intelligence:

* How it will affect your Business Intelligence initiatives.

  • Follow on learning opportunities.

My target audience is anyone just now getting into data warehousing. Many attendees may have, on their own, already built a data mart and may not be happy with the performance or the effort required to keep it running, or have found that their business clients are not satisfied, and are looking for guidance on best practices.

Photo of David  Lutz

David Lutz


David Lutz joined Infobright in April of 2008 as Director of Sales Engineering. David became an Infobright team member after 13 years in the data warehousing and business intelligence industry as a consumer, consultant and vendor representative of software products for this market.

David filled many roles in those years, including DBA, Data Warehouse Administrator and Data Warehouse Designer at organizations such as The Kroger Company and LensCrafters. As a member of the data warehouse software industry, David has served as a Data Warehouse Project Manager, Product Manager, Business Intelligence (BI) Integration Architect and at various levels of responsibility in the Sales/Systems Engineering organizations of Informix, Net Perceptions, Crystal Decisions, Business Objects, Pervasive Software/Data Junction, Composite Software and Greenplum Software.

David’s experience includes working in all aspects of industry with customers including Cornell University, The Kroger Company, Randalls Groceries, Tricon Global Restaurants (KFC, Taco Bell, Pizza Hut), JCPenney, GTE, Sabre Systems, XTO Energy and Impac Mortgage.

David’s experience spans all aspects of business intelligence – data modeling and administration, ETL/ELT and custom integration, data warehousing, data marting, data mining, operational/analytical BI, and heterogeneous data federation with such tools as Informix ODS/XPS, Illustra (PostgreSQL), Metacube, RedBrick, KD1, Microstrategy, Crystal Decisions suite of BI products, Business Objects BI and EIM tools – Acta/Data Integrator and FirstLogic, BIRT, Pentaho, JasperSoft, Cognos8, Composite Server, and various databases including open source databases like MySQL.

Co-presented By:

O'Reilly Media MySQL/Sun Microsystems
  • Kickfire
  • Virident
  • Infobright, Inc
  • JasperSoft
  • Intel
  • Advanced Micro Devices
  • BIRT Exchange by Actuate
  • Calpont
  • Canonical
  • Continuent
  • Dolphin Interconnect Solutions
  • Facebook
  • HiT Software, Inc.
  • IBM
  • iDashboards
  • Oracle
  • Pentaho
  • R1Soft
  • Schooner Information Technology
  • SQLstream
  • Ticketmaster
  • Zmanda, Inc.
  • Linux Journal

Sponsorship Opportunities

For information on exhibition and sponsorship opportunities at the conference, contact Sharon Cordesse at

Download the MySQL Sponsor/Exhibitor Prospectus

Media Partner Opportunities

Download the Media & Promotional Partner Brochure (PDF) for information on trade opportunities with O'Reilly conferences or contact mediapartners@

Press and Media

For media-related inquiries, contact Maureen Jennings at

MySQL Conference Newsletter

To stay abreast of conference news and to receive email notification when registration opens, please sign up for the MySQL Conference newsletter.

Contact Us

View a complete list of MySQL contacts.