Personal schedule for Mark Madsen


Sponsored Session, Ballroom F
Please note: to attend, your registration must include Tutorials.
Richard Taylor (HPCC Systems from LexisNexis Risk Solutions)
While extracting entities from massive amounts of text is a major problem, a proven solution exists. This tutorial will demonstrate a natural language parsing technology to extract entities from all kinds of text using massively parallel clusters. Read more.
Mission City Ballroom Foyer
Average rating: *****
(5.00, 1 rating)
Two events are happening at the same time and place: *Mini Maker Faire*, a showcase of innovative data-related hardware, apps, and projects, and *Data Crush*, an experiment combining wine-tasting with the gathering, analysis, and application of data to track behavioral trends and influencing factors. Read more.
Mission City Ballroom
This presentation will be streamed live.
Edd Wilder-James (Silicon Valley Data Science), Alistair Croll (Solve For Interesting)
Opening remarks by the Strata program chairs, Edd Dumbill and Alistair Croll. Read more.
Mission City Ballroom
This presentation will be streamed live.
Abhishek Mehta (Tresata)
Average rating: ***..
(3.67, 3 ratings)
How big data tools and technologies give us back our individual identity ... because if you didn't know you were unique and special, well, you are. Big data can be applied to solving socio-economic problems that rival the scale and importance of building ad optimization models. Read more.
Mission City Ballroom
This presentation will be streamed live.
Mike Olson (Cloudera)
Average rating: ****.
(4.00, 3 ratings)
Tools for attacking big data problems originated at consumer internet companies, but the number and variety of big data problems have spread across industries and around the world. I'll present a brief summary of some of the critical social and business problems that we're attacking with the open source Apache Hadoop platform. Read more.
Mission City Ballroom
This presentation will be streamed live.
Ben Goldacre (Bad Science)
Average rating: ****.
(4.67, 6 ratings)
Negative results from clinical trials go missing far too often, leading us to overestimate the benefits of treatments. Attempts to remedy this problem haven't worked well. Ben Goldacre, both a doctor and data geek, will talk about how to fix this, and other, problems in medicine. Read more.
Data Science, Mission City B1
Q McCallum (@qethanm)
Average rating: *....
(1.00, 1 rating)
The biggest problem in data science is ... the data itself. Read more.
Sponsored Session, Ballroom H
Eddie Satterly (Splunk), Sanjay Mehta (Splunk)
In this session, Expedia, one of the world’s leading online travel companies, describes how they tapped into their massive machine data to deliver unprecedented insights across key IT and business areas – from ad metrics and risk analysis, to capacity planning, security, and availability analysis. Read more.
Visualization & Interface, Ballroom AB
This presentation will be streamed live.
Jock Mackinlay (Tableau Software)
Average rating: ***..
(3.50, 4 ratings)
Visual analysis is an iterative process for working with data that exploits the power of the human visual system. The formal core of visual analysis is the mapping of data to appropriate visual representations. Learn what years of research have taught us about designing visualizations people can learn from and understand. Read more.
Business & Industry, Mission City B4
Josh Green (Panjiva)
Despite the hype, Big Data has yet to live up to its potential. Why? Because we’ve spent too much time thinking about the data itself and not enough time considering which business decisions can be improved through the intelligent application of data. Panjiva CEO Josh Green will discuss an alternative approach: starting with a challenging business problem and then tracking down relevant data. Read more.
Exhibit Hall
Grab a drink, mingle with fellow Strata participants, and see the latest technologies and products from leading companies in the data space. Read more.
Event, Mission City Ballroom Foyer
Average rating: *....
(1.00, 1 rating)
Don't miss Startup Showcase, Strata's live demo program and competition for startups and early-stage companies. With a panel of industry experts providing real-time feedback, Startup Showcase happens during Strata Conference on Wednesday, February 29, 2012. Read more.
Mission City Ballroom
This presentation will be streamed live.
Alistair Croll (Solve For Interesting), Edd Wilder-James (Silicon Valley Data Science)
Opening remarks by the Strata program chairs, Alistair Croll and Edd Dumbill. Read more.
Mission City Ballroom
This presentation will be streamed live.
Jonathan Gosier (AuDigent)
Average rating: **...
(2.00, 2 ratings)
Big data isn't just an abstract problem for corporations, financial firms, and tech companies. To your mother, a 'big data' problem might simply be too much email, or a lost file on her computer. We need to democratize access to the tools used for understanding information by taking the hard work out of drawing insight from excessive quantities of information. Read more.
Mission City Ballroom
This presentation will be streamed live.
Luke Lonergan (Greenplum, a division of EMC)
Average rating: ***..
(3.33, 3 ratings)
How are businesses using big data to connect with their customers, deliver new products or services faster and create a competitive advantage? Learn about the changing nature of customer intimacy and how the technologies and techniques around big data analysis provide business advantage in today's social, mobile environment – and why it is imperative to adopt a big data analytics strategy. Read more.
Mission City Ballroom
This presentation will be streamed live.
Coco Krumme (MIT Media Lab)
Average rating: ***..
(3.00, 3 ratings)
Why data can tell us only so much about food, flavor, and our preferences. Read more.
Mission City Ballroom
This presentation will be streamed live.
Usman Haque (
Average rating: ****.
(4.33, 3 ratings)
The expected massive growth of the connected device, appliance, and sensor markets in the coming years - often called 'The Internet of Things' - will need a richer concept of 'open data' than is currently common. Read more.
Mission City Ballroom
This presentation will be streamed live.
Hal Varian (Google)
Average rating: ****.
(4.60, 5 ratings)
Google Insights for Search provides an index of search activity for millions of queries. These queries can sometimes help us understand consumer behavior. Hal describes some of the issues that arise in trying to use this data for short-term economic forecasts and provides examples. Read more.
Data Science, Mission City B1
This presentation will be streamed live.
Theo Schlossnagle (OmniTI/Circonus)
Average rating: ***..
(3.67, 3 ratings)
In today's environments, we're often forced to collect data before we know if it will be useful. This tendency leads to seas of data, flowing in real time with very little structure or understanding of what the data means. Given that, how can you tell when data is "normal"? Let's find out. Read more.
Stefan Groschupf (Datameer)
This session discusses financial services use cases and challenges in using Hadoop analytics, including long-term storage and analysis of transactions; identifying cross-sell and up-sell opportunities by analyzing web log files and customer profiles; value-at-risk analytics; and understanding SLA issues and identifying problems in a large, service-oriented architecture with thousands of nodes. Read more.
Nathan Marz (Twitter)
Average rating: **...
(2.00, 1 rating)
Storm is an open-source realtime computation system relied upon by Twitter for much of its analytics. Storm does for realtime computation what Hadoop did for batch computation. It has a huge range of applications and combines ease of use with a robust foundation. Read more.
Business & Industry, Ballroom E
J. C. Herz (Wired)
Average rating: ****.
(4.00, 1 rating)
This talk uses the OODA Loop concept (Observe, Orient, Decide, Act) as a framework to categorize Big Data use cases and data-driven services and the front-ends to those services. Rather than starting with the underlying technology or the data sources, the OODA loop starts with WHY the user needs information. It answers the question of when a black box beats an analytic tool, and vice versa. Read more.
Sean Byrnes (Flurry, Inc.)
Flurry provides an analytics and advertising platform for smartphone applications. Every month we track over 20 billion sessions across over 330 million devices. This talk will go over the Hadoop and HBase architecture we run and the challenges we face managing a massively growing data set. Read more.
James Phillips (Couchbase, Inc.)
Average rating: **...
(2.00, 2 ratings)
Mobile devices offer boundless opportunities for collection and presentation of temporally- and spatially-relevant data. But there are obstacles: intermittent connectivity as well as processing, storage and other constraints. Featuring real-world apps, this session covers device data collection; device-device and device-cloud data synchronization; and data aggregation and analysis in the cloud. Read more.
Sponsored Session, Ballroom G
Vineet Tyagi (Impetus Technologies)
This session covers the costs involved in Big Data projects, both the apparent and the hidden ones. It will also discuss how to build a Big Data solution with a lower cost “per TB of data managed and analyzed”. Read more.
Business & Industry, Ballroom AB
Robbie Allen (Automated Insights, Inc.)
Average rating: ****.
(4.00, 1 rating)
The ultimate utility of Big Data is transforming it into Big Insights. Charts, graphs, and tables of aggregated data are useful but still require interpretation by the end user. With advances in linguistic algorithms and data processing it is now possible to derive meaningful insights from data and present them in digestible narrative content. Read more.
Ron Bodkin (Think Big Analytics), Kumar Palaniappan (NetApp)
NetApp collects 250 TB per year of unstructured data from devices that phone home. They need to be able to do ad hoc analysis and build predictive models for device support and cross-sales. We discuss our experiences building a Big Data system with NetApp using Hadoop and HBase to improve customer service, drive sales and develop better products. Read more.
Marc Smith (Social Media Research Foundation)
Average rating: *****
(5.00, 1 rating)
Maps of the complex connections that form when people link, like, reply, rate, review, favorite, friend, follow, edit, and mention one another can reveal important trends. It is possible to create network maps with free and open tools that identify key people and sub-groups in any social media population with just a few key clicks. Can you make a pie chart? You can now make a network chart. Read more.


  • EMC
  • Microsoft
  • HPCC Systems™ from LexisNexis® Risk Solutions
  • MarkLogic
  • Shared Learning Collaborative
  • Cloudera
  • Digital Reasoning Systems
  • Pentaho
  • Rackspace Hosting
  • Teradata Aster
  • VMware
  • IBM
  • NetApp
  • Oracle
  • 1010data
  • 10gen
  • Acxiom
  • Amazon Web Services
  • Calpont
  • Cisco
  • Couchbase
  • Cray
  • Datameer
  • DataSift
  • DataStax
  • Esri
  • Facebook
  • Feedzai
  • Hadapt
  • Hortonworks
  • Impetus
  • Jaspersoft
  • Karmasphere
  • Lucid Imagination
  • MapR Technologies
  • Pervasive
  • Platform Computing
  • Revolution Analytics
  • Scaleout Software
  • Skytree, Inc.
  • Splunk
  • Tableau Software
  • Talend

For information on exhibition and sponsorship opportunities at the conference, contact Susan Stewart at

For information on trade opportunities with O'Reilly conferences, contact Kathy Yu at mediapartners

For media-related inquiries, contact Maureen Jennings at
