I Didn't Know You Could Do All that with Hadoop

Jack Norris (MapR Technologies)
Average rating: ****.
(4.00, 2 ratings)

Hadoop is gaining momentum with most companies having already deployed Hadoop in some fashion or are testing it in the lab. But there are many aspects of Hadoop that are not fully understood and appreciated including – How Hadoop can easily be leveraged by non-programmers, how to use Hadoop to quickly outperform complex models, how to easily integrate Hadoop into existing environments, and the two step process to use legacy applications with Hadoop.

During the session, Ted Dunning will show that while counter intuitive, as data size increases simple algorithms perform better than complex models on small data. This can greatly simplify the deployment and development of Hadoop applications and the talk will include several examples of machine learning deployments across multiple industries.

This session will also cover recent developments that make Hadoop access available to rank-and -file users. This expands access with standard applications to view and manipulate data beyond programmer access.
This session will provide detailed descriptions of the following:

1) Getting data into and out of the Hadoop cluster as quickly as possible
2) Allowing real-time components to easily access cluster data
3) Using well-known and understood standard tools to access cluster data
4) Making Hadoop easier to use and operate
5) Leveraging existing code in map-reduce settings
6) Integrating map-reduce systems into existing analytic systems

Photo of Jack Norris

Jack Norris

MapR Technologies

Jack Norris is the senior vice president of data and applications at MapR Technologies, where he works with leading customers and partners worldwide to drive the understanding and adoption of new applications enabled by data and analytics. With over 25 years of enterprise software experience, he has demonstrated success from identifying new markets to defining new products to launching companies. Jack’s background includes senior executive positions with establishing analytic, virtualization, and storage companies. Jack was an early employee of MapR Technologies and held senior executive roles with EMC, Brio Technology, and Bain and Company.


  • EMC
  • Microsoft
  • HPCC Systems™ from LexisNexis® Risk Solutions
  • MarkLogic
  • Shared Learning Collaborative
  • Cloudera
  • Digital Reasoning Systems
  • Pentaho
  • Rackspace Hosting
  • Teradata Aster
  • VMware
  • IBM
  • NetApp
  • Oracle
  • 1010data
  • 10gen
  • Acxiom
  • Amazon Web Services
  • Calpont
  • Cisco
  • Couchbase
  • Cray
  • Datameer
  • DataSift
  • DataStax
  • Esri
  • Facebook
  • Feedzai
  • Hadapt
  • Hortonworks
  • Impetus
  • Jaspersoft
  • Karmasphere
  • Lucid Imagination
  • MapR Technologies
  • Pervasive
  • Platform Computing
  • Revolution Analytics
  • Scaleout Software
  • Skytree, Inc.
  • Splunk
  • Tableau Software
  • Talend

For information on exhibition and sponsorship opportunities at the conference, contact Susan Stewart at sstewart@oreilly.com.

For information on trade opportunities with O'Reilly conferences contact Kathy Yu at mediapartners

For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com

View a complete list of Strata contacts