Presented By O'Reilly and Cloudera
Make Data Work
March 28–29, 2016: Training
March 29–31, 2016: Conference
San Jose, CA
Ted Dunning

Ted Dunning
Chief Application Architect, MapR

Website | @ted_dunning

Ted Dunning is chief application architect at MapR. He’s also a board member for the Apache Software Foundation; a PMC member and committer of the Apache Mahout, Apache Zookeeper, and Apache Drill projects; and a mentor for various incubator projects. Ted has years of experience with machine learning and other big data solutions across a range of sectors. He’s contributed to clustering, classification, and matrix decomposition algorithms in Mahout and to the new Mahout Math library and designed the t-digest algorithm used in several open source projects and by a variety of companies. Previously, Ted was chief architect behind the MusicMatch (now Yahoo Music) and Veoh recommendation systems and built fraud-detection systems for ID Analytics (LifeLock). Ted has coauthored a number of books on big data topics, including several published by O’Reilly related to machine learning, and has 24 issued patents to date plus a dozen pending. He holds a PhD in computing science from the University of Sheffield. When he’s not doing data science, he plays guitar and mandolin. He also bought the beer at the first Hadoop user group meeting.

Sessions

11:50am–12:30pm Wednesday, 03/30/2016
IoT and Real-time

Location: 210 C/G
Tags: real-time
Ted Dunning (MapR)
Average rating: ***..
(3.78, 9 ratings)
Application messaging isn’t new—solutions include IBM MQ, RabbitMQ, and ActiveMQ. Apache Kafka is a high-performance, high-scalability alternative that integrates well with Hadoop. Can modern distributed messaging systems like Kafka be considered a legacy replacement or is it purely complementary? Ted Dunning outlines Kafka's architectural benefits and tradeoffs to find the answer. Read more.
4:20pm–5:00pm Wednesday, 03/30/2016
Data Innovations

Location: 230 C
Ted Dunning (MapR)
Average rating: ****.
(4.12, 8 ratings)
SQL is normally a very static language that assumes a fixed and well-known schema. Apache Drill breaks these assumptions by restructuring the execution of queries so optimizations and type resolution can be done just in time. This has profound consequences for how applicable SQL is in the big data world. Ted Dunning walks attendees through Drill and explores its implications for big data. Read more.
5:10pm–5:50pm Wednesday, 03/30/2016
Data Innovations

Location: 210 D/H
Tags: real-time
Ted Dunning (MapR)
Average rating: ****.
(4.11, 9 ratings)
Until recently, batch processing has been the standard model for big data. Today, many have shifted to streaming architectures that offer large benefits in simplicity and robustness, but this isn't your father’s complex event processing. Ted Dunning explores the key design techniques used in modern systems, including percolators, replayable queues, state-point queuing, and microarchitectures. Read more.
11:00am–11:40am Thursday, 03/31/2016
Office Hours

Location: Table A (O'Reilly Booth)
Ted Dunning (MapR)
If you have questions about streaming data architectures, come see Ted. He’ll talk about things like the differences between state-oriented and flow-oriented systems, why streaming is critical, and how to practically apply modern streaming architectures to your problems. Read more.