• 10gen
  • DataStax, Inc.
  • Dell
  • Google
  • Lexis Nexis
  • Oracle
  • VMware
  • Percona

Sponsorship Opportunities

For information on exhibition and sponsorship opportunities at the convention, contact Sharon Cordesse at scordesse@oreilly.com

Download the OSCON Data Sponsor/Exhibitor Prospectus

Media Partner Opportunities

For information on trade opportunities with O'Reilly conferences or contact mediapartners@ oreilly.com

Press and Media

For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com

OSCON Bulletin

To stay abreast of convention news and announcements, please sign up for the OSCON email bulletin (login required)

Contact Us

View a complete list of OSCON contacts

OSCON: Data 2011 Speakers

New speakers are being confirmed regularly. Please check back often to see the latest additions to the program.

Search Speakers

Found result
Brian Aker
Brian Aker (HP)

Brian has spent his life working on the details of how to build and scale out
systems. He is currently working on a new MicroKernel designed MySQL
called Drizzle and is building the plumbing required for a new generation of
large scale computer deployment. He also spends time working on Apache
Modules, Memcached, and Gearman.

In the past, he has been involved with projects for the Army Engineer Corps, The VirtualHospital, Splunk, MySQL, Slashdot, and was a Distinguished Engineer at Sun Microsystems. He calls Seattle his home since that is where his dog Rosalynd is.

Andrew Aksyonoff (Sphinx Technologies)

Andrew is the author of Sphinx, an open-source full-text search engine designed especially for indexing databases.

Siddharth Anand
Siddharth Anand (Agari)

Sid Anand is a software architect/developer with deep experience building and scaling data infrastructure at high-traffic web sites. He currently serves as the Data Architect for Agari, a rising email security company. Prior to joining Agari, Sid held several technical and leadership positions including LinkedIn’s Search Architect, Netflix’s Cloud Data Architect, Etsy’s VP of Engineering, and several technical roles at eBay. He earned his BS and MS degrees in CS from Cornell University, where Sid focused on Distributed Systems.

Outside of work, Sid is a maintainer/committer on Apache Airflow and a co-chair for QCon, a global family of popular practitioner-driven technical conferences.

Scott Andreas
Scott Andreas (Boundary Inc.)

Scott Andreas is an Engineer at Boundary, Inc. hell-bent on quality, efficiency, and performance in highly-concurrent network programming and asynchronous distributed systems.

With a background in Java, Scala, and Ruby and new production deployments in Erlang, his current work involves designing, implementing, and deploying event processing systems with a team of engineers to create a real-time distributed network analytics platform. Previously at Urban Airship, Andreas worked with a team to design and implement a scalable mobile messaging platform backed by a clustered service designed to serve millions of concurrent clients on a handful of commodity servers.

Red-lining services under development, measuring results, and quantifying improvements in terms of infrastructure cost and business value is a favorite past-time.

Robin Anil
Robin Anil (Google)

Robin is a Committer at the Apache Software Foundation where he works with the Mahout Machine Learning community. He is also a co-author of “Mahout in Action” by Manning Publications, a book on how Mahout is used to perform Machine learning on Terabytes of data with ease.

He used to be a Tech Lead on the ML infrastructure for Minekey Inc, a valley based startup which focussing on recommendations and behavioral targeting for publisher content. He was introduced to the newly born Mahout community through the Google Summer of Code program while he was a dual-degree student at IIT Kharagpur. Since then, he has been trying to model machine learning algorithms in to the Map/Reduce format and have successfully merged his Complementary Naive Bayes... Read More.

Dhruv  Bansal
Dhruv Bansal (Infochimps)

Dhruv Bansal cofounded Infochimps in 2008 after realizing that there was no single place to easily find updated data online. He is currently pursuing his PhD in Physics from the University of Texas at Austin.

Josh Berkus (PostgreSQL Experts)

Josh Berkus has been a member of the PostgreSQL Core Team since 2003 and has been working as a database consultant since 1998. His consulting specialties include:

  • Database design
  • Performance tuning and troubleshooting</li
  • Stored procedure debugging</li
  • Training</li
  • Reporting and BI
  • ETL and data cleaning

In addition to his PostgreSQL skills, Josh also:

  • Manages projects
  • Advises companies on working with open source communities
  • Speaks at 6 to 12 conferences a year, worldwide
  • Is an expert in SQL Server 2000
  • Co-founded the San Francisco PostgreSQL User Group

Josh’s work experience includes 8 years of consulting on database applications, primarily building applications for the legal and HR industries and performance tuning. He was also head of Sun Microsystem’s PosgtreSQL... Read More.

Jeremy Bingham
Jeremy Bingham (Dailykos.com)

Jeremy Bingham is the technical director for DailyKos.com, a progressive Democratic weblog that is one of the largest sites in the political blogosphere.

He has worked for Daily Kos for the last seven years handling the technology side of the operation, and was a co-founder of SB Nation, a network of sports-related sites.

Benjamin Black
Benjamin Black (Boundary)

Mr. Black most recently cofounded and was CEO of Boundary. He a
recognized expert in large-scale technology infrastructure design,
implementation,and operation. Previously, he was an engineering
director in the ECN group at Microsoft Windows Live, where he helped
build one of the largest content delivery networks in the world. As
the manager of website engineering at Amazon, he lead the design and
implementation of a new, layer 3 network for Amazon’s production
network. As principal engineer for Amazon infrastructure, he
co-authored the first documents on what later became EC2. As senior
manager for information security, he owned and successfully delivered
Sarbanes-Oxley and PCI DSS compliance for Amazon infrastructure,
including the... Read More.

Andy Blyler
Andy Blyler (Barracuda Networks)

Andy is a programmer, system administrator, entrepreneur, musician, private pilot, & optimist. He is the lead software engineer on the Barracuda Networks Backup product.

Roger Bodamer
Roger Bodamer (10gen)

Roger heads the West Coast Operations for 10gen, the company that
develops and supports the open source database MongoDB. He has over 20 years of experience of building and delivering great and innovative
products to market and has deep expertise and knowledge of database
architectures and internals. Roger holds several patents for database
and middleware technology. His experience leading product development
and engineering teams includes 12 years with Oracle’s Database and
Application Server development organization where he pioneered
products that delivered heterogeneous interoperability, as well as
several years as SVP of product operations and engineering at Apple’s
PowerSchool division. Roger also held leadership positions at OuterBay
and Efficient Frontier. He earned a Bachelor’s degree... Read More.

Adrian Cockcroft
Adrian Cockcroft (Amazon Web Services)

Adrian Cockcroft is vice president of cloud architecture strategy at Amazon Web Services, where he focuses on the needs of cloud-native and “all-in” customers and leads the AWS open source community development program. Adrian has had a long career working at the leading edge of technology and is fascinated by what happens next. He started out as a developer in the UK before joining Sun Microsystems. He was a founding member of eBay Research Labs; worked at Netflix, where he directed a team working on personalization algorithms, served as a cloud architect, helping teams scale and migrate to AWS, and led the Netflix open source program; and worked at VC firm Battery Ventures, where he promoted new ideas around DevOps, microservices, the cloud,... Read More.

Jean-Daniel Cryans
Jean-Daniel Cryans (Cloudera)

Jean-Daniel is a Database Engineer at StumbleUpon. When he’s not developing HBase or supporting its usage inside the company, he’s helping others with the Hadoop stack. Jean-Daniel has been a commiter on the Apache HBase project since 2008.

Justin Dow
Justin Dow (Mozilla Corporation)

Justin, most people know him as “jabba” a long time nickname and is IRC nick, works as a Systems Administrator for Mozilla, managing the Infrastructure team. In his spare time he likes tinkering on classic cars and spending time with his family.

Aurelian Dumitru (Dell, Inc)

Aurelian “A.D.” Dumitru is the Dell | Hadoop chief architect. In that role he is responsible for all architecture decisions and long-term strategy for Hadoop. A.D. has over 20 years of experience. He has been with Dell for more than 11 years in various engineering, architecture, and management positions. His background is in hyperscale massively parallel compute systems. His interests are in automated process control, intelligent processes, and machine learning. Over the years he has authored or made significant contributions to more than 20 patent applications, from RFID and automated process controls to software security and mathematical algorithms.

Ted Dunning
Ted Dunning (MapR)

Ted Dunning is chief application architect at MapR. He’s also a board member for the Apache Software Foundation, a PMC member and committer on many Apache projects, and a mentor for various incubator projects. Ted has years of experience with machine learning and other big data solutions across a range of sectors. He has contributed to clustering, classification, and matrix decomposition algorithms in Mahout and to the new Mahout Math library and designed the t-digest algorithm used in several open source projects and by a variety of companies. Previously, Ted was chief architect behind the MusicMatch (now Yahoo Music) and Veoh recommendation systems and built fraud-detection systems for ID Analytics (LifeLock). Ted has coauthored a number of books on big data topics, including several... Read More.

Ted Dziuba
Ted Dziuba (eBay Local/Milo.com)

Co-founder of Milo.com, a local shopping search engine, which was acquired by eBay in December 2010. Now lead engineer for eBay Local.

Jonathan Ellis
Jonathan Ellis (DataStax)

Jonathan is CTO and co-founder at DataStax (formerly Riptano). Prior to DataStax, Jonathan worked extensively with Apache Cassandra while employed at Racksace. Prior to Rackspace, Jonathan built a multi-petabyte, scalable storage system based on Reed-Solomon encoding for backup provider Mozy. In addition to his work with DataStax, Jonathan is project chair of Apache Cassandra.

Greg Fodor
Greg Fodor (Etsy)

Greg Fodor is currently a engineer on Etsy’s “data wranglers” team, responsible for building products around ‘big data’ at Etsy.

Bill Fox J.D., M.A.
Bill Fox J.D., M.A. (LexisNexis)

Bill Fox, JD, MA, is Senior Director of Healthcare at LexisNexis Risk Solutions. Prior to LexisNexis, Mr. Fox was Vice President of Program Integrity at MAXIMUS Federal. He was a Partner at the law firm of Post and Schell, and was Deputy Chief of Economic and Cyber Crime at the Philadelphia District Attorney’s office and a Special Assistant U.S. Attorney in Philadelphia. Mr. Fox is a Senior Fellow at the Jefferson School of Population Health in Philadelphia and a frequent speaker on topics of healthcare fraud, compliance and quality. He has also served as a tactical strength and conditioning consultant to the U.S. Navy SEALs.

LexisNexis Risk Healthcare Solutions develops and implements innovative data driven technology solutions across the healthcare continuum with a focus... Read More.

Brendan Gregg
Brendan Gregg (Netflix)

Brendan Gregg is a senior performance architect at Netflix, where he does large-scale computer performance design, evaluation, analysis, and tuning. Previously, Brendan worked as a performance and kernel engineer. He has created performance analysis tools included in multiple operating systems, as well as visualizations and methodologies. Brendan is the author of Systems Performance. He received the USENIX LISA Award for outstanding achievement in system administration.

Jeff Hamann
Jeff Hamann (Forest Informatics)

Dr. Jeff Hamann, president of Forest Informatics is obsessed with, analyzing data from, developing tools for, and presenting collaborative, geek-friendly stories and solutions for forests, foresters, and forest owners.

He has developed econometrics and forest simulation packages for the R statistical computing environment, has authored a book for the Use-R series from Springer called "Forest Analytics with R’’, and loves to combine the incongruent worlds of tech, humans, and everything that separates them.

He holds a Bachelors of Science in Forestry from Humboldt State University, and an MS and PhD, in Forest Biometrics and Forest Engineering from Oregon State University.

His passions include Forest Management, Operations Research, Computer Science, Geographic Information Systems, Financial Analysis, and of course Open Source Software.

Tom Hanlon
Tom Hanlon (Functional Media)

Tom Hanlon is a senior instructor at Functional Media, where he delivers courses on the wonders of the Hadoop ecosystem. Before beginning his relationship with Hadoop and large distributed data, he had a happy and lengthy relationship with MySQL with a focus on web operations. He has been a trainer for MySQL, Sun, and Percona.

Russell Hanson
Russell Hanson (RSI/Harvard/TCIN)

Russell Hanson was born in Palo Alto, CA on May 2, 1981 and grew up in California and Bloomington, IN. From 1999—2003 he studied in Portland, Oregon at Reed College, completing a bachelor’s degree in Physics. In 2005 he received an M.S. in Biology from Georgia Tech and his PhD studies were in Chemistry at Boston University. His research interests are varied and span machine learning and statistical learning, chemical engineering, biophysical and medicinal chemistry, cancer and cancer biology, quantitative finance, algorithmic trading and technology, synthetic biology and computational biology, quantum computation and information, natural language processing, and mathematical logic and recursion theory. Past academic appointments include Harvard, MIT, and the Technical University of Berlin. An entrepreneur and businessman, in 2008 he founded SciColab.com... Read More.

Harry Heymann (foursquare)

Harry Heymann is the Engineering Lead for foursquare. Before joining foursquare, Harry spent five years at Google where he worked on Google Payments, Dodgeball and various backend advertising systems. Previously, he held roles at Microsoft and Intel. Harry has a bachelor’s degree in engineering from Carnegie Mellon University. In addition to pursuing his obvious interest in mobile technologies, Harry is currently obsessed with exploring the Scala programming language.

Noel Hidalgo
Noel Hidalgo (World Economic Forum)

Noel Hidalgo works at the intersection of politics, community, technology and art. Advocate for free and open government data, open source software, open communities, free culture, dutch utility bicycles and transparent government; he is an established global leader in progressive political and technology communities; Eagle Scout. He is a member of the Royal Society of Arts (UK), a member of the British Council’s Transatlantic Network 2020, and on the board of advisors for Digital Democracy, a non-profit in New York City. He is one of three co-organizers of New Yorkʼs only meetup dedicated to open government, the Open New York Forum.

Bradley Holt
Bradley Holt (Found Line)

Bradley Holt is a web developer, entrepreneur, free/open source software contributor, community facilitator, speaker, and an author. He is the co-founder of Found Line, a creative studio with capabilities in web development, web design, and print design. He is a minor contributor of source code, bug reports, or documentation to several free/open source projects including PHP and Zend Framework. He organizes the Burlington, Vermont PHP Users Group and is a co-organizer of Vermont Code Camp. He has spoken at SXSW Interactive, OSCON, OSCON Data, the jQuery Conference, ZendCon, and CouchConf. He is the author of Writing and Querying MapReduce Views in CouchDB and Scaling CouchDB, both published by O’Reilly Media.

He blogs at bradley-holt.com... Read More.

Winnie Hsia (Infochimps)

Winnie is the Community Manager for Infochimps. She has been a career long advocate of community building and was the pioneer for social media initiatives at Whole Foods Market and Infosys. She has been quoted in publications including The Wall Street Journal, The Financial Times, Business Week and Mashable. She won the Shorty Award for Best Brand for Whole Foods Market in 2010.

In her spare time, she runs Austin’s premier aerial arts collective, Sky Candy and works to make the world a more connected and happier place.

John Hugg
John Hugg (VoltDB)

John Hugg has spent his entire career working with databases and information management. In 2008, John was lured away from a PhD program by Mike Stonebraker to work on what became VoltDB. As the first engineer on the product, he liaised with a team of academics at MIT, Yale, and Brown who were building H-Store, VoltDB’s research prototype. Then John helped build the world-class engineering team at VoltDB to continue development of the open source and commercial products.

Philipp Janert
Philipp Janert (Principal Value, LLC)

After previous careers in physics and software development, Philipp K. Janert currently provides consulting services for data analysis, algorithm development, and mathematical modeling.

He is the author of two books on data analysis: “Data Analysis with Open Source Tools” (O’Reilly) and “Gnuplot in Action – Understanding Data with Graphs” (Manning Publications).

He holds a Ph.D. in theoretical physics from the University of Washington. Visit his company website at www.principal-value.com

Haidong Ji (Self-employed)

Haidong “Alex” Ji is a hacker who loves to tinker and automate things. During the day, he works with databases, Python, C#, and PowerShell. At night he hacks with whatever tools are necessary, Python and web scraping of late to support a charity. In between, he enjoys spending time with his family and two wonderful dogs.

Richard Kelm
Richard Kelm (Sphinx Search)

From humble beginnings focused on storing and maintaining data in MySQL. Richard, matriculated to Sphinx Search where he’s learned the value of synthesizing mountains of text and attributes into meaning for individuals to consume.

Graylin Kim (New York State Senate)

Graylin is a recent graduate of Rensselaer Polytechnic Institute and a member of the Rensselaer Center for Open Source Software at RPI. He officially joined the development team at the NY Senate CIO in June of 2011 and has been working informally with them since the summer of 2010.

Aaron Kimball
Aaron Kimball (Zymergen, Inc.)

Aaron Kimball is the CTO of Zymergen, Inc. Zymergen uses high-throughput techniques, combined with big data analysis, to improve genetic strains for microbial chemical production. Aaron has been working with Hadoop since 2007. In 2008 he was Cloudera’s first employee, where he wrote Apache Sqoop and MRUnit, as well as performed a lot of Hadoop training. In 2010, Aaron founded WibiData and assumed the role of chief architect. WibiData helps organizations build big data applications. Aaron holds a BS in computer science from Cornell University and an MS in computer science from the University of Washington.

Jeffrey Kirkell (Project Management Institute)

Jeff Kirkell is a technologist, geek, overall agile fanboy, and aspiring cynic who spends his time formulating ways to take the best bits of open source and proprietary technologies to create systems that wow the business side with capabilities. Considering himself more an artist than engineer, and priding himself in having installed more data and application systems on his laptop than anyone he knows, Jeff usually in the Rich Internet Application space. Other interest include Triple-Stores, Key-Value stores, and various SemWeb goodness, in addition to destroying systems for testing purposes and forcing wide spread open source adoption.

Rusty Klophaus
Rusty Klophaus (Basho Technologies)

Rusty Klophaus is a Senior Software Engineer at Basho Technologies where he focuses on building distributed, fault-tolerant applications to store and retrieve Big Data. He is currently on the core engineering team of Riak Search and is the lead engineer designing Secondary Indices. Before joining Basho, Rusty launched an Erlang-based startup; before that he managed multi-million dollar technology products and guided multiple project teams at an Internet advertising company based in New York City. When he’s not hacking, Rusty plays guitar and organizes the Hacker News Meetup Group of Washington, DC.

Jay Kreps
Jay Kreps (Confluent)

Jay Kreps is the cofounder and CEO of Confluent, a company focused on Apache Kafka. Previously, Jay was one of the primary architects for LinkedIn, where he focused on data infrastructure and data-driven products. He was among the original authors of a number of open source projects in the scalable data systems space, including Voldemort (a key-value store), Azkaban, Kafka (a distributed messaging system), and Samza (a stream processing system).

Patrick Lightbody
Patrick Lightbody (New Relic)

Patrick runs product management for New Relic, the leader in software analytics. Prior to that, Patrick started two startups in the website monitoring and load testing space, which were acquired by Gomez and Neustar, respectively.

Patrick is an avid open source contributor, having founded OpenQA, created Selenium Remote Control, and co-created Struts 2.

Patrick has held management and software engineering positions with Jive Software, Spoke Software, and Cisco Systems.

Ryan Lowe
Ryan Lowe (Square)

Ryan is an Engineering Manager within the Platform Engineering Organization at Square, where his team manages thousands of instances of MySQL, PostgreSQL, Redis, and MongoDB. Before joining Square, Ryan helped build and scale some of the largest web, social, gaming, and mobile properties as a Consultant at Percona. As an author, speaker, and active contributor to F/OSS projects, his expertise extends to a variety of data stores, cloud and big data, and beyond.

Kate Matsudaira
Kate Matsudaira (SEOmoz)

Kate Matsudaira fills the role as Vice President of Engineering at SEOmoz where she is responsible for managing the core technology team. Prior to SEOmoz, she filled the role of VP Engineering at another startup, Delve Networks (acquired by Limelight). At Delve she helped create and monetize a very large distributed system used for online video delivery and video search. Prior to that she worked in at other leading technology companies like Amazon.com, Microsoft, and Sun Microsystems.
Kate has extensive knowledge of building large scale distributed web systems, web services, and search. Kate has a B.S. in Computer Science from Harvey Mudd College, and has completed graduate work at the University of Washington in both Business and Computer Science (M.S.).

Yoshinori Matsunobu

Yoshinori Matsunobu is a database and infrastructure architect at DeNA (http://www.dena.jp/en/index.html), living in Tokyo. Yoshinori’s primary responsibility at DeNA is to make our database infrastructure more reliable, faster and more scalable. Before joining DeNA, Yoshinori worked at MySQL/Sun/Oracle as a lead consultant in APAC for four years. Yoshinori has written eight MySQL related technical books so far and has published technical articles about MySQL, Linux, and Java for a monthly database magazine since 2004.

Dwight Merriman
Dwight Merriman (10gen)

Dwight is CEO of 10gen which sponsors the open source MongoDB project. Dwight is an active MongoDB committer.

Dwight was cofounder of DoubleClick and CTO of DoubleClick for 10 years. Dwight is the original architect of the DoubleClick DART ad serving system.

Bruce Momjian
Bruce Momjian (EnterpriseDB)

Bruce Momjian is a co-founder of the PostgreSQL Global Development Group, and has worked on PostgreSQL since 1996. He is the author of PostgreSQL: Introduction and Concepts, published by Addison-Wesley. Bruce is employed by EnterpriseDB. Previously, he was employed by SRA Japan and Great Bridge LLC, both PostgreSQL support companies. He has spoken at many international open-source conferences. Prior to his involvement with PostgreSQL, Bruce worked as a consultant, developing custom database applications for some of the world’s largest law firms. As an academic, Bruce was a high school computer science teacher for five years, holds a Masters in Education, and is currently an adjunct professor at Drexel University.

Gleicon Moraes
Gleicon Moraes (luc.id)

Gleicon Moraes is director of data engineering at luc.id. Gleicon loves infrastructure for data, moving large volumes through distributed messaging systems, and databases. He uses Python, Go, and Erlang and focuses on distributed systems, nonrelational databases, and OSS.

Arun Murthy
Arun Murthy (Hortonworks Inc.)

Arun is the lead of the next generation MapReduce project in Apache Hadoop. Arun has been a full-time contributor to Apache Hadoop since its inception in 2006. He is a long-time committer and member of the Apache Hadoop PMC and jointly holds the current world sorting record using Apache Hadoop. Prior to co-founding Hortonworks, Arun was responsible for all MapReduce code and configuration deployed across the 42,000+ servers at Yahoo!. In essence, he was responsible for running Apache Hadoop’s MapReduce as a service for Yahoo!. Follow Arun on Twitter: @acmurthy.
He is directly responsible for every bit of code and configuration of Map-Reduce deployed at over 40,000 machines running Apache Hadoop at Yahoo. He jointly holds the world-record for sorting data using Hadoop... Read More.

Peter Neubauer
Peter Neubauer (Neo Technology)

Peter is co-founder of a number of popular Open Source projects such
as Neo4j, Tinkerpop, OPS4J and Qi4j. Peter loves connecting things,
writing novel prototypes and throwing together new ideas and projects
around graphs and society-scale innovation. Right now, Peter is
concentrating on turning Open Source projects into profitable
enterprises at Neo Technology, the company sponsoring the development
of Neo4j, the Graph Database. Also, Peter is a Mentor helping startups
at Startupbootcamp Copenhagen and organizing events like
http://www.thoughtmade.com and TEDx Öresund.

Sarah Novotny
Sarah Novotny (Google)

Sarah Novotny is an open source community manager at Google. Sarah’s recent technological focuses include open source, cloud and utility computing, infrastructure automation, and data (big and small; relational and nonrelational). Her calling lies in sharing her excitement about technology and coalescing a group around a consistent vision. She loves meeting people and rapidly assessing what information she needs to impart in order to draw them into the vision or story that she has to share. Sarah enjoys bridging the gap between the business world and the tech world. Her background includes leading operational IT and development teams and external-facing work in biz dev, sales engineering, customer support, and, of course, public speaking.

Owen O'Malley
Owen O'Malley (Cloudera)

Owen O’Malley is a co-founder and Technical Fellow at Cloudera, formerly Hortonworks. Cloudera’s software includes Hadoop and the large ecosystem of big data tools that enterprises need for their data analytics. Owen has been working on Hadoop since the beginning of 2006 at Yahoo, was the first committer added to the project, and used Hadoop to set the Gray sort benchmark in 2008 and 2009. In the last 10 years, he has been the architect of MapReduce, Security, and now Hive. Recently he has been driving the development of the ORC file format and adding ACID transactions to Hive.

Erik Onnen (Urban Airship)

Erik is a Senior Engineer and Analytics team lead at Urban Airship, the leading provider of Push, In-App Purchase and Subscription services for mobile applications. He specializes in distributed systems at scale. Prior to joining Urban Airship, Erik was a platform lead for Jive Software where he lead the development of the activity and recommendation engine services.

David Pacheco
David Pacheco (Joyent)

David Pacheco is the lead engineer of Joyent’s Introspection Team, which develops Cloud Analytics and other tools for observing software in the cloud. Previously a member of Sun’s Fishworks team, David worked on several areas of the Sun Storage 7000 series of appliances including remote replication, fault management, and flash device support.

Josh Patterson
Josh Patterson (Cloudera)

Master’s Thesis: self-organizing mesh networks
Published in IAAI-09: TinyTermite: A Secure Routing Algorithm

Conceived, built, and led Hadoop integration for the openPDC project at TVA (Smartgrid stuff). Led small team which designed classification techniques for timeseries and Map Reduce. Open source work at http://openpdc.codeplex.com

Now: Solutions Architect at Cloudera

Noah Pepper
Noah Pepper (Lucky Sort)

CEO of Lucky Sort, a Portland based startup building an intuitive and powerful platform for realtime text analysis using visual analytics and machine learning.

Tom Quisel
Tom Quisel (OkCupid)

Tom Quisel is OkCupid’s Director of Technology. For the past four years, he’s been devoted to helping millions across the world find love through smarter algorithms and better data analysis. Currently, Tom is giving his all learning to lead the phenomenally talented technical team at OkCupid. The path is as littered with pitfalls as it is with opportunities, but he wouldn’t trade it for any other position in the world. Tom has a B.S. in Computer Science from Carnegie Mellon where he focused on Statistics and Machine learning. Tom also studied bioinformatics at EPFL in Switzerland, where he co-authored a paper on extracting meaningful information from large but noisy biological datasets. In his free time Tom loves to hike the hills of upstate New... Read More.

Inaam Rana (Oracle)

Inaam Rana is a core member of the InnoDB development team at MySQL. He joined the InnoDB team in April 2007. His area of focus has been performance and scalability of the InnoDB storage engine. He has been mostly working on the buffer cache layer, low level concurrency and OS layer. Inaam has had a chance to contribute to the most of the scalability features that were included in the InnoDB plugin. In his previous work experience he had been involved in development of IBM’s DB2 LUW and PostgreSQL database engines. Inaam currently lives in Toronto, Canada.

Krishna Sankar
Krishna Sankar (Volvo Cars)

Krishna Sankar is a consulting data scientist working on retail analytics, social media data science, and forays into deep learning, as well as codeveloping the DeepLearnR package interfacing R over TensorFlow/Skflow. Previously, Krishna was a chief data scientist at Blackarrow.tv, where he focused on optimizing user experience via inference, intelligence, and interfaces. Earlier stints include principal architect/data scientist at Tata America Intl., director of data science at a bioinformatics startup, and distinguished engineer at Cisco. He is a frequent speaker at conferences, including Spark Summit, Spark Camp, OSCON, PyCon, and PyData, on topics such as predicting NFL winners, Spark, data science, machine learning, and social media analysis, as well as a guest lecturer at the Naval... Read More.

Theo Schlossnagle
Theo Schlossnagle (Circonus)

Theo Schlossnagle is a Founder and CEO at Circonus where he develops new technology to help engineers better understand the systems they build and operate. He is the architect of the highly scalable Ecelerity mail transport agent. Theo is a participant in various open source communities including OpenSolaris, Linux, Apache, PostgreSQL, perl, and many others. He is a published author in the area of scalability and distributed systems as well as a veteran speaker in the open source conference circuit.

Theo founded several successful startups as engineering focused organizations including: OmniTI, Circonus, Message Systems and Fontdeck.

Jonathan Seidman
Jonathan Seidman (Orbitz Worldwide)

Jonathan has spent more than 15 years as a software developer, with a focus in the last few years on processing large data sets using tools such as Hadoop. Currently, Jonathan is a Lead Engineer on the Business Intelligence/Big Data team at Orbitz Worldwide. Jonathan is also a co-founder and organizer of the Chicago Hadoop User Group and founder of the Chicago Big Data User Group.

Corey Shields (Mozilla Corporation)

Corey is a manager for Mozilla’s IT Systems team. He has long been an open source advocate, since first being introduced to Slackware on an Infomagick CD in the mid 90’s.

Previous to Mozilla, Corey has worked as a Unix supercomputing administrator for Indiana University and as a systems manager for the Open Source Lab at Oregon State University.

Benoit Sigoure
Benoit Sigoure (StumbleUpon, Inc.)

Benoit is a software engineer with a strong UNIXy/Linux background. He specializes in designing, writing & running large-scale distributed serving systems that serve millions of users. He has a deep understanding of the entire technology stack (including Google’s), from on-wire protocols and low-level implementation details all the way up to high-level designs used in high-availability distributed systems (both in software and in the datacenter).

Benoit designed and implemented OpenTSDB, the open-source, highly scalable, distributed, monitoring system.

Adam Silberstein
Adam Silberstein (Yahoo!)

Adam Silberstein is a Research Scientist in the Systems group at Yahoo! Research. His main area of interest is large-scale data management, including work in online serving systems, batch processing systems, and management of social data. His work has appeared in top database conferences such as SIGMOD, VLDB and ICDE. Prior to joining Yahoo!, Adam completed his Ph.D. at Duke University in 2007.

Nicolas Spiegelberg (Facebook)

Nicolas Spiegelberg is a storage engineer in the Facebook messaging team. He helped implement the HBase storage solution for Facebook Messages from design to deployment. Additionally, Nicolas is an HBase committer and PMC, who has contributed many critical features such as HDFS data reliability, Bloom Filters, and an enhanced compaction algorithm.

UA-Huntsville : Masters in Computer Engineering

Bradford Stephens
Bradford Stephens (roboticprofit)

Bradford is the founder and CEO of Drawn to Scale, creators of the
Spire database. Spire is a SQL database built on Hadoop and HBase,
similar to Google F1. Drawn to Scale has customers powering large web
apps, mobile infrastructures, telecoms, social networks, and more. A
long-time user of Hadoop and HBase, Bradford has built large
infrastructures at various startups and enterprises, and worked on
Microsoft SQL Server. He holds degrees in Computer Science and
Political Science, and spent several years as a Campaign Manager in
politics at the Presidential and U.S. House levels.

Homer Strong (Lucky Sort)

Homer Strong is a data hacker in Portland. His background is in statistics, but he knows how to wrangle a unix terminal too.

Calvin Sun
Calvin Sun (Twitter)

Calvin Sun is the senior engineering manager at Twitter, managing the MySQL development. Prior to that, he was managing the InnoDB team at Oracle. Calvin was a development manager at MySQL from 2006 to 2008, managing MySQL storage engines and 3rd party storage engine vendors relationships. He has over 15+ years of database development experience, mostly on various storage engines.

Lars Thalmann
Lars Thalmann (Oracle)

Dr. Lars Thalmann is the development director for MySQL replication, backup, and connectors. He is responsible for the strategy and development of these features and leads the corresponding engineering teams. Thalmann has worked with MySQL development since 2001, when he started as a MySQL Cluster developer. More recently, he has driven the creation and development of the MySQL backup feature, has guided the evolution of MySQL replication since 2004, and has been a key player in the development of MySQL Cluster replication. Thalmann holds a doctorate in Computer Science from Uppsala University, Sweden. He is one of the authors of the “MySQL High Availability” book.

Laura Thomson
Laura Thomson (Mozilla Corporation)

Laura Thomson is Senior Director of Engineering, Firefox Engineering Operations at Mozilla, and is on the board of ISRG, the non-profit behind Let’s Encrypt. Laura has worked in various engineering roles at Mozilla for the last ten years, and previously worked in jobs including Principal/VP at OmniTI, engineer, consultant, and computer science academic. She has written several books on various open source technologies, including “PHP and MySQL Web Development” (with Luke Welling, her husband). Laura is from Melbourne, Australia, but now lives with her family on a rustic horse farm in Maryland, USA, where she relaxes by indulging in manual labor and other rural pursuits.

Robert Treat
Robert Treat (OmniTI)

With more than fifteen years of experience building database backed, internet based systems at multiple fortune 500 companies, Robert is now CEO of OmniTI, a technical services firm focused on providing web application development and infrastructure management at scale. Author and speaker at conferences worldwide, Robert is a recognized expert within the industry on topics such as Open Source, databases, and managing operations at scale. He occasionally blogs at http://xzilla.net.

Andrew Turner
Andrew Turner (GeoIQ)

Andrew Turner is the CTO at GeoIQ, the company behind GeoCommons, a geospatial visualization and analytics platform. He co-founded Mapufacture, a personalized geospatial search and aggregation system that was acquired by FortiusOne in August 2008 in order to combine real-time feeds with large GIS datasets. Andrew is focused on collaboration and user-generated content around location and time. He is actively involved in open-data projects such as OpenStreetMap and VoteReport, as well as open-source projects like Mapstraction and GeoPress. He regularly speaks at conferences on the benefits of open-source software and geospatial standards to communities and organizations. Andrew wrote the O’Reilly shortcut “Introduction to Neogeography” and “Trends in Where2.0” business report in Spring 2008. He is also published in MacTech and Make magazine on... Read More.

Ramesh Venkataramaiah (Orbitz Worldwide)

Ramesh is a member of the Operations and Engineering Team at Orbitz Worldwide with a focus on analysis of distributed, high availability systems in the travel data domain. His passion is the fusion of distributed, multi-active datacenter infrastructure, parallel processing, and analysis platforms like R for Big Data. In combination with many Machine Learning methodologies, he believes that the next generation of data science and support infrastructure are a must for smooth operations of enterprise data centers and private clouds. He received a Ph. D. in Ocean Engineering from Texas A&M University with a focus on instrumentation and sensor platforms.

Christine White

My research interests are spatial data infrastructure, with emphasis on the study of geography (B.S. and M.S. in Geography from University of South Carolina). I have worked at Esri in Redlands, CA for four years as a consultant, helping large organizations manage and share their spatial data. Last Fall, our company decided to release the Esri Geoportal Server product as 100% open source, under the Apache 2.0 license on sourceforge. Since then, I’ve enjoyed learning about and participating hands-on in the open source community, and being a contributer instead of just a beneficiary of open source solutions.

Tom White
Tom White (Cloudera)

Tom White has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. He is the author of Hadoop: The Definitive Guide for O’Reilly. Previously he worked as an independent consultant specializing in Hadoop, and before that was co-founder and lead developer at Kizoom, a UK mobile applications startup. Tom has a bachelor’s degree in mathematics from the University of Cambridge, and a master’s degree in history and philosophy of science from the Universities of Leeds, UK, and Florence, Italy.

Tom Wilkie
Tom Wilkie (Acunu Ltd)

Tom leads the engineering team at Acunu. He previous worked in a range of engineering roles at both Citrix and XenSource managing key customer projects with major enterprises. As one of the first employees at XenSource he played a key part in developing the XenServer management stack. He holds an MA in Computer Science from Cambridge University where he was also a research assistant.

Jared Williams (New York State Senate)

Jared is part of the Open Government movement started by the New York State Senate. He began as an intern in February of 2010 and was hired on in August of that year. He graduated with a BS in Computer Science from SUNY Albany.

Steve Yegge
Steve Yegge (Google)

Steve Yegge is a software engineer with over twenty years of industry experience. He has worked on everything from embedded systems to distributed systems to e-commerce, online games and much more. Steve has been at Google for the past eight years, where he has worked on Ads and Music systems, semantic code indexing, and cloud computing. Prior to joining Google, Steve was a Senior Engineering Manager at Amazon.com, where he spent nearly seven years leading teams in Developer Tools and Customer Service systems. Steve graduated from the University of Washington with a B.S. in Computer Science.

Ezra Zygmuntowicz (VMware Inc)

Ezra Zygmuntowicz has been a rubyist for 9 years now. Over the course of that time he has written many open source project including Merb(which was merged with rails to help build a better architecture for Rails 3.0), Nanite, redis-rb and many more. He wrote the first book on Rails deployment for the pragmatic programmers and also launched one of the first commercial rails sites at the end of 2004(http://yakima-herald.com).

Ezra has gravitated towards the deployment and tooling of ruby applications. He cofounded Engine Yard in 2006 and has spoken at every single RailsConf except for last year.

Ezra is currently working for VMware helping to build a new cloud operating system that will redefine the way people deploy and manage their development and production... Read More.