Presented By O’Reilly and Cloudera
Make Data Work
March 5–6, 2018: Training
March 6–8, 2018: Tutorials & Conference
San Jose, CA

About Strata Data Conference

Why You Should Attend | Experience Strata Data Conference | Who You'll Meet | Kudos | Program Committee | Contact »

Strata Data Conference is where cutting-edge science and new business fundamentals intersect—and merge. It's a deep dive into emerging techniques and technologies. You'll dissect case studies, develop new skills through in-depth tutorials, share emerging best practices in data science, and imagine the future.

Formerly known as Strata + Hadoop World, the conference was created in 2012, when O'Reilly and Cloudera brought together their two successful big data conferences.

Program Chairs Doug Cutting (Chief Architect at Cloudera and founder of Apache Hadoop), Ben Lorica (Chief Data Scientist, O'Reilly), and entrepreneur Alistair Croll have created a program that covers the entire range of big data tools and technologies. Strata Data Conference covers current hot topics like AI and machine learning, and focuses on how to implement data strategies.

The data industry is growing fast, and Strata Data Conference has grown right along with it.  We've added new sessions and tracks to reflect challenges that have emerged in the data field—including security, ubiquitous computing, collaboration, reproducibility, new interfaces, emerging architecture, building data teams, machine data—and much more.

Strata is the largest data conference series in the world, yet it’s kept the informal, collegial spirit that makes it one of the best places to connect and collaborate.

Why You Should Attend

Strata Data Conference is where big data's most influential business decision makers, strategists, architects, developers, and analysts gather to shape the future of their businesses and technologies. If you want to tap into the opportunity that big data presents, you want to be there.

  • Be among the first to understand how you can leverage the promise of this huge change, and survive the resulting disruption
  • Find new ways to leverage your data assets across industries and disciplines
  • Learn how to take big data from science project to real business application
  • Discover training, hiring, and career opportunities for data professionals
  • Meet-face-to-face with other innovators and thought leaders

Experience Strata Data Conference

  • Inspiring keynotes and practical, information-rich sessions, tutorials, and training courses exploring the latest advances, case studies, and best practices
  • Networking opportunities with thousands of other business leaders, data professionals, designers, and developers
  • A vibrant "hallway track" for attendees, speakers, journalists, and vendors to debate and discuss important issues
  • Fun evening events, receptions, and more, giving you more face time with attendees and speakers

Who You'll Meet

Strata Data Conference attracts the best minds in business and data: data analysts, data scientists, developers, and other professionals who work with data, including:

  • Business intelligence managers and analysts
  • Business managers, strategists, and decision makers
  • CDOs, CEOs, COOs
  • CIOs, CTOs, enterprise architects
  • Data-driven designers
  • Data engineers
  • Data scientists
  • Developers and database professionals
  • Innovators and entrepreneurs
  • Journalists and press
  • Product managers
  • Researchers and academics
  • VCs and investors
  • VPs or directors of marketing, analytics, or data warehousing

Kudos for Strata Data Conference

One of the most valuable events to advance my career.”
An excellent event focused on the intersection between distributed computing and data science.”
The event speakers, content, and organization were exceptional. This is probably the pinnacle of Big Data conferences to attend for networking and getting the latest updates on ground breaking new technologies in the Big Data space. I will definitely attend this event next year.”
The conference gave me access to leading thinkers in big data and businesses with big data problems. I enjoyed the tutorials and talks, and look forward to experimenting with some of them in the next few months.”
The event was invigorating from a community perspective. Big Data gets so much hype that being with real practitioners felt awesome. It was also nice to see the spectrum of professionals in the space—data scientists, software guys, business users, tool developers, etc. all in one location—sharing knowledge was awesome.”

Program Chairs

Ben Lorica
(@bigdata) is the chief data scientist at O'Reilly Media, Inc. He has applied business intelligence, data mining, machine learning and statistical analysis in a variety of settings including direct marketing, consumer and market research, targeted advertising, text mining, and financial engineering. His background includes stints with an investment management company, internet startups, and financial services.

Doug Cutting
(@cutting) is the founder of numerous successful open source projects, including Lucene, Nutch, Avro, and Hadoop. Doug joined Cloudera in 2009 from Yahoo!, where he was a key member of the team that built and deployed a production Hadoop storage and analysis cluster for mission-critical business analytics. Doug holds a Bachelor’s degree from Stanford University and sits on the Board of the Apache Software Foundation.​

Alistair Croll Alistair Croll
is an entrepreneur with a background in web performance, analytics, cloud computing, and business strategy. In 2001, he co-founded Coradiant (acquired by BMC in 2011) and has since helped launch Rednod, CloudOps, Bitcurrent, Year One Labs, and several other early-stage companies. He works with startups on business acceleration, and advises a number of larger companies on innovation and technology.

A sought-after public speaker on data-driven innovation and the impact of technology on society, Alistair has founded and run a variety of conferences, including Cloud Connect, Bitnorth, and the International Startup Festival. He’s the chair for Strata Data conference. He has written several books on technology and business, including the best-selling Lean Analytics.

Alistair tries to mitigate chronic ADD by writing about far too many things at Solve For Interesting.

Program committee

  • Michael Abbott, Kleiner Perkins Caufield & Byers
  • Joseph Adler, Facebook
  • Dr. Vijay Srinivas Agneeswaran, SapientNitro
  • Parvez Ahammad, BlackThorn Therapeutics
  • David Andrzejewski, Sumo Logic
  • Assaf Araki, Intel
  • Amr Awadallah, Cloudera
  • Roy Ben-Alta, Amazon Web Services
  • Marty Betz, FirstRain Inc.
  • Bill Schmarzo, EMC Consulting
  • Carla Borsoi, 6SensorLabs
  • dhruba borthakur, Facebook
  • Farrah Bostic, The Difference Engine
  • Sarah Catanzaro, Canvas Ventures
  • Evan Chan, Apple
  • Jike Chong, Tsinghua University | Acorns
  • Eli Collins, Cloudera
  • chetan conikee, ShiftLeft
  • Alistair Croll, Solve For Interesting
  • Beau Cronin, Embedding.js
  • Doug Cutting, Cloudera
  • Michael Dauber, Amplify Partners
  • Margaret Dawson, Red Hat
  • Parviz Deyhim, Google
  • Thomas Dinsmore, Cloudera
  • Chris Douglas, Microsoft
  • Helena Edelson, Apple
  • Jana Eggers, Nara Logics
  • Nick Elprin, Domino Data Lab
  • John Foreman, MailChimp
  • Margaret Francis, Exact Target/ CoTweet
  • Michael Hausenblas, Red Hat
  • Amy Heineike, Primer AI
  • Noah Iliinsky, Amazon Web Services
  • Rohit Jain, Esgyn
  • Ricardo Jimenez-Peris, LeanXcale
  • Arun Kejariwal, MZ
  • Daniel Koffler, Rio Tinto Alcan
  • Coco Krumme, Haven | UC Berkeley
  • Haoyuan Li, Alluxio
  • Michael Li, The Data Incubator
  • Mike Loukides, O'Reilly Media
  • Chris Love, Love 2 Dev
  • Mark Madsen, Third Nature
  • Roger Magoulas, O'Reilly Media
  • Dean Malmgren, Datascope Analytics
  • Elissa Murphy, GoDaddy
  • Arun Murthy, Hortonworks Inc.
  • Andrew Musselman, Lucidworks
  • Erich Nachbar, Google
  • Paco Nathan, O'Reilly Media
  • Pamela Peele, UPMC
  • Molly Rector, DataDirect Networks (DDN)
  • Dan Roesch, Roesch & Associates LLC
  • Dmitriy Ryaboy
  • Eric Sammer, Rocana
  • Toru Shimogaki, NTT DATA Corporation
  • Randy Smerik, Osunatech, Inc.
  • M. C. Srivas, Uber
  • Mike Stringer, Datascope Analytics
  • David Talby,
  • Jen van der Meer, Reason Street
  • Dean Wampler, Lightbend
  • Simon Wardley, Leading Edge Forum
  • Chris Wensel, Concurrent, Inc
  • Edd Wilder-James, Google
  • Sharon Wragg, O'Reilly
  • Reynold Xin, Databricks
  • Fangjin Yang, Imply
  • Charles Zedlewski, Cloudera
  • Alice Zheng, Amazon
  • yan zhou, 1964
  • Margit Zwemer, LiquidLandscape