Presented By O'Reilly and Cloudera
Make Data Work
Feb 17–20, 2015 • San Jose, CA

Strata + Hadoop World Speakers

New speakers are added continuously. Please check back to see the latest updates to the program.

Search Speakers

Michael Abbott
Michael Abbott (Stanford University)

Mike Abbott joined Kleiner Perkins Caufield & Byers in 2011 and focuses on investments in the firm’s digital practice, helping entrepreneurs in the social, mobile and cloud computing sectors rapidly scale teams and ventures. Mike... Read More.

Joseph Adler
Joseph Adler (Facebook), @jadler

Joseph Adler has many years of experience in data mining and data analysis at companies including DoubleClick, VeriSign, and LinkedIn. He graduated from MIT with an B.Sc. and M.Eng in Computer Science and Electrical... Read More.

Subutai Ahmad
Subutai Ahmad (Numenta, Inc.), @numenta

Subutai Ahmad is the VP of Research at Numenta, a company focused on Machine Intelligence. Our technology, Hierarchical Temporal Memory (HTM), is a detailed computational framework based on principles of the brain. Our... Read More.

John Akred
John Akred (Silicon Valley Data Science), @BigDataAnalysis

With over 15 years in advanced analytical applications and architecture, John is dedicated to helping organizations become more data-driven. He combines deep expertise in analytics and data science with business acumen and dynamic engineering leadership.

... Read More.
Naser Al
Naser Al (Altiscale, Inc.)

Nasser Manesh has 25 years of experience in Unix, infrastructure, distributed systems, and backend operations, mostly in DevOps, team lead, and CTO roles. He has founded startups in consumer Internet, mobile, photography and art... Read More.

Nitesh  Ambastha
Nitesh Ambastha (Credit Suisse)

Nitesh Ambastha is the Global Head of Data IT, Private Banking & Wealth Management Products at Credit Suisse. He is responsible for spearheading and creating a multi-year roadmap to design the future state Data Platform... Read More.

Anima Anandkumar
Anima Anandkumar (UC Irvine)

Anima Anandkumar is a principal scientist at Amazon Web Services. Anima is currently on leave from UC Irvine, where she is an associate professor. Her research interests are in the areas of large-scale machine learning,... Read More.

Jesse Anderson

Jesse is a Creative Engineer with many years of experience in creating products and helping companies improve their software engineering. He strives to provide developers with the resources to learn new technologies and improve their... Read More.

Steve Anderson
Steve Anderson (Intel)

Steve Anderson has been working in the IT industry since 1993 and with Intel since 2000. He has an engineering focus which has been on server and application consolidation then onto general virtualization and Read More.

June Andrews
June Andrews (Wise / GE Digital), @Dr_June_Andrews

June Andrews is an applied mathematician specializing in social network analysis. She has worked on the Search Algorithm at Yelp and designed algorithms for computing the structure of large networks with Professor John Hopcroft. Currently,... Read More.

David Andrzejewski
Graph mining for log data Hard-Core Data Science

Lead Data Sciences Engineer at Sumo Logic and co-organizer of SF Bay Area Machine Learning meetup group.


Matt Asay

Matt Asay has been involved with open source since 1998, and is one of the industry’s leading open source business strategists. Asay is a regular columnist for ReadWrite, TechRepublic and InfoWorld. Asay is vice president... Read More.

Rosie Atkins
Rosie Atkins (Groupon)

Rosie Atkins serves at the Director of Product for Breadcrumb POS, Groupon’s point of sale product that primarily serves restaurants, bars and cafes. She has held the position since September, 2014. She leads product... Read More.

Amr Awadallah

Amr Awadallah is the cofounder and CTO at Cloudera. Previously, Amr was an entrepreneur in residence at Accel Partners, served as vice president of product intelligence engineering at Yahoo, and ran one of the... Read More.

Josh Baer
Josh Baer (Spotify), @L_Phant

Josh spent six years as a software engineer building infrastructure components at AT&T before discovering the world of ‘Big Data’ in a class at NYU by O’Reilly author Foster Provost. He ‘joined the band’... Read More.

Prith Banerjee
Prith Banerjee (Schneider Electric), @prithbanerjee

Prith Banerjee is executive vice president and chief technology officer at Schneider Electric, as well as a member of the executive committee, which reports to the chairman and CEO. In this role, Prith is... Read More.

Nenshad  Bardoliwalla

Nenshad Bardoliwalla is an executive and thought leader with a proven track record of success leading product strategy, product management, and development in business analytics. He is the co-author of Driven to Perform: Risk-Aware Performance... Read More.

Dorman Bazzell (Capgemini)

Results oriented technology executive and recognized thought leader with more than 25 years of experience and demonstrated success assisting large, global entities in driving organizational change through the leveraging of Information. Proven track record of... Read More.

Steven Beeckman
Steven Beeckman (Ministry of Defence of Belgium), @stevenbeeckman

Steven is the Technical Project Officer for all software applications facilitating the cross-domain data exploitation within the belgian Ministry of Defence.

Goutham, a Principal Data Integration and Reporting Practice Leader for Capgemini, leads a global team of over ~300 practitioners responsible for the entire information lifecycle. This starts with the strategy / vision phase continuing to... Read More.

Danielle Ben-Gera

Danielle Ben-Gera is Principal Architect at Quid, where she leads the backend team, managing the algorithms stack, as well as the data acquisition and processing pipeline. Over the past three and a half years, she... Read More.

Aaron Benz
Aaron Benz (Accenture)

Aaron is a Data Scientist with Accenture.

Ryan Blue
Ryan Blue (Cloudera)

Ryan Blue is a software engineer at Cloudera, currently working on the Kite SDK team.

Joerg Blumtritt
Joerg Blumtritt (Datarella), @jbenno

Joerg Blumtritt is the founder and CEO of Datarella, a computational social science startup delivering mobile analytics, self-tracking solutions, and data science consulting. After graduating from university with a thesis on machine learning, Joerg... Read More.

Ron Bodkin
Whither YARN? Session

Ron Bodkin is a technical director on the applied artificial intelligence team at Google, where he provides leadership for AI success for customers in Google’s Cloud CTO office. Ron engages deeply with Global F500... Read More.

Irina Borisova (Chegg)

Irina is an Applied Researcher at eBay, working on machine translation training and evaluation. Irina holds two Masters degrees in natural language processing and neurolinguistics.

Vinayak Borkar (X15 Software), @vinayakb

Vinayak Borkar is the CTO of X15 Software, Inc. Previously, he was a PhD candidate at UC Irvine, where he worked on big data and contributed to the Hyracks Open Source Big Data Project.... Read More.

Kirk Borne
Kirk Borne (George Mason University ), @KirkDBorne

Dr. Kirk Borne is a Transdisciplinary Data Scientist and an Astrophysicist. He is Professor of Astrophysics and Computational Science in the George Mason University School of Physics, Astronomy, and Computational Sciences. He has been at... Read More.

David	 Brewster
David Brewster (Paxata)

Dave Brewster is the Co-Founder and CTO at Paxata. He is a serial entrepreneur and seasoned enterprise software technology leader with more than 20 years experience in successfully architecting and delivering scalable technology platforms.... Read More.

Kurt Brown
Kurt Brown (Netflix)

Kurt leads the Data Platform team at Netflix. His group architects and manages the technical infrastructure underpinning the company’s analytics. The Netflix data infrastructure includes various Big Data technologies (e.g. Hadoop, Hive, and Pig), Netflix... Read More.

Michael Brown
Michael Brown (comScore, Inc.)

Michael Brown was a founding member of comScore, Inc. in 1999. He leads the technology efforts of the company to measure Internet and Digital activities. In this position, he helped the company build the world’s... Read More.

Josh Byrd (GoPro), @joshbyrd

Josh Byrd is the Manager of Data Architecture at GoPro working within the Data Science & Engineering team. Prior to GoPro, Josh led global supply chain operations analytics efforts at Apple . His work focuses... Read More.

Alonzo Canada (Interana), @acanada

Alonzo Canada leads developing new ventures with a particular focus on product strategy and UX development. He mashes up business strategy and human-centered design to craft product vision and execute against it. He uses strategy... Read More.

John Canny
John Canny (UC Berkeley)

John F. Canny is a computer scientist and the Paul and Stacy Jacobs Distinguished Professor of Engineering in the Computer Science Department of the University of California, Berkeley. John has made significant contributions in various... Read More.

John Carnahan (Ticketmaster)

John Carnahan is the EVP of Data Science at Ticketmaster.

With a strong background in a wide variety of development roles, I’ve now moved to help developers get the most from DataSift’s products.

Oscar Celma
Oscar Celma (Pandora), @ocelma

Òscar Celma is currently Director of Research at Pandora, where he leads a team of scientists to provide the best personalized radio experience.
From 2011 till 2014 Òscar was Senior Research Scientist at Gracenote.... Read More.

Arnab Chakraborty
Arnab Chakraborty (Accenture)
Find the Business in Your Data Data-Driven Business Day

As a managing director at Accenture Analytics, now part of Accenture Digital, Arnab Chakraborty speaks analytics fluently. He serves as the Global Lead for Industry Analytics in Accenture’s advanced analytics practice and is also responsible... Read More.

Michele Chambers

Michele Chambers is an entrepreneurial executive with 25 years of technology experience, and is the President and COO at RapidMiner, which offers a predictive analytics platform. At RapidMiner, she is responsible for marketing, products... Read More.

Winston Chang
Winston Chang (RStudio)
R Day Tutorial

Winston is a software engineer at RStudio, and holds a Ph.D. in Psychology from Northwestern University. He is a developer for the ggplot2, devtools, shiny, and ggvis packages, and is the author of R Graphics... Read More.

Kuang Chen (Captricity), @kuang

The idea for Captricity came from Kuang’s PhD dissertation at UC Berkeley. His research focused on data-centric approaches to increase the efficiency of low-resource organizations, so they can better serve their disadvantaged clients. While doing... Read More.

Yanpei Chen
Yanpei Chen (Cloudera)

Yanpei Chen is a Software Engineer at Cloudera, working on the Performance Engineering team. He regularly participates in competitive performance “bake-offs” that directly drive customer purchasing decisions. His work touches upon Cloudera Search, Impala, Apache... Read More.

Lu Cheng (Airbnb)
Lu Cheng recently graduated from UC Berkeley with a B.S. in EECS and has been a software engineer on the Airbnb Search & Discover team since February 2014. Since starting at Airbnb, Lu has... Read More.
Darren Chinen (GoPro)

Darren Chinen is the Head of Data Science and Engineering at GoPro. He has extensive experience working with all types of “extreme data” having previously led the Big Data and analytics efforts at Apple, Peet’s... Read More.

Alan Choi
Alan Choi (Cloudera)

Alan Choi is a software engineer at Cloudera working on the Impala project. Previously, he worked at Greenplum on the Greenplum-Hadoop integration and worked extensively on PL/SQL and SQL at Oracle.

Jike Chong
Jike Chong (Simply Hired)

Jike Chong heads Data Science at Simply Hired, the most comprehensive job search engine that indexes over 10M jobs everyday, attracting more than 30 million monthly unique visitors, and serving hundreds of thousands of employers... Read More.

Miklos Christine
Miklos Christine (Databricks), @Miklos_C

Miklos Christine is a solutions engineer for Databricks. Miklos was previously a system engineer at Cloudera where he helped strategic customers deploy and use the Apache Hadoop ecosystem in production. He has contributed to several... Read More.

Woody Christy
Woody Christy (Cloudera)

Woody Christy has been lucky enough to be working in distributed systems his entire career. He led system designs and deployments for Video On Demand systems that scaled out to millions of end users. He... Read More.

Cliff Click
Cliff Click (0xdata)

Cliff Click is the CTO and Co-Founder of 0xdata, a firm dedicated to creating a new way to think about web-scale math and real-time analytics. I wrote my first compiler when I was 15... Read More.

Stewart Collis

Stewart Collis is the Chief Technical Officer of AWhere Inc. He has over 15 years’ experience in all phases of design, development and management of software development projects for desktop and web applications. He has... Read More.

Eric Colson
Eric Colson (Stitch Fix), @ericcolson

Eric Colson is the Chief Algorithms Officer at Stitch Fix, where he specializes in consumer algorithms. He is also an advisor at Big Data incubator Data Elite, and Big Data Platform provider Mortar Data. Previously,... Read More.

Michael Conover

Mike Conover builds machine learning technologies that leverage the behavior and relationships of hundreds of millions of people. A senior data scientist at LinkedIn, Mike has a Ph.D. in complex systems analysis with a focus... Read More.

Kathy Copic
Kathy Copic (Insight Data Science), @KathyScientist

I am one of those ex-Physicists, currently the Program Director for Growth at Insight Data Science. Insight helps PhD’s transition from academia into new careers in industry.

George Corugedo
George Corugedo (RedPoint Global), @RedPointCTO

A mathematician and seasoned technology executive, George Corugedo has over 20 years of business and technical expertise. As co-founder and CTO of RedPoint Global, George is responsible for leading the development of the RedPoint... Read More.

Daniel Crankshaw (UC Berkeley)

Daniel Crankshaw is a second year PhD student working in the UC Berkeley AMPLab with Michael Franklin. Dan’s research focuses on how ideas in distributed database systems can be applied to machine learning and data... Read More.

Alistair Croll
Alistair Croll (Solve For Interesting), @acroll

Alistair Croll is an entrepreneur with a background in web performance, analytics, cloud computing, and business strategy. In 2001, he cofounded Coradiant (acquired by BMC in 2011) and has since helped launch Rednod, CloudOps, Bitcurrent,... Read More.

Poppy  Crum
Poppy Crum (Dolby Laboratories | Stanford University)

Poppy Crum leads the Science Group at Dolby Laboratories and is a Consulting Professor at Stanford University in the Center for Computer Research in Music and Acoustics and the Program in Symbolic Systems. At Dolby,... Read More.

Nick Curcuru
Nick Curcuru (Mastercard)

Nick Curcuru is vice president of enterprise information management at Mastercard, where he’s responsible for leading a team that works with organizations to generate revenue through smart data, architect next-generation technology platforms, and protect data... Read More.

Doug Cutting
Doug Cutting (Cloudera), @cutting

Doug Cutting is the chief architect at Cloudera and the founder of numerous successful open source projects, including Lucene, Nutch, Avro, and Hadoop. Doug joined Cloudera from Yahoo, where he was a key member of... Read More.

Michelangelo D'Agostino

Michelangelo D’Agostino is a Senior Data Scientist with Civis Analytics, where he works on statistical models and writes software for data analysis.

As a reformed particle physicist turned data scientist, Michelangelo loves mungeable datasets, machine... Read More.

Jason (Jinquan) Dai

Jason (Jinquan) Dai is a senior principal engineer and CTO of big data technologies at Intel, where he is responsible for leading the global engineering teams (located in both Silicon Valley and Shanghai) on... Read More.

Kaushik Das
Kaushik Das (Pivotal)

Kaushik Das is an expert at applying mathematical models to solve business problems. He has more than 10 years of experience designing and deploying analytical software, working for enterprise software companies such as Rapt, Demandtec... Read More.

Tathagata Das
Tathagata Das (Databricks)

Tathagata Das is a Apache Spark Committer and a member of the PMC. He is the lead developer of behind Spark Streaming, and currently employed at Databricks. Earlier, he has spent in the AMPLab... Read More.

Michael Dauber
Michael Dauber (Amplify Partners), @dauber

Michael Dauber is a general partner at Amplify Partners. Previously, Mike spent over six years at Battery Ventures, where he led early-stage enterprise investments on the West Coast, including Battery’s investment in a stealth security... Read More.

Gary Davis
Gary Davis (McAfee, a division of Intel Security), @GaryDavis

Gary Davis is Chief Consumer Security Evangelist. Through a consumer lens, he works closely with internal teams to drive strategic alignment of products with the needs of the security space. Gary also oversees McAfee online... Read More.

Allen Day
Allen Day (MapR Technologies), @allenday

Allen Day – Principal Data Scientist, MapR Technologies
Allen is the Principal Data Scientist at MapR Technologies, where he leads interdisciplinary teams to deliver results in fast-paced, high-pressure environments across several industry verticals. Previously,... Read More.

michael dddd
Spark Camp: Ask Us Anything Ask Us Anything

Michael Armbrust is the lead developer of the Spark SQL and Structured Streaming projects at Databricks. Michael’s interests broadly include distributed systems, large-scale structured storage, and query optimization. Michael holds a PhD from UC... Read More.

Marco  Di Placido
Marco Di Placido (O365 Security Signals )

Marco DiPlacido is Principal software engineer on the O365 Security Signals team collaborating with researchers and cloud service owners to build intrusion detection systems for cloud scale.

Jonathan Dinu (Zipfian Academy), @clearspandex

Jonathan Dinu is the Co-founder and CTO of Zipfian Academy, an advanced training program for data scientists and data engineers in San Francisco. His background is in Computer Science and Physics at University of... Read More.

David Dobbins
David Dobbins (Rackspace Hosting)

David is the Software Development and Engineering Manager for the Cloud Big Data Platform team at Rackspace. He and his team develop and operate this cloud service which is focused on delivering the Hadoop platform... Read More.

Sheetal Dolas
Sheetal Dolas (Hortonworks)

Sheetal is a Principal Architect working with Hortonworks with strong expertise in Hadoop ecosystem and rich field experience. He helps small to large enterprises solve their business problems strategically, functionally as well as at scale... Read More.

Scott Donaldson

Scott is Senior Director at FINRA’s Market Regulation Technology. Scott leads the data and analytics teams responsible for the surveillance of U.S. equities, options and fixed income markets.

Chris DuBois

Chris DuBois is a data scientist focused on building tools for other data scientists. At Dato, he has helped design and implement tools for creating recommendation systems as well as large-scale text analysis. His current... Read More.

Ted Dunning
Ted Dunning (MapR, now part of HPE), @ted_dunning

Ted Dunning has been involved with a number of startups with the latest being MapR Technologies where he is Chief Application Architect working on advanced Hadoop-related technologies. He is also a PMC member for... Read More.

Joey Echeverria

Joey Echeverria is the director of engineering at Rocana, where he builds applications for scaling IT operations built on the Apache Hadoop platform. Joey is a committer on the Kite SDK, an Apache-licensed data... Read More.

Jeremy Edberg

Jeremy Edberg, the CEO and Founder of MinOps, which makes using the cloud stupid easy. He is an angel investor and advisor for various incubators and startups. Previously, Jeremy was the founding reliability engineer... Read More.

Alyosha Efros
Alyosha Efros (UC Berkeley), @UCBerkeley
Visual Understanding Beyond Naming Hard-Core Data Science

Alexei (Alyosha) Efros is an associate professor of electrical engineering and computer science at UC Berkeley. Previously, Alyosha spent nine years on the faculty of Carnegie Mellon University and has also been affiliated with École... Read More.

Daniel Eklund (Think Big, a Teradata Company)

Daniel Eklund is a software architect and technologist with over 18 years of experience in enterprise software development. As the first employee and Engineering Practice Manager for Think Big Analytics, Daniel has worked with many... Read More.

Ozgun Erdogan (Citus Data)

Ozgun is the CTO and one of the co-founders of Citus Data. Before founding Citus, Ozgun worked as a software developer in the Distributed Systems Engineering team at Amazon. There, he proposed, designed, and... Read More.

Sameer Farooqui
Spark Camp: Ask Us Anything Ask Us Anything

Sameer Farooqui is a client services engineer at Databricks, where he works with customers on Apache Spark deployments. Sameer works with the Hadoop ecosystem, Cassandra, Couchbase, and general NoSQL domain. Prior to Databricks, he worked... Read More.

Yuliya Feldman
Yuliya Feldman (Dremio Corporation)

Yuliya Feldman is a Principal Software Engineer at MapR. Since joining MapR, Yuliya has worked on a number of products and features starting from MapR admin infrastructure, Map/Reduce framework and most recently on YARN.Read More.

Laura Fennell
Laura Fennell (Intuit)

Laura Fennell is senior vice president, general counsel and secretary, leading Intuit’s legal, corporate affairs, information and physical security, privacy, and data services teams.

Before joining Intuit in April 2004, Fennell served as Sun Microsystems’... Read More.

Bruno Fernandez-Ruiz

Bruno Fernandez-Ruiz is a Yahoo Senior Fellow and VP of Personalization Platforms, overseeing the development and delivery of Yahoo’s personalization technology, which Bruno’s teams use to harvest deep user insights in order to deliver a... Read More.

Bob Filbin
Bob Filbin (Crisis Text Line), @bobfilbin

Bob Filbin is chief data scientist at Crisis Text Line, the first large-scale 24/7 national crisis line for teens on the medium they use and trust most: texting. Bob specializes in the application of behavioral... Read More.

Shai Fine
Shai Fine (Intel)

Shai Fine is a Principal Engineer at the Advanced Analytics group in Intel, focusing on Machine Learning, Business Intelligence, and Big Data. Prior to Intel, Shai worked for the IBM Research Lab in Haifa,... Read More.

Lutz Finger
Designing Data Products Data-Driven Business Day

Lutz Finger is a data scientist and product manager at Google, focusing on the intersection of predictions and data to change healthcare. Using the power of AI, he and his team are committed to improving... Read More.

Danyel Fisher
Danyel Fisher (, @FisherDanyel

Danyel Fisher is a Senior Researcher in information visualization and human-computer interaction at Microsoft Research’s VIBE group. His research focuses on ways to help users interact with data more easily. His recent work has... Read More.

Mike Flannagan (Cisco)

As Senior Director and General Manager for Cisco’s Data and Analytics Group (IBTG), Mike Flannagan guides Cisco’s Big Data and Analytics business strategies and execution.
Since joining Cisco in 2000, Mike has... Read More.

Jonathan Frederic
PyData Ask Us Anything Ask Us Anything
PyData at Strata Tutorial

Jonathan is a full time IPython developer who primarily works on the IPython notebook front-end. In his spare time Jonathan enjoys developing a Python based video game engine and Poster, an open source HTML5 canvas... Read More.

David Freeman (Pentaho)

37 years enterprise sales expereince

David Freeman
David Freeman (LinkedIn)

Dr. Freeman is head of Security Data Science at LinkedIn, where he leads a team charged with detecting and preventing fraud and abuse across the LinkedIn site and ecosystem. He has a Ph.D. in mathematics... Read More.

Chris Fregly
Chris Fregly (Amazon Web Services), @cfregly
Spark Camp: Ask Us Anything Ask Us Anything

Chris Fregly is a senior developer advocate focused on AI and machine learning at Amazon Web Services (AWS). Chris shares knowledge with fellow developers and data scientists through his Advanced Kubeflow AI Meetup and... Read More.

Eric Frenkiel
Eric Frenkiel (MemSQL)

Eric Frenkiel co-founded MemSQL and has served as CEO since inception. Before MemSQL, Eric worked at Facebook on partnership development. He has worked in various engineering and sales engineering capacities at both consumer and... Read More.

Ellen Friedman
Ellen Friedman (Independent)

Ellen Friedman is a solutions consultant, scientist and author, currently writing about a variety of open source and big data topics including being co-author of Mahout in Action (Manning), the Practical Machine Learning series from... Read More.

Ross Fubini
Ross Fubini (Canaan Partners)
Ross Fubini joined Canaan Partners’ Menlo Park office in 2012 as a Venture Partner. He focuses on the firm’s enterprise, consumer, and healthcare IT investment efforts.

Before joining Canaan, Ross was a partner at seed-stage... Read More.

Ajit Gaddam

Ajit is the Chief Security Architect at VISA, the worlds largest payment network, which processed $4.5 trillion last year. Areas of expertise include data security, cryptography, and mobile security. He held other senior roles... Read More.

Anil Gadre
Anil Gadre (MapR)

Anil Gadre is the SVP of Product Management at MapR. Prior to MapR, Anil was the EVP of Product Management at Silver Spring Networks, responsible for product strategy, planning and marketing of networking... Read More.

Eddie Garcia
Eddie Garcia (Cloudera), @edygarcia

Eddie Garcia is chief information security officer at Cloudera, a leader in enterprise analytic data management, where he draws on his more than 20 years of information and data security experience to help Cloudera Enterprise... Read More.

Alan Gates
Alan Gates (Hortonworks)

Alan is a co-founder at Hortonworks and an original member of the engineering team that took Pig from a Yahoo! Labs research project to a successful Apache open source project. Alan also designed HCatalog and... Read More.

Max Gazor (Charles River Ventures)
Ari Gesher
Ari Gesher (Kairos Aerospace), @alephbass

A software/systems engineer with a lot of experience building big, real-world systems.

Jonathan Goldman
Jonathan Goldman (Intuit)

Jonathan is Director of Data Science and Analytics at Intuit. He co-founded Level Up Analytics, a premier data science consulting company focused on data science, big data, and analytics which Intuit acquired in 2013. From... Read More.

Greg Goldsmith (Attivio)

Greg Goldsmith is the Chief Product Officer at Attivio. He is responsible for product strategy, product management, and research & development for Attivio’s Enterprise Search and Big Data Discovery & Dexterity businesses. Greg is the... Read More.

Floris Grandvarlet current responsibility at Cisco is Head of Unified Computing in DCV EMEAR Tech Ops. After the merge of European Theater with Emerging Market Theater, he is now leading a team of... Read More.

Brian Granger
Brian Granger (Cal Poly San Luis Obispo), @ellisonbg

Brian Granger is an Associate Professor of Physics at Cal Poly State
University in San Luis Obispo, CA. He has a background in theoretical physics, with a Ph.D from the University of Colorado. His... Read More.

Alexander Gray
Alexander Gray (Skytree, Inc.), @skytreeHQ

Alexander Gray is CTO at Skytree and Associate Professor in the College of Computing at Georgia Tech. His work has focused on algorithmic techniques for making machine learning tractable on massive datasets. He began... Read More.

Scott Gray
Scott Gray (IBM)

Scott Gray is a senior architect for IBM’s InfoSphere BigInsights Big SQL solution. Gray has an extensive career in the computer industry focusing heavily on relational database, architecture, design, optimization, and internals. Prior to... Read More.

Michael Greene
Michael Greene (Intel)

Michael Greene is vice president of the Software and Services Group and general manager of System Technologies and Optimization at Intel Corporation. Greene is responsible for delivering software and solutions that enable Intel and its... Read More.

Garrett Grolemund
Garrett Grolemund (RStudio)
R Day Tutorial

Garrett Grolemund is the editor-in-chief of, the development center for the Shiny R package, and is the author of Hands-On Programming with R as well as Data Science with R, a forthcoming book by... Read More.

Robert Grossman
Robert Grossman (University of Chicago)

Robert Grossman is a faculty member and the Chief Research Informatics Officer in the Biological Sciences Division of the University of Chicago. He is the Director of the Center for Data Intensive Science and a... Read More.

Mark Grover

Mark Grover is a committer on Apache Bigtop, a committer and PMC member on Apache Sentry (incubating) and a contributor to
Apache Hadoop, Apache Hive, Apache Spark, Apache Pig, Apache Sqoop and Apache... Read More.

Denny Guang-yeu Lee
Denny Guang-yeu Lee (Databricks)
Spark Camp: Ask Us Anything Ask Us Anything

Denny Lee is a Developer Advocate at Databricks. He is a hands-on distributed systems and data sciences engineer with extensive experience developing internet-scale infrastructure, data platforms, and predictive analytics systems for both on-premise and cloud... Read More.

Randy Guck
Randy Guck (Dell Software), @randyguck

Randy Guck is a Principal Engineer at Dell Software, focused on big data solutions for commercial software applications. He has developed software for over 30 years, building specialized databases including semantic, hybrid object/relational, and NoSQL.... Read More.

Carlos Guestrin
Carlos Guestrin (Apple | University of Washington )

Carlos Guestrin is the director of machine learning at Apple and the Amazon Professor of Machine Learning in Computer Science and Engineering at the University of Washington. Carlos was the cofounder and CEO of... Read More.

Maya Gupta
Maya Gupta (Google)

Gupta runs an R&D group at Google Research focused on designing efficient and transparent statistical learning algorithms. From 2003-2012, she was a professor of electrical engineering at the Univ. of Washington. Gupta received the Read More.

Vida Ha
Vida Ha (Databricks), @femineer

Vida is currently a Solutions Engineer at Databricks. In her past, she worked on scaling Square’s Reporting Analytics System. She first began working with distributed computing at Google – where she improved search rankings of... Read More.

John Haddad
John Haddad (Informatica), @JohnM_Haddad

John Haddad is Senior Director of Big Data Product Marketing at Informatica Corporation. He has over 25 years’ experience developing and marketing enterprise applications. Today, he advises organizations on Big Data best practices from a... Read More.

Lisa Hammitt
Lisa Hammitt (Salesforce)

Lisa Hammitt is a senior software executive with 25 years of industry experience. Most recently, as vice president of marketing of Salesforce Community Cloud, she is spearheading strategy and is charting out industry-led use cases... Read More.

Ben Hamner

Ben Hamner is Kaggle’s Chief Scientist and is responsible for the technical side of the business. He is currently focused on applying machine learning to the energy industry, and has previously worked with machine learning... Read More.

Jeffrey Heer
Jeffrey Heer (Trifacta | University of Washington), @jeffrey_heer

Jeff is Trifacta’s Chief Experience Officer and a Professor of Computer Science at the University of Washington, where he directs the Interactive Data Lab. Jeff’s passion is the design of novel user interfaces for exploring,... Read More.

Jeremy Heffner
Jeremy Heffner (Azavea)

I work with crime data to model patterns and forecast risk; the intersection of geography, data science, and social good.

Keywords: geographic data, raster processing, predictive analysis, spacetime event modeling, weather, demographics, machine learning, early... Read More.

Joe Hellerstein

Joseph M. Hellerstein is a Chief Strategy Officer at Trifacta and Chancellor’s Professor of Computer Science at UC Berkeley. His work focuses on data-centric systems and the way they drive computing. He is an Read More.

Spencer Herath (Accenture)

Spencer is a Data Scientist with Accenture.

Craig Hibbeler
Craig Hibbeler (MasterCard Advisors), @BigDataCraig

Craig Hibbeler is principal for big data and security within MasterCard Advisors’ Enterprise Information Management consultancy practice. In his role, Craig leverages practical hands-on experience and broad industry and platform knowledge to develop, execute, secure,... Read More.

Dave Holtz
Dave Holtz (Airbnb)

Dave Holtz is a data scientist at Airbnb focusing on online reputation and pricing. Previously, he worked as a data science engineer at Yub (acquired by and as a data scientist and Product Manager... Read More.

Nicholas Horton
Nicholas Horton (Amherst College )
R Day Tutorial

Applied biostatistician with research interests in missing data methods, statistical computing, and statistical education

Solomon Hsiang
Solomon Hsiang (UC Berkeley)

Solomon Hsiang combines data with mathematical models to understand how society and the environment influence one another. In particular, he focuses on how policy can encourage economic development while managing the global climate. His research... Read More.

Leah Hunter
Leah Hunter (Tech Journalist), @leahthehunter

Leah Hunter writes about the human side of tech for Fast Company, the Guardian, and O’Reilly. She is authoring two upcoming books—one on augmented reality from O’Reilly and the other on the future in five... Read More.

Kurt Hurtado (Elasticsearch Inc), @kurtado

Kurt Hurtado is a Logstash developer based in Los Altos, CA. He has been working with Elasticsearch and Logstash for many years and thrives on building excellent architectures based on the ELK stack for... Read More.

Alysa Z. Hutnik
Alysa Z. Hutnik (Kelley Drye & Warren LLP)

Alysa Z. Hutnik is a partner in the Advertising & Marketing and Privacy & Information Security practices at Kelley Drye & Warren LLP in Washington, D.C. Her practice includes representing clients in all forms... Read More.

Matt Ingenthron
Matt Ingenthron (Couchbase, Inc.), @ingenthr

Matt is an experienced web architect with a software development background. He has deep expertise in building, scaling and operating global-scale Java, Ruby on Rails and AMP web applications. He has been a contributor... Read More.

Arif Janmohamed
Arif Janmohamed (Lightspeed Venture Partners)

Arif joined Lightspeed in 2008 and focuses primarily on investments in the areas of cloud and datacenter technologies, as well as enterprise mobile and SaaS solutions.

Arif is currently working closely with the teams at... Read More.

Anant Jhingran
Anant Jhingran (Apigee)

Dr. Anant Jhingran (PhD Berkeley) joined Apigee from IBM where he was VP and CTO for IBM’s Information Management Division and Co-Chair of IBM wide Cloud Computing Architecture Board. He was responsible... Read More.

Annika Jimenez
Annika Jimenez (Pivotal)

Annika is a seasoned leader of analytics initiatives, and came from Pivotal where she built the “Data Science Dream Team” – an industry-leading group of Data Scientists – representing a rich combination of vertical domain... Read More.

Ann Johnson
Ann Johnson (Interana)

Ann Johnson is cofounder and CEO of Interana, the experts in event data analytics, where she has created a community of all-star talent working to make data-informed decisions a natural extension of everyone’s workflow.... Read More.

Anne Johnson
Anne Johnson (Credit Suisse)

Anne Johnson is the Head of PBWM Products Risk Technology at Credit Suisse. Her SPEAR+ Investment Risk Technology platform has propelled Credit Suisse into a market leadership position of transparently managing and monitoring the... Read More.

Robert Johnson (Interana)

Bobby is co-founder of Interana, where he is building the next generation of tools for analyzing massive amounts of data in real time.

Bobby was Director of Engineering at Facebook where he led the infrastructure... Read More.

Michael Jordan
Michael Jordan (UC Berkeley), @UCBerkeley

Michael I. Jordan is the Pehong Chen Distinguished Professor in the Department of Electrical Engineering and Computer Science and the Department of Statistics at the University of California, Berkeley. His research interests bridge the computational,... Read More.

Adam Jorgensen (Pragmatic Works), @ajbigdata

My passion is helping our clients build management processes and cultures where they are identifying, analyzing and driving their business opportunities through personalized management and analytic solutions. I’ve authored over 10 books on technical and... Read More.

Karthik Kambatla
Karthik Kambatla (Cloudera)

Karthik Kambatla is a Software Engineer at Cloudera in the scheduling and resource management team. He works primarily on MapReduce and YARN, and he is a Committer to Apache Hadoop. He is also a... Read More.

Holden Karau
Holden Karau (Independent), @holdenkarau
Spark Camp: Ask Us Anything Ask Us Anything

Holden Karau is a Software Development Engineer at Databricks and is active in open source. She the author of a book on Spark and has assisted with Spark workshops. Prior to Databricks she worked on... Read More.

Michael Kehoe
Michael Kehoe (LinkedIn)

Michael Kehoe is a site reliability engineer at LinkedIn, where he specializes in building and maintaining reliable, scalable system infrastructure. Previously, he worked with networks at the University of Queensland, built small satellites at Read More.

Kyle Kelley
Kyle Kelley (Netflix), @rgbkrk
PyData Ask Us Anything Ask Us Anything
PyData at Strata Tutorial

Kyle Kelley is a senior software engineer at Netflix, a maintainer on, and a core developer of the IPython/Jupyter project. He wants to help build great environments for collaborative analysis, development, and production workloads... Read More.

Eamonn Keogh (University of California - Riverside)

Dr. Eamonn Keogh is a professor of computer science at the University of California Riverside, specializing in data mining. He is a highly prolific researcher, as of 2015 he is one of only three people... Read More.

Tigran Khrimian
Tigran Khrimian (FINRA)

Tigran Khrimian is a Senior Director at FINRA responsible for the development of the organization’s Big Data ingestion and management platform, which processes billions of market order events on daily basis. He oversees technology... Read More.

Jonathan King (CenturyLink ), @centurylinkcld

Jonathan H. King is Vice President, Cloud Strategy and Business Development for CenturyLink Technology Solutions. In this role, Jonathan leads cloud strategy, business development, alliances, M&A and global go to market for CenturyLink. Prior to... Read More.

Jake Klamka is the founder of the Insight Data Science Fellows Program, a post-doctoral fellowship that helps quantitative PhDs transition from academia to careers in data science. Insight Fellows are now data scientists at top... Read More.

Jennifer Klay
Jennifer Klay (Cal Poly San Luis Obispo)
PyData Ask Us Anything Ask Us Anything
PyData at Strata Tutorial

Jennifer Klay is an Associate Professor of Physics at Cal Poly San Luis Obispo. She has worked with big data at the CERN Large Hadron Collider’s ALICE experiment for 17 years, unlocking the... Read More.

James Kochuba is a Senior Solution Architecture on the SAS Enterprise Architecture team. He helps strategic SAS customers world-wide designing and building high performance architectures. Focus areas in the past several years have... Read More.

Adam  Kocoloski

Adam is a Co-founder and CTO of Cloudant, and an IBM Distinguished Engineer. He is an Apache CouchDB developer, joining the project as one of the first ten committers, and the lead architect... Read More.

Marcel Kornacker
Marcel Kornacker (Cloudera)

Marcel Kornacker is the architect and tech lead at Cloudera for Impala. Prior to Cloudera Marcel worked at Google, where he worked on several ads serving and storage infrastructure projects and eventually became the tech... Read More.

Rado Kotorov
Rado Kotorov (Information Builders)

Dr. Rado Kotorov is vice president of Product Marketing for Information Builders and works both with the WebFOCUS and the iWay product divisions to provide thought leadership, analyze market and technology trends, aid in the... Read More.

John Kreisa
John Kreisa (Hortonworks), @marked_man

Currently serves as VP of Marketing Strategy at Hortonworks. Previous positions include Director at Red Hat and VP at Cloudera.

John holds a B.S. in computer science from The University of Texas at Austin.

Jay Kreps
Jay Kreps (Confluent)

Jay is one of the primary architects for LinkedIn where he focuses on data infrastructure and data-driven products.

He was among the original authors of a number of open source projects in the scalable data... Read More.

Chris   Lalonde
Chris Lalonde (ObjectRocket)

Chris was Co-Founder and CEO of ObjectRocket before merging with Rackspace in February 2013. He now leads ObjectRocket by Rackspace in their new digs in downtown Austin. He has 20+ years of experience building... Read More.

Philip Langdale
Philip Langdale (Cloudera)

Philip Langdale is the engineering lead for cloud at Cloudera. He joined the company as one of the first engineers building Cloudera Manager and served as an engineering lead for that project until moving to... Read More.

Sasha Laundy
Sasha Laundy (Warby Parker), @sashalaundy

Sasha is the founding data scientist and engineer at Polynumeral, a data science consultancy in New York City. She helps clients solve hard data problems and design their data strategy, including the World Bank, New... Read More.

Sylvain Le Borgne (Havas Media)

Sylvain joined Havas Media in 2011 to lead the company’s big data efforts. Recognized for his expertise in online marketing and data driven systems, Sylvain develops and implements client solutions for Artemis, Havas Media’s proprietary... Read More.

Julien Le Dem
Julien Le Dem (WeWork), @J_

Julien Le Dem is a Data Systems Engineer at Twitter. Previously he was a Principal Engineer at Yahoo. He contributes to a number of Hadoop-related projects including HCatalog and he’s a PMC member on... Read More.

Costin Leau
Costin Leau (Elastic), @costinl

Costin Leau is an engineer at Elasticsearch, where he leads big data efforts. An open source veteran, Costin led various Spring projects (Spring OSGi, GemFire, Redis, Hadoop) and authored an OSGi spec. He has spoken... Read More.

Cornelia Levy-Bencheton

Cornelia Lévy-Bencheton is a communications strategy consultant and writer whose data-driven marketing and decision support work helps companies optimize their performance.

As Principal of CLB Strategic Consulting, LLC., her focus is on the... Read More.

Chengxiang Li
Chengxiang Li (Intel)

ChengXiang Li is a software engineer from Intel SSG Big Data Technology team, he is dedicated to enable and improve SQL interfaces in Hadoop ecosystem, and optimize SQL engine performance with IA... Read More.

Fei-Fei Li
Fei-Fei Li (Stanford University)

Dr. Fei-Fei Li is an Associate Professor in the Computer Science Department at Stanford, and the Director of the Stanford Artificial Intelligence Lab and the Stanford Vision Lab. Her research areas are in machine learning, computer... Read More.

Etan Lightstone (New Relic)

As Director of UX Design at New Relic, Etan Lightstone oversees a team of talented designers, leads the user experience design strategy, and on occasion gets the opportunity to contribute to the product codebase. Etan... Read More.

Lucian Lita
Lucian Lita (Yoyo Labs), @datariver

Founder and CEO at YoyoLabs, building custom data platforms, services, and data products with a bias towards personalization and privacy. Previously director of data engineering at Intuit, leading data platform, a/b testing, personalization, and... Read More.

Bill Loconzolo
Bill Loconzolo (Intuit)

Bill Loconzolo is the Vice President of Data Engineering for product analytics across Intuit. His team is focused on creating and scaling data products that leverage the collective data of 50 million customers to provide... Read More.

AJ Loiacono (Truveris), @truveris

A.J. Loiacono is a co-founder of Truveris. In his current role as the Chief Innovation Officer, his responsibilities include product development, strategic planning and enterprise partnerships. A.J. is a serial entrepreneur and has fifteen years... Read More.

Ben Lorica
Ben Lorica (O'Reilly), @bigdata

Ben Lorica is the chief data scientist at O’Reilly. Ben has applied business intelligence, data mining, machine learning, and statistical analysis in a variety of settings, including direct marketing, consumer and market research, targeted advertising,... Read More.

Yucheng Low
Yucheng Low (Dato)

Yucheng Low is a co-founder and Chief Architect of GraphLab Inc. He led the development of the SFrames and SGraphs scalable datastructures underpinning the GraphLab Create Product. He completed his PhD in Machine Learning in... Read More.


Brandon MacKenzie is the Data Science on Hadoop leader on IBM’s Worldwide Technical Sales team for Information Management Software. Brandon is an expert on statistical processing in Hadoop and HPC environments. Brandon earned his... Read More.

Mark Madsen
Mark Madsen (Teradata), @markmadsen

Mark Madsen is a fellow at Teradata, where he’s responsible for understanding, forecasting, and defining the analytics ecosystem and architecture. Previously, he was CEO of Third Nature, where he advised companies on data strategy... Read More.

Roger Magoulas
Roger Magoulas (O'Reilly Media), @rogerm

Roger Magoulas is the vice president of O’Reilly Radar. Previously, Roger was the research director at O’Reilly, where he and his team built the company’s analysis infrastructure and provided analytic services and insights on technology-adoption... Read More.

Oliver Mainka (SAP Labs LLC)

Oliver has been working at SAP since 1990, first in Germany, then since 1995 in Palo Alto, California. Oliver worked in various positions, in User Experience design, in Technical Marketing and Knowledge Management, on... Read More.

Ted Malaska
Ted Malaska (Capital One), @TedMalaska

Ted has worked on close to 60 Clusters over 2-3 dozen clients with over 100’s of use cases. He has 18 years of professional experience working for start-ups, the US government, a number of the... Read More.

Michal Malohlava
Michal Malohlava (0xdata, Inc), @mmalohlava

Michal is a geek, developer, Java, Linux, programming languages enthusiast developing software for over 10 years.
He obtained PhD from the Charles University in Prague in 2012 and post-doc at Purdue University.

During his... Read More.

Tatsiana Maskalevich

Tatsiana is a Data Science manager at Stitch Fix. Blending both industrial and academic research, Tatsiana is expert at solving hard business problems. She brings a background in both mathematics and statistics, and has deep... Read More.

Asim Mathur (eBay)

Asim is a Senior Data Engineer in Machine Translation team at eBay. He has been at eBay for 7 years working in Data Analytics, Data Engineering and Business Intelligence.

Pankaj Mathur (Acxiom)

Pankaj manages the growth strategy and sales for Acxiom’s digital products that are offered directly or through distribution partners. He also provides leadership on Acxiom’s SMB digital solutions strategy. Previously, Pankaj managed strategic enterprise... Read More.

Lauri Mazzuchetti
Lauri Mazzuchetti (Kelley Drye)

Lauri Mazzuchetti is the managing partner of Kelley Drye’s Parsippany office. She represents consumer-facing businesses, including telecommunications carriers, in commercial litigation in federal and state courts, both at the trial and appellate levels. Ms. Mazzuchetti... Read More.

Arianna McClain

Arianna McClain is a design researcher – data specialist at IDEO. Arianna works at the intersection of technology, data, and human behavior. She leads hybrid research processes that merge quantitative (data) and qualitative (stories)... Read More.

Patrick McFadin
Patrick McFadin (Datastax)

Patrick McFadin is regarded as a foremost expert for Apache Cassandra and data modeling. As Chief Evangelist for Apache Cassandra and consultant working for DataStax, he has been involved in some of the biggest deployments... Read More.

Emma McGrattan
Emma McGrattan (Actian)

Emma McGrattan is SVP of engineering at Actian, where she leads the Actian Vector, Actian Vector Hadoop Edition, and Actian Matrix development teams. A leading authority in DBMS technologies, Emma has over 20... Read More.

Wes McKinney
Wes McKinney (Two Sigma Investments), @wesmckinn
PyData Ask Us Anything Ask Us Anything
PyData at Strata Tutorial

Wes McKinney is a software architect at Two Sigma Investments. He is the creator of Python’s pandas library and a PMC member for Apache Arrow and Apache Parquet. He wrote the book Python for... Read More.

Harrison Mebane
Harrison Mebane (Silicon Valley Data Science), @harrisonmebane

I have a background in theoretical physics, but now I do fun stuff with data. I enjoy numbers, coding, and learning new tools.

Eden Medina
Eden Medina (Indiana University, Bloomington), @edenmedina

Eden Medina is Associate Professor of Informatics and Computing and Director of the Rob Kling Center for Social Informatics at Indiana University, Bloomington. Her research uses technology as a means to understand historical processes and... Read More.

Chad Meley
Chad Meley (Teradata), @chad_meley

Chad Meley is the Vice President of Product & Services at Teradata.

Prior to joining Teradata, he led Electronic Arts’ Data Platform organization that supported Financial Analysis, Game Development, Marketing Analysis and CRM. Chad... Read More.

Xiangxiang Meng

Xiangxiang Meng is a Staff Scientist in the Data Science Technologies department at SAS. Xiangxiang received his PhD and MS from the University of Cincinnati. The current focus of his work is on the... Read More.

Miriah Meyer
Miriah Meyer (University of Utah), @miriahmeyer

Miriah Meyer is an associate professor in the School of Computing at the University of Utah, where she runs the Visualization Design Lab. Her research focuses on the design of visualization systems for helping analysts... Read More.

Justin Michaels (Couchbase)

With over 20 years experience in deploying mission critical systems, Justin Michaels industry experience covers capacity planning, architecture and industry vertical experience. Justin brings his passion for architecting, implementing and improving Couchbase to the community... Read More.

Ryan Michaluk
Ryan Michaluk (Allstate)

Ryan is a data scientist in Allstate’s Quantitative Research and Analytics department, where he uses big data to improve the customer experience.

Mostafa Mokhtar
Mostafa Mokhtar (Cloudera)

Mostafa Mokhtar is a performance engineer at Cloudera. Previously, he held similar roles at Hortonworks and on the SQL Server team at Microsoft.

Andreas Mueller
Andreas Mueller (NYU, scikit-learn)
PyData Ask Us Anything Ask Us Anything
PyData at Strata Tutorial

Andreas Mueller received his PhD in machine learning from the University of Bonn. After working as a machine learning researcher on computer vision applications at Amazon for a year, he recently joined the Center for... Read More.

Manu Mukerji

Manu has a background in cloud computing and big data, handling billions of transactions per day in real time. He enjoys building and architecting scalable, highly available data solutions, and has extensive experience working in... Read More.

Aaron Myers
Aaron Myers (Cloudera, Inc.), @atm

Aaron T. Myers is a Software Engineer at Cloudera and an Apache Hadoop Committer. Aaron’s work is primarily focused on HDFS. Prior to joining Cloudera, Aaron was a Software Engineer and VP of Engineering... Read More.

Jacques Nadeau
Jacques Nadeau (Dremio)

Jacques Nadeau is MapR’s lead developer on the Apache Drill open source project. He is an industry veteran with over 15 years of big data and analytics experience. Most recently, he was cofounder and Read More.

Neha Narkhede

Neha Narkhede is the cofounder and CTO at Confluent, a company backing the popular Apache Kafka messaging system. Previously, Neha led streams infrastructure at LinkedIn, where she was responsible for LinkedIn’s petabyte-scale streaming infrastructure... Read More.

Paco Nathan
Paco Nathan (, @pacoid
Spark Camp: Ask Us Anything Ask Us Anything

O’Reilly author (Just Enough Math and Enterprise Data Workflows with Cascading) and a “player/coach” who’s led innovative Data teams building large-scale apps. Director of Community Evangelism for Apache Spark with Databricks, advisor... Read More.

Chris Neumann
Chris Neumann (500 Startups), @ckneumann

Chris Neumann is the CEO and Cofounder of DataHero, the leading platform for visualizing data from online services. Chris was previously the first employee at Big Data pioneer Aster Data Systems, where he helped... Read More.

Emi Nomura
Emi Nomura (Jawbone), @eminomura

Emi is a Senior Data Scientist at Jawbone. She completed her Ph.D. in Neuroscience at Northwestern University studying memory systems of the brain using functional neuroimaging. She went on to study neuroplasticity in patients with... Read More.

Cait O'Riordan
Cait O'Riordan (Financial Times), @caitoriordan
Shazam Data-Driven Business Day

Cait O’Riordan is the Financial Time’s (FT) chief product and information officer (CPIO). She’s responsible for platform and product strategy, development and operations across the FT Group, working in close partnership with editorial and... Read More.

Stephen O'Sullivan
Stephen O'Sullivan (Data Whisperers), @steveos

A leading expert on big data architecture and Hadoop, Stephen brings over 20 years of experience creating scalable, high-availability, data and applications solutions. A veteran of WalmartLabs, Sun and Yahoo!, Stephen leads data architecture and... Read More.

Matthew Ocko
Matthew Ocko (Data Collective), @mattocko

Matt Ocko has three decades of experience as a technology entrepreneur and VC. Over his career, he has invested in Cotendo, Zynga, Facebook, XenSource, UltraDNS, FlashSoft, Fortinet, Aggregate Knowledge, Virtuata, DataMirror, Couchbase, Ayasdi, Kenshoo, D-Wave... Read More.

Vadim Ogievetsky

Vadim Ogievetsky is a frontend developer at Metamarkets. Previously, he was part of the Data Visualization group at Stanford where he contributed to Protovis and D3.js. He is currently focused on Facet, a data visualization... Read More.

Travis Oliphant
Travis Oliphant (Anaconda)
PyData Ask Us Anything Ask Us Anything
PyData at Strata Tutorial

Travis Oliphant has a Ph.D. from the Mayo Clinic and B.S. and M.S. degrees in Mathematics and Electrical Engineering from Brigham Young University. Since 1997, he has worked extensively with Python for numerical and scientific... Read More.

Lance Olson
Lance Olson (Microsoft), @lanceeo

Lance Olson is a Partner Group Program Manager responsible for HDInsight, Microsoft’s Hadoop-as-a-Service offering in Azure. Lance has worked extensively on database, business intelligence, and developer technologies for enterprise customers over the last 20 years.... Read More.

Jerry Overton

Jerry Overton is a data scientist and distinguished technologist in DXC’s Analytics Group, where he is the principal data scientist for industrial machine learning, a strategic alliance between DXC and Microsoft comprising enterprise-scale... Read More.

Andy Palmer (Tamr, Inc.), @andyhpalmer

Andy Palmer is co-founder and CEO of Tamr, Inc., Palmer co-founded Tamr with fellow entrepreneur Michael Stonebraker, PhD, adjunct professor at MIT CSAIL; Ihab Ilyas, professor at the University of Waterloo; and... Read More.

Rahul Pathak
Rahul Pathak (Amazon Web Services)

Rahul Pathak runs the Amazon EMR and AWS Data Pipeline businesses for AWS. Amazon EMR is a web service for running frameworks like Hadoop, Spark, and Presto on managed clusters in... Read More.

DJ Patil
DJ Patil (White House Office of Science and Technology Policy), @dpatil

DJ Patil is the chief data scientist and deputy chief technology officer for data policy at the White House Office of Science and Technology Policy, where he advises on policies and practices to maintain US... Read More.

Pamela Peele
Pamela Peele (UPMC)

Pamela Peele, Ph.D., is the Chief Analytics Officer of the UPMC Insurance Services Division. Dr. Peele brings 13 years of patient care experience along with 12 years of academic research experience to her position... Read More.

Srinath Perera

Dr Srinath Perera, is a Director of Research at WSO2 Inc., where he overlooks the overall WSO2 platform architecture with the CTO. He is a co-founder of Apache Axis2, a member
of... Read More.

Fernando Perez
Fernando Perez (UC Berkeley and Lawrence Berkeley National Laboratory), @fperez_org

Fernando Perez is a research scientist at the UC Berkeley Helen Wills
Neuroscience Institute, and a founding investigator of the Berkeley Institute
for Data Science, created in 2013. He received a PhD in... Read More.

Mike Polcari

Mike Polcari joined 23andMe in 2008 and works with a talented team of engineers to deliver access, understanding, and benefit from the human genome.
Mike is focused on architecture across 23andMe’s commercial products,... Read More.

Jeff Pollock
Jeff Pollock (Oracle)

Mr. Pollock is an expert data integration technology leader. He is currently Vice President of Product Management for the Oracle Data Integration business unit and previously responsible for all IBM Information Integration products. Prior... Read More.

Christopher Pouliot

Chris Pouliot is a real life rocket scientist, who has also spun astronauts until they were motion sick, split atoms to make an aircraft carrier go fast, provided insightful analysis that led to Google to... Read More.

Alexander Prinz (Lufthansa Airlines)
Find the Business in Your Data Data-Driven Business Day

Alexander is the project lead for big data analytics at Lufthansa Airlines and is involved in architecting key analytics transformation initiatives at Lufthansa. An expert in econometrics, Alexander taps into the power of statistical analysis... Read More.

Satyam Priyadarshy

Dr. Priyadarshy is the Chief Data Scientist at Halliburton. He is also the founder of ReIgnite Strategy, an advisory company to CXOs. Dr. Priyadarshy was the VP Data Science at Acxiom. Prior to that he... Read More.

Lisa Qian
Lisa Qian (Airbnb), @@_lisaq

Lisa is a data scientist at Airbnb, where she focuses on search and discovery. Prior to joining Airbnb, Lisa completed a PhD in Applied Physics at Stanford University. Outside of data science, Lisa enjoys playing... Read More.

Fintan Quill
Fintan Quill (Kx Systems Inc.), @fintanquill

Fintan is responsible for Kx sales engineering globally. An expert in developing database analytic systems, Fintan joined Kx in 2012 after having worked extensively with quantitative teams at a variety of Wall Street investment banks,... Read More.

Ki Ra (Directly), @eugmandel

Eugene leads data science at Directly, a startup in San Francisco that helps companies scale excellent customer service by letting expert users help other users on demand. He builds augmented intelligence systems that allow humans... Read More.

Rashmi Raghu
Rashmi Raghu (Pivotal)

Rashmi Raghu has extensive experience in executing complex analytics projects in multiple verticals. She is currently a Principal Data Scientist in the Pivotal Data Labs team at Pivotal with a focus on applications in the... Read More.

Srivatsan Ramanujam

Srivatsan Ramanujam is a Principal Data Scientist at Pivotal where he executes Data Sciences labs for their customers, with a special focus on Text Analytics. He brings with him over 8 years of industry experience... Read More.

Jairam Ranganathan
Jairam Ranganathan (Cloudera)

Jai is the Director of Product Strategy at Cloudera where he is responsible for planning the future roadmap of Cloudera products. Before Cloudera, he spent a decade at VMware, where amongst other things he was... Read More.

Tye Rattenbury
Tye Rattenbury (Trifacta)

Tye Rattenbury is a lead Data Scientist at Trifacta. Tye holds a PhD in Computer Science from UC Berkeley. Prior to joining Trifacta, he was a Data Scientist at Facebook and the Director of Data... Read More.

Chris Re
Chris Re (Stanford University | Apple), @HazyResearch

Christopher (Chris) Re is an assistant professor in the Department of Computer Science at Stanford University. The goal of his work is to enable users and developers to build applications that more deeply understand and... Read More.

Ben Recht
Ben Recht (University of California, Berkeley)

Ben Recht is an associate professor in the Department of Electrical Engineering and Computer Sciences and the Department of Statistics at the University of California, Berkeley. Ben’s research focuses on scalable computational tools for large-scale... Read More.

Azarias Reda (Republican National Committee ), @azarias

Azarias is the first ever Chief Data Officer of the Republican Party. He earned his PhD in computer science from the University of Michigan, and previously founded Meritful – an enterprise software startup based in... Read More.

Matthew Rocklin
Matthew Rocklin (Anaconda)
PyData Ask Us Anything Ask Us Anything
PyData at Strata Tutorial

Matthew Rocklin is an open source software developer at Anaconda focusing on efficient computation and parallel computing, primarily within the Python ecosystem. He has contributed to many of the PyData libraries and today works on... Read More.

Julia Rodriguez
Julia Rodriguez (Eagle Investment Systems), @juliargentinag

Julie Rodriguez is a Boston-based Information Architect with experience in user research, analysis and design for complex systems. Within the global markets domain, Julie has delivered pioneering solutions in such areas as wealth management, investment... Read More.

John Russell
John Russell (Cloudera)

John Russell is a software developer and technical writer, and he’s currently the documentation lead for Impala. He is the author of the forthcoming book from O’Reilly, “Getting Started with Impala”.

Tara Sainath
Tara Sainath (Google)

Tara Sainath received her PhD in Electrical Engineering and Computer Science from MIT in 2009. The main focus of her PhD work was in acoustic modeling for noise robust speech recognition. After her PhD,... Read More.

Noelle Saldana

Noelle Sio has a background in mathematics, statistics, and data mining with an emphasis on digital media. She is currently a Principal Data Scientist at Pivotal (formerly Greenplum). Her work has mainly focused on helping... Read More.

Eric Sammer
Eric Sammer (Rocana), @esammer

Eric Sammer is the CTO and co-founder of ScalingData. Prior to ScalingData, he was an engineering manager at Cloudera. His background is in the development and operations of distributed, highly concurrent, data ingest and... Read More.

Krishna Sankar
Krishna Sankar (U.S.Bank), @ksankar

Krishna Sankar is a Distinguished Engineer − Artificial Intelligence & Machine Learning at U.S. Bank focusing on augmented intelligence, digital human as well as areas like AI explainability. Earlier stints include Senior Data Scientist with... Read More.

Steve Sarsfield

Steve Sarsfield is an author and expert in data quality and data governance. His book “The Data Governance Imperative” is a comprehensive exploration of data governance from the business perspective. Steve draws practical wisdom and... Read More.

Nima Sarshar
Nima Sarshar (inPowered), @nimilinimo

As inPowered’s CTO, Nima leads the development of the core technologies at the heart of inPowered, and oversees all aspects of engineering and product development. Once a tenured professor with 50+ peer-reviewed publications, Nima... Read More.

Bill Schmarzo
Bill Schmarzo (EMC Consulting), @schmarzo

Bill Schmarzo, author of “Big Data: Understanding How Data Powers Big Business”, is responsible for setting the strategy and defining the Big Data service line offerings and capabilities for the EMC Global Services Enterprise... Read More.

Eric Schmidt
Eric Schmidt (Google)

Eric Schmidt is the product management lead for Cloud Dataflow on the Cloud engineering team at Google, where his primary role is to help shape the future of fully managed, large-scale data processing. Eric spends... Read More.

Patrick Schots (Intel)

Patrick Schots has been working in the IT industry since 2000 and with Intel since 2000. Previously, before joining Intel, Patrick worked at Alcatel and Lucent Technology where he had different Engineering functions. Since 2007... Read More.

Jim Scott
Jim Scott (NVIDIA), @kingmesal

Jim Scott is the head of developer relations, data science, at NVIDIA. He’s passionate about building combined big data and blockchain solutions. Over his career, Jim has held positions running operations, engineering, architecture, and... Read More.

Shawn Scully
Shawn Scully (Dato)

Shawn is the VP of Customer Success & Applications at Dato where he helps make it easy to build cool experiences with data. He is data geeky and loves inspired technologies, businesses, and gadgets. His... Read More.

Jonathan Seidman

Jonathan is a Solutions Architect on the Partner Engineering team at Cloudera. Before joining Cloudera, he was a Lead Engineer on the Big Data team at Orbitz Worldwide, helping to build out the Hadoop clusters... Read More.

Carter Shanklin (Hortonworks)

Carter has a firm dislike of mandatory biography fields.

Gwen Shapira
Gwen Shapira (Confluent), @gwenshap

Gwen Shapira is a Solutions Architect at Cloudera and leader of IOUG Big Data SIG. Gwen Shapira studied computer science, statistics and operations research at the University of Tel Aviv, and then went... Read More.

Roman Shaposhnik
Roman Shaposhnik (Pivotal Inc.)

Roman Shaposhnik is a Director of Open Source @Pivotal. He is a member of Apache Software Foundation, committer on Apache Hadoop, founder of Apache Bigtop and, as of late, a man behind Open Data Platform... Read More.

Vin Sharma
Vin Sharma (Intel)

At Intel, Vin Sharma is responsible for strategic ecosystem initiatives driving adoption of end-to-end analytics solutions based on Intel data center platforms. In this role, Vin spearheads technical and marketing engagements partners working with open... Read More.

Clint Sharp
Clint Sharp (Splunk)

Clint Sharp is Director of Product Management overseeing all research and development efforts for Splunk’s Big Data & IT product initiatives. Prior to this role, Clint spent 15 years deploying and running complex IT infrastructures... Read More.

Adam Silberstein
Adam Silberstein (Trifacta)

Adam Silberstein is a lead software engineer at Trifacta. His main area of interest is large-scale data processing, including in the batch processing and online serving spaces. His work has appeared in top database venues... Read More.

Sumeet Singh

Sumeet Singh is a Senior Director of Products at Yahoo responsible for platforms product management and customer engagements. In this role, he also leads the Hadoop products team responsible for both Apache open source contributions... Read More.

Joseph Sirosh
Connected Cows? Keynote

Joseph Sirosh is the corporate vice president of the Information Management and Machine Learning (IMML) team in the Cloud and Enterprise group at Microsoft Corp. Sirosh and his IMML team have shipped public... Read More.

Ram Shankar Siva Kumar
Ram Shankar Siva Kumar (Microsoft (Azure Security Data Science))

Ram Shankar is a security data wrangler in Azure Security Data Science, where he works on the intersection of ML and security. Ram’s work at Microsoft includes a slew of patents in the large intrusion... Read More.

Adam Smith
Adam Smith (Automated Insights)

Adam Smith is the chief operating officer at Automated Insights, where he is responsible for all areas of Automated Insights business, including the Wordsmith platform, new products, and professional service implementations. In addition to running... Read More.

Terence Spies
Terence Spies (Voltage Security), @tspyz

Terence Spies has over 19 years of security and systems software development experience, working with leading companies such as Microsoft, Asta Networks and others. He is frequently quoted by business and technology press on today’s... Read More.

Coe Leta Stafford

Coe Leta is a Design Director at IDEO, Palo Alto where she leads Design Research – IDEO’s human-centered approach of understanding people’s needs, developing insights, and connecting those insights to innovative design solutions for... Read More.

Doug Stein
Doug Stein (metacog, Inc.), @douglas_stein

Doug has been working on creating and scaling technology-enabled personalized learning for the better part of the past 20 years. The tools didn’t really exist in 1994 when he established an interactive software simulation business... Read More.

Rick Stellwagen (Think Big, a Teradata Company)

Rick Stellwagen is director for Think Big’s Data Lake program. After years of building highly available MPP relational database systems, he was an Enterprise and Solution architect in Professional Services. Most recently, he served... Read More.

Michael Stonebraker

Michael Stonebraker is an adjunct professor at MIT CSAIL and a database pioneer who has been involved with Postgres, SciDB, Vertica, VoltDB, Tamr and other database companies. He co-authored the paper “Data... Read More.

Vijay Subramanian
Vijay Subramanian (Rent the Runway), @vjsubr

Vijay analyzes big data to drive business decisions across many facets of Rent the Runway – marketing, operations, product and technology, inventory buying and planning, and customer service. He is credited with building the backend... Read More.

Jagane Sundar
Jagane Sundar (WANdisco)

Jagane Sundar has extensive big data, cloud, virtualization, and networking experience and joined WANdisco through its acquisition of AltoStor, a Hadoop-as-a-Service platform company. Before AltoStor, Jagane was founder and CEO of AltoScale, a Hadoop... Read More.

India Swearingen
India Swearingen (United Way of the Bay Area)

India’s 5+ years of industry experience focuses on optimizing data and evaluation to enhance social change, such as community and housing redevelopment and educational achievement programs/initiatives. As Director of Evaluation + Insight at United Way,... Read More.

Doug Talbott
Doug Talbott (Bedarra Research Labs)

Doug specializes in the research and development of user interface designs and advanced visualization solutions for complex systems.

He has over 25 years of experience in producing and/or managing the development of award-winning content, visual... Read More.

Cathy Tanimura
Cathy Tanimura (Strava)

Cathy has been working with data in one form or another for the past 15 years. She is passionate about bringing data into the culture of startup companies. She has built and run analytics organizations... Read More.

Daniel Templeton

Daniel works in the Cloudera training team building Cloudera’s developer and data science Cloudera Certified Professional certifications. Daniel also has a long history as a software engineer in the high performance computing space and has... Read More.

Andy Terrel
Andy Terrel (NumFOCUS), @aterrel
PyData Ask Us Anything Ask Us Anything
PyData at Strata Tutorial

Andy Terrel is president of NumFOCUS. He is also the chief data scientist of REX Real Estate, where he brings his experience building smart, scalable data systems to the real estate industry. A data... Read More.

Anu Tewary
Anu Tewary (Intuit)

Anuranjita Tewary is Director of Product Management at Intuit. She was a founder at Level Up Analytics, which was acquired by Intuit. Her previous roles have been data scientist at LinkedIn, and product management at... Read More.

Thiruvel Thirumoolan

Thiruvel Thirumoolan is a developer in the Hive and HCatalog team at Yahoo!. In this role he is responsible for deployment of Hive, HiveServer2 and HCatalog across all the Hadoop clusters at Yahoo! and ensuring... Read More.

James Thompson (Palantir Technologies), @astrojams1

Hello, I am James. I’m 29, live in San Francisco, and work in tech. I grew up pretty modestly in small towns in FL and WV, read a lot of books on astrophysics and human... Read More.

Kathleen Ting

Kathleen Ting is a technical account manager at Cloudera where she helps strategic customers deploy and use the Apache Hadoop ecosystem in production. She’s a frequent conference speaker, has contributed to several projects in the... Read More.

Reena Tiwari
Reena Tiwari (Cisco Systems Inc.), @retiwari
Find the Business in Your Data Data-Driven Business Day

Reena Tiwari is an IT leader who delivers technology solutions to solve complex business problems. At Cisco, she is leading a Marketing IT team that is responsible for Data, Analytics, Lead Management, Marketing Operations Automation.... Read More.

Anirudh Todi
Anirudh Todi (Twitter Inc.), @anirudhtodi

Anirudh Todi – Senior Software Engineer at Twitter

At Twitter, Anirudh works on the Data Platform team. Anirudh and his team are chartered with processing and understanding the vast body of data that is generated... Read More.

Omer Trajman
Omer Trajman (ScalingData), @otrajman

Omer brings fifteen years of front-line business and technical experience, having been responsible for some of today’s largest modern database management system deployments. He most recently served as Vice President of Field Operations at WibiData,... Read More.

Catherine Truxillo

Catherine Truxillo, Ph.D. has been with the advanced analytics education team at SAS since 2000 and has written or co-written SAS training courses for advanced statistical methods including: multivariate statistics, linear and generalized... Read More.

Douglas Turnbull
Douglas Turnbull (OpenSource Connections), @softwaredoug

Doug has been engrossed in programming since his parents first bought him an Apple IIe computer in 4th grade. Throughout his early career, Doug proved his flexibility and ingenuity in crafting solutions in a variety... Read More.

Brian Ulicny
Brian Ulicny (Thomson Reuters ), @bulicny

Brian Ulicny is a founding member of the new Data Innovation Lab recently stood up by Thomson Reuters in Boston. The Data Innovation Lab will partner with internal teams, customers and third parties, such as... Read More.

PyData Ask Us Anything Ask Us Anything
PyData at Strata Tutorial

Stéfan van der Walt is a senior lecturer in applied mathematics at
Stellenbosch University, South Africa, and an associate project
scientist in the astronomy department at UC Berkeley. He has been
involved... Read More.

Shankar Vedaraman
Shankar Vedaraman (Netflix)

Shankar Vedaraman leads the data engineering org for Growth, Business operations, and Infrastructure at Netflix. His team is responsible for engineering and managing data models and services that enable data consumption for a variety of... Read More.

sunil venkayala

Sunil Venkayala, Senior Technical Product Manager at HP Vertica in Cambridge, Mass. He leads the Distributed R open-source technology initiative and advanced analytics features of the HP Vertica platform. Prior to joining HP, he was... Read More.

Anand Venugopal
Anand Venugopal (Impetus Technologies Inc.)

Anand Venugopal has been working with Fortune 1000 Enterprises to deliver real business benefits and ROI from Big Data Solutions since 5 plus years at Impetus. And before that since 1995 – has been... Read More.

Dean Wampler

Dean Wampler is the Big Data Architect for Typesafe. He builds scalable, distributed, “Big Data” applications using the Typesafe Reactive Platform, Spark, Mesos, and other tools. He is the author of Programming... Read More.

LianHui Wang

LianHui Wang is a Software Engineer from Tencent’s TEG Big Data Department. He is also a contributor to Spark, Hadoop, and Hive. He has been instrumental in Tencent’s Hadoop and Spark applications, with focus... Read More.

Peter Wang
Peter Wang (Anaconda), @pwang
PyData Ask Us Anything Ask Us Anything
PyData at Strata Tutorial

Peter Wang is the cofounder and CTO of Anaconda, where he leads the product engineering team for the Anaconda platform and open source projects including Bokeh and Blaze. Peter’s been developing commercial scientific computing... Read More.

Tech Expert Sponsor – Evangelizing IT Tech Careers at Chevron

Chip Wells
Chip Wells (SAS)

Chip Wells has over 15 years of experience in implementing theoretical and applied econometrics using the SAS programming language and SAS Solutions. He is a Statistical Services Specialist in the SAS Education... Read More.

Patrick Wendell
Patrick Wendell (Databricks)

Patrick Wendell is an engineer at Databricks as well as a Spark
Committer and PMC member. In the Spark project, Patrick has acted as
release manager for several Spark releases, including Spark... Read More.

John Myles White
John Myles White (Facebook)

John Myles White is one of the main developers of statistical libraries for
Julia. He currently works at Facebook, where he is a member of the Core Data
Science team. John is also... Read More.

Tom White
Tom White (Cloudera)

Tom White has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. He is the author of “Hadoop: The Definitive Guide” for O’Reilly. Previously he worked as... Read More.

Tom White
Tom White (Cloudera), @tom_e_white

Tom White is one of the foremost experts on Hadoop. He has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. His book Hadoop: The Definitive Guide... Read More.

Paula Wiles Sigmon

Paula Wiles Sigmon is Program Director for marketing of the InfoSphere Information Integration and Governance portfolio at IBM.

She has held various marketing management positions at IBM, leading teams responsible for demand-generation programs,... Read More.

Cack Wilhelm
Cack Wilhelm (Scale Venture Partners)

Cack Wilhelm is a principal at Scale Venture Partners, where she focuses on investments in early-stage software companies, with an eye toward those helping businesses better utilize data, automate workflows, incorporate AI, and build more... Read More.

Richard Williamson
Richard Williamson (Silicon Valley Data Science)

Richard has been at the cutting edge of big data since its inception, leading multiple efforts to build multi-petabyte Hadoop platforms, maximizing business value by combining data science with big data. He has extensive experience... Read More.

Rafał Wojdyła

Rafal is an engineer at Spotify, a member of Hadoop squad responsible for operating, maintaing and growing one of the biggest Hadoop cluster in Europe. He is a core committer to snakebite – super... Read More.

Terry  Woodfield

Dr. Woodfield is a Statistical Services Specialist in the Education Division at SAS.He provides training and mentoring services in the areas of statistical forecasting, predictive modeling, and data mining.Terry was Chief Statistician at Read More.

Bing Xiao (Huawei)

As Senior Director of Product Management and Strategy at US R&D Center of Huawei, Bing oversees new product R&D, open-source initiatives, and leads a global growing team to drive vertical big data-centric solutions. Prior to... Read More.

Reynold Xin
Reynold Xin (Databricks)
Spark Camp: Ask Us Anything Ask Us Anything

Reynold Xin is a committer on Apache Spark. He is also a co-founder of Databricks. Before Databricks, he was pursuing a PhD in the UC Berkeley AMPLab.

Fangjin Yang
Fangjin Yang (Imply)

Fangjin is one of the main committers to the open source Druid project and one first developers at Metamarkets, a San Francisco based data startup. Fangjin previously worked on diagnostic optimization algorithms at Cisco Systems.... Read More.

Sungwook Yoon
Sungwook Yoon (MapR)

Sungwook is a Data Scientist at MapR. Sungwook’s data experience includes malware detection algorithms for packet stream analysis, mobile network signaling analysis, social network analysis, job title analysis as well as call center data analysis.... Read More.

Reza Zadeh
Reza Zadeh (Matroid | Stanford), @Reza_Zadeh
Spark Camp: Ask Us Anything Ask Us Anything

Consulting professor at Stanford within ICME, conducting research and teaching courses targeting doctorate students. Technical Advisor at Databricks. I focus on Discrete Applied Mathematics, Machine Learning Theory and Applications, and Large-Scale Distributed Computing.

Matei Zaharia
Matei Zaharia (Databricks)
Spark Camp: Ask Us Anything Ask Us Anything

Matei Zaharia is an assistant professor of computer science at MIT, and the initial creator of Apache Spark. He is currently on industry leave to start Databricks, a company commercializing Spark, where he is... Read More.

Philip Zeyliger
Philip Zeyliger (Cloudera)

Philip Zeyliger is a Software Engineer at Cloudera. He came to Cloudera from Google, where he worked on scalable storage for user-facing applications. Before that, he worked in finance, at D.E. Shaw. Philip holds a... Read More.

Xuefu Zhang
Xuefu Zhang (Cloudera)

Xuefu Zhang has over 10 year’s experience in software development. Working for Cloudera since May 2013, he spends a lot of his efforts on Apache Hive and Pig. He also worked in the Hadoop team... Read More.

Alice Zheng

Alice Zheng is a senior manager of applied science on the machine learning optimization team on Amazon’s advertising platform. She specializes in research and development of machine learning methods, tools, and applications. She’s the author... Read More.

Wei Zheng
Wei Zheng (Trifacta)

As VP of Products, Wei combines her passion for technology with experience in Enterprise Software to define and shape Trifacta’s product offerings. Having founded several startups of her own, Wei believes strongly in innovative technology... Read More.


Adriana Zubiri is a Program Director in the Information Management Big Data group at the IBM Toronto Software Lab. In her current role, Zubiri is responsible for driving the software development execution for Big... Read More.

Monte Zweben
Monte Zweben (Splice Machine Inc.), @mzweben

Monte Zweben is the CEO and co-founder of Splice Machine, provider of the only Hadoop RDBMS. A SQL-on-Hadoop solution, Splice Machine has helped many companies scale real-time applications using commodity hardware without... Read More.