Strata + Hadoop World brings together the leading data scientists, inventors, analysts and innovators from around the globe to share what’s happening at the forefront of big data.
M. Teresa Serrano Agujetas is the IT and Ops innovation director at the Santander Group. She was previously the CTO for the group for five years. For the last seven years Maite has been... Read More.
Julia Angwin is an award-winning investigative journalist at the independent news organization ProPublica.
From 2000 to 2013 she was a reporter at The Wall Street Journal, where she led a privacy investigative team that was... Read More.
Carme Artigas is the founder and CEO of Synergic Partners, a strategic and technological consulting firm specializing in big data and data science (acquired by Telefónica in 2015). She has more than 20 years... Read More.
Emre is Qubit’s co-founder and CTO. Qubit is a provider of an integrated web personalization, a/b testing, audience segmentation, and digital analytics platform. Prior to starting Qubit, he was a senior product manager at... Read More.
Zbigniew Baranowski is a database systems specialist and a member of a group which provides and supports central database services at CERN.
Christopher Batey recently joined DataStax as a technical evangelist for Apache Cassandra. Previously he worked as a senior software engineer at BSkyB, where he spent his time designing and developing their next generation, Cassandra-backed platform... Read More.
As the Business Enablement Lead for the Data Insights group at Accenture Technology Labs, Hallie is focused on making analytics and data science more accessible to internal and client leadership. She also runs the Accenture-University... Read More.
Francine Bennett is a data scientist, and the CEO and cofounder of Mastodon C. Mastodon C are agile big data specialists who offer open source Hadoop-powered technology and the technical and analytical skills which... Read More.
Ryan Blue is a software engineer at Cloudera, currently working on the Kite SDK team.
Joerg Blumtritt is founder and CEO of Datarella, a computational social science startup that delivers mobile analytics, self-tracking solutions, and data science consulting.
After graduating from university with a thesis on machine learning, Joerg... Read More.
Claudiu Branzan is a senior engineering lead at Atigeo, leading a team of data scientists and software engineers who tackle complex challenges in machine learning, data mining, information retrieval, and statistics. Claudiu has over 10... Read More.
Mikio Braun is co-founder of streamdrill, a startup focused on approximative approaches for real-time big data, and post-doc researcher at TU Berlin, Germany. He holds a Ph.D. in Machine Learning and has worked in research... Read More.
Andrew Brookes is the CTO of ASI Data Science. He is a computer scientist with 10+ years of professional experience, including leading an engineering team building BlackRock’s global portfolio management systems. His expertise is... Read More.
Oana Calugar is the head of Customer Support for AliveShoes, focused on extracting and using customer support insights to create better products. AliveShoes is the world’s first independent shoemaking community that helps people design and... Read More.
Elena is a research scientist at the Wellcome Trust Sanger Institute – Genome Campus, Cambridge. She holds a PhD in Machine Learning and Data Mining, and over the past six years she has been actively... Read More.
Yanpei Chen is a software engineer at Cloudera, working on the Performance Engineering team. He regularly participates in competitive performance “bake-offs” that directly drive customer purchasing decisions. His work touches upon Cloudera Search, Impala, Apache... Read More.
Shahar Cohen is a data scientist and product visionary at Intel. Currently, he helps in building a vision for the Intel and Michael J. Fox Foundation joint venture for enabling breakthroughs in research on Parkinson’s... Read More.
Matt is a Senior Interaction Designer and Project Lead at IDEO in London. IDEO is an award-winning global design firm that takes a human-centred, design-based approach to helping organisations in the public and... Read More.
Valerie is a UX strategist and former innovation catalyst at Schibsted Media Group. For the past several years, she has trained teams around the world to question assumptions, develop empathy with customers and users, and... Read More.
Doug Cutting is the chief architect at Cloudera and the founder of numerous successful open source projects, including Lucene, Nutch, Avro, and Hadoop. Doug joined Cloudera from Yahoo, where he was a key member of... Read More.
Sudeep Das is a data scientist with a passion for turning data into meaningful insights and stories. He believes that powerful visualizations provide key entry points into understanding data. Very often, Sudeep finds himself hand-rolling... Read More.
Paul Davies is responsible for leading Cisco’s Big Data Solutions portfolio across EMEAR, engaging with Customers, Partners and ISVs on Big Data and Analytics opportunities. Paul joined Cisco’s Data Center team in 2009, after... Read More.
Mark Dijksman advises large organizations about innovations like big data, the Internet of Things, and the possibilities of blockchain. He is also the founder and creative director of BigData.Company.
Ellie Dobson works for Pivotal as a data scientist. She spent most of her early life in Northumberland planning to be a musician but did a rather unexpected U-turn at the age of 18, and... Read More.
Director of WW Sales enablement for the Intel AI portfolio.
Tamara Dull is the director of emerging technologies for SAS Best Practices, a thought leadership organization at SAS Institute. Through engaging publications, rich media, and industry engagements, she delivers a pragmatic perspective on... Read More.
Ted Dunning is chief application architect at MapR Technologies. He’s also a board member for the Apache Software Foundation, a PMC member and committer of the Apache Mahout, Apache Zookeeper, and Apache Drill projects,... Read More.
Joey Echeverria is the director of engineering at Rocana, where he builds applications for scaling IT operations built on the Apache Hadoop platform. Joey is a committer on the Kite SDK, an Apache-licensed data... Read More.
I am responsible at Canonical, the company behind Ubuntu, for bringing new disruptive products to market in the Cloud and Big Data space.
Simon is a solutions engineer at Hortonworks, where he helps clients do Hadoop. He is a certified Spark and Hadoop developer. Previously he has worked in the data intensive worlds of hedge funds and financial... Read More.
Stephan Ewen is one of the originators and committers of the Apache Flink project, and is a CTO at a Berlin-based startup where he leads the effort to create a novel distributed system for... Read More.
As co-founder of Think Big, Rick brings 20 years of experience in scaling global services organizations. He’s responsible for Think Big’s international business. Previously Rick directed a global division within Sun Microsystems. In 2009 he led... Read More.
Sameer Farooqui is a client services engineer at Databricks, where he works with customers on Apache Spark deployments. Sameer works with the Hadoop ecosystem, Cassandra, Couchbase, and general NoSQL domain. Prior to Databricks, he worked... Read More.
Phil is programme director for the myBBC programme, which is charged with transforming the BBC’s relationship with its audience. The programme has three broad objectives:
Christine Flounders is Regional Manager for London R&D at Bloomberg L.P, responsible for a team of more than 450 technologists working to architect, build and deploy software, digital platforms and mobile applications for Bloomberg customers... Read More.
Trained as a scientist, Andrew has worked with data all his career in both business and academia, for organisations including Microsoft Research, Barclays Capital, Cambridge University, and Royal Bank of Scotland. Import•io is his second... Read More.
Christine Foster is currently the VP of data science at ShopKeep. She started her career as a generalist strategy consultant with Bain & Company, doing sums and averages and PowerPoint. Christine worked on data mashups... Read More.
Jason heads up big data, analytics, and marketing solutions at Marks & Spencer. He was brought on board to bring step change to the business in how it uses data and innovative technology to drive... Read More.
Ellen Friedman is a solutions consultant, scientist, and author, currently writing about a variety of open source and big data topics, including co-authoring Mahout in Action (Manning), the Practical Machine Learning series from O’Reilly, and... Read More.
Alan is a co-founder at Hortonworks and an original member of the engineering team that took Pig from a Yahoo! Labs research project to a successful Apache open source project. Alan also designed HCatalog and... Read More.
Joseph George is the executive director of Server Big Data Strategy and Density Optimized Servers at HP, and is responsible for driving a broad range of big data solutions across the HP Server portfolio. He also... Read More.
An IT workload automation expert, Tom has served as a Control-M product manager for more than 10 years. He was responsible for translating the workload automation market requirements gathered from customers and analysts into product... Read More.
Colin Gillespie is a statistician and an associate professor at Newcastle University, UK, where he works on computational statistics, big data problems, and scalable Bayesian inference. He has taught courses on R for over ten... Read More.
Garrett is the editor-in-chief of shiny.rstudio.com, the development center for the Shiny R package, and is the author of Hands-On Programming with R as well as Data Science with R, a forthcoming book by O’Reilly... Read More.
Mark Grover is a committer on Apache Bigtop, a committer and PMC member on Apache Sentry (incubating), and a contributor to Apache Hadoop, Apache Hive, Apache Sqoop, and Apache Flume. He is currently co-authoring... Read More.
Carlos is the CEO of GraphLab, and the Amazon Professor of Machine Learning in Computer Science and Engineering at the University of Washington. A world-recognized leader in the field of machine learning, Carlos was... Read More.
Sebastian Gutierrez is a data entrepreneur who has founded three data-related companies: DataYou (data science & data visualization consulting and education), LetsWombat (data-driven product sampling), and Acheevmo (athletic performance statistics). He was formerly an emerging... Read More.
Mike leads the Emerging Products & Technology team at Autodesk where they identify, evaluate, and develop disruptive technologies that improve the practice of imagining, designing, and creating a better world. His team combines research, development,... Read More.
Co-creator and PMC member of Apache Kylin, Sr. Product Manager of eBay.
Luke Han joined eBay in late 2011 as staff BI architect of the Business Intelligence Platform Team. He is Sr. Product Manager... Read More.
Jo is global co-head of the Enterprise Platforms group at Goldman Sachs. Enterprise Platforms is an engineering team – responsible for a broad remit – data architecture, JVM engineering, workflow, runtime management and development... Read More.
Tim is an economist, journalist, and broadcaster. He is author of the million-selling “The Undercover Economist”, a senior columnist at the Financial Times, and the presenter of Radio 4’s “More or Less”. Tim has spoken... Read More.
CTO and founding member of DataShaka.
Big beardy geek.
Scott leads the strategic solutions marketing for big data, core and security products at Informatica. As a part of the big data product team, he grew Informatica’s big data partner ecosystem. Prior to Informatica, Scott... Read More.
Jeremy Heffner works with crime data to model patterns and forecast risk; the intersection of geography, data science, and social good.
The elements he works with every day can include geographic data, raster processing, predictive... Read More.
Felipe Hoffa joined Google in 2011 as a software engineer. As a member of the Google Cloud Platform team, he works with external developers to build applications on Google’s big data platforms.
A technology leader, Francis created the original TortoiseCVS, which has improved version control for tens of millions of people. He was a founder of TheyWorkForYou and WhatDoTheyKnow, which show the world how to use scraping... Read More.
Jeroen is a lead data scientist at Elsevier in Amsterdam. He has an M.Sc. in artificial intelligence and a Ph.D. in machine learning. Jeroen has authored a book titled “Data Science at the Command Line”,... Read More.
David Jonker is EVP and a founder of Oculus. He is a visual analytics designer and technical architect with 20 years of experience. David is interested in the visual elegance of information and the underlying... Read More.
Aaron Kimball is the CTO of Zymergen, Inc. Zymergen uses high-throughput techniques, combined with big data analysis, to improve genetic strains for microbial chemical production. Aaron has been working with Hadoop since 2007. In... Read More.
Benedikt Köhler studied sociology, anthropology and psychology in Munich, where he received his PhD in 2006. After founding a mobile web start-up in the late 1990s, he worked as a consultant for Internet and media... Read More.
Marcel Kornacker is the architect and tech lead at Cloudera for Impala. Prior to Cloudera, Marcel worked at Google on several ad-serving and storage infrastructure projects. He eventually became the tech lead for the distributed... Read More.
Anirudh Koul is a data scientist at Microsoft. He brings eight years of applied research experience on petabyte-scale social media datasets including Facebook, Twitter, Yahoo Answers, Quora, Foursquare, and Bing. He has worked on a... Read More.
Gerhard Kreß is part of Siemens Mobility Customer Service, responsible for data driven services. His aim is to strengthen the use of data analytics to enable new customer offerings.
Before that he was in Siemens... Read More.
Adi Krishnan is the senior product manager for Amazon Kinesis, a fully managed service for real-time processing of streaming data at massive scale. Adi loves spending time with customers and partners, to define products that... Read More.
Dileep Kumar works in the Performance Engineering team at Cloudera. He holds an M.S. from Santa Clara University and has more than 15 years of experience in performance engineering for SQL systems. He is... Read More.
Scott Kurth is VP, Advisory Services, at Silicon Valley Data Science. Building on 20 years of experience making emerging technologies relevant to enterprises, Scott crafts vision and strategy for organizations. With a background in architecture... Read More.
JeongMin is a data scientist. JeongMin finished an M.S. in Industrial Engineering and has worked as a data scientist and database engineer in industry. Her current interests focus on anything to do with data analysis... Read More.
Mathieu has worked with Informatica for 10 years and brings extensive experience, having previously worked with Accenture for 5 years. As an IPS consultant Mathieu focused mainly on data integration technology and has implemented... Read More.
Divanny Lamas is Vice President of Product Management for Context Relevant, where she leads product strategy and direction for Context Relevant’s automated predictive analytics solutions for banking, insurance and financial institutions globally.
Prior to her... Read More.
Software Engineer with 30+ years of experience developing DBMS software. S.M., S.B. Computer Science, MIT.
Scott Langevin is a director and research scientist at Oculus and has over 12 years of industry and academic experience. He holds a PhD in computer science with a background in machine learning. Scott’s research... Read More.
Costin Leau is an engineer at Elasticsearch, currently working with NoSQL and big data technologies. An open-source veteran, Costin led various Spring projects (Spring OSGi, GemFire, Redis, Hadoop) and authored an OSGi spec. Speaker at... Read More.
Ben is a co-founder and the CTO of Ambiata, a startup focused on creating products that allow organisations to take a more scientific and automated approach to business. At Ambiata he has led the... Read More.
Cory Levinson is a data analyst and experimental musician living in Berlin, Germany. He has been working at SoundCloud since 2011, focusing on analytics for creator product experiences. He holds a BSc in mathematics from... Read More.
Rui Li acquired a master’s degree in computing science from Fudan University in 2013. He is now a software engineer at Intel and a committer for Apache Hive. He is also a contributor to Apache... Read More.
Yang Li is the tech lead for Apache Kylin. He joined eBay-Shanghai in January 2014 as a member of the technical staff, and has been a key developer and architect of the Kylin OLAP... Read More.
Angie is co-founder and COO of ASI, a London-based startup that offers bespoke data science and engineering training and internships for partner companies. The training combines the best of academic and startup cultures.... Read More.
Roger Magoulas is the research director at O’Reilly Media and chair of the Strata + Hadoop World conferences. Roger and his team build the analysis infrastructure and provide analytic services and insights on technology-adoption trends... Read More.
Ted has worked on close to 60 clusters for two to three dozen clients, covering hundreds of use cases. He has 18 years of professional experience working for startups, the US government, a number... Read More.
Gareth has been working at HP on innovative solutions in the Analytics and Data Management space for four years and took over the ownership of the Big Data Analytics portfolio for EMEA in 2013.... Read More.
Neil Martin is a senior project manager at comparethemarket.com, part of the BGL Group in Peterborough, England. He has a 17-year career in project management across the financial services and utilities industries, having spent... Read More.
As a professor and health data scientist at Washington University School of Medicine, Leslie is in a unique position – she has access to vast amounts of health data, connections, and interest in working with... Read More.
Oscar Méndez is co-founder and CEO of Paradigma Tecnológico and Stratio. Paradigma is a software solutions company with clients, mostly enterprise and large Internet companies, in Spain. Stratio uses the best of breed of... Read More.
Julie Meyer is a leading investor and entrepreneur in digital, high-growth, early stage businesses around the world. She has driven many continent-wide initiatives for creating wealth and growth in the European economy. Julie believes that... Read More.
John Miller is the managing director of the Data Insights R&D group at Accenture’s Technology Lab. He is responsible for overseeing the group’s most innovative research and innovation, and building relationships with clients and research... Read More.
Michael Minella is a software engineer, teacher and author with over a decade of enterprise development experience. Michael was a member of the expert group for JSR-352 (java batch processing). He currently works for... Read More.
Working as a big data architect at Stratio, David Morales has been involved in the inception and evolution of some modules included in the Stratio platform, especially those related to data visualization, real-time, streaming, and... Read More.
Jacques Nadeau is VP of Apache Drill with the Apache Software Foundation and drives MapR’s development of Apache Drill. He is an industry veteran with over 15 years of big data and analytics experience. Most... Read More.
Max Neunhöffer is a mathematician turned database developer. In his academic career he has worked for 16 years on the development and implementation of new algorithms in computer algebra, mainly for the open source system... Read More.
Professor of Electronics and Computer Science
Gilles Noisette is a Master Solution Architect at the HP EMEA Solution Innovation center. He is the technical lead of the HP EMEA Big Data Center of Excellence, promoting HP Big Data solutions,... Read More.
Cory is a product manager on Google Cloud Platform’s storage team in Mountain View, focused on releasing Google’s techno-wizardry on an unsuspecting world. Before Google, Cory worked in cybersecurity, designing the machine learning systems supporting... Read More.
Cait joined Shazam in November 2013 as VP of product, music, and platforms. She is responsible for their hugely successful mobile and web products as well as the music roadmap. Cait joined Shazam from the... Read More.
Sean is director of data science for EMEA at Cloudera. Previously, Sean founded Myrrix Ltd, producing a real-time recommender and clustering product evolved from Mahout. Myrrix is now part of Cloudera. Sean was a... Read More.
Dimitris is responsible for the Big Data Analytics ISV go to market in EMEA and provides the company thought leadership and direction to the set of software partnerships, industry practices and horizontal capabilities... Read More.
Phill Radley is a physics graduate with an MBA who has worked in IT and the communications industry for 30 years, mostly with British Telecommunications plc. He is Chief Data Architect for BT at... Read More.
Jai is the director of product strategy at Cloudera where he is responsible for planning the future roadmap of Cloudera products. Before Cloudera he spent a decade at VMware, where among other things he was... Read More.
David is CEO, president and co-founder of WANdisco, and has quickly established WANdisco as one of the world’s most promising technology companies.
Since co-founding the company in Silicon Valley in 2005, David has led... Read More.
Sean Roberts has a passion for helping others be successful with their data & systems. As EMEA Partner Solutions Engineer at Hortonworks this focus is largely on Hadoop & partner solutions. His career began... Read More.
Aengus has been involved in all aspects of data management systems in the financial services arena for 15 years. Originally starting with online programming for OLTP systems, his career moved into the MPP
Duncan has been a data miner since the mid 1990s. He now leads Teradata’s Data Science team in Europe and Asia.
At Teradata he has been responsible for developing analytical solutions across a number of... Read More.
Brett Rudenstein has an extensive background in Application Lifecycle Management, High Performance Computing and Open Source Software Analysis. He has held senior sales engineering and management positions at Rational Software, PureAtria, IBM, Appistry and... Read More.
Dr. Frank Säuberlich is director of advanced analytics in the Teradata International Data Science team. His focus is on demand creation across the EMEA and APJ regions. He previously worked at Urban Science International... Read More.
Anjali is the co-founder of the consulting arm of ASI, a London-based data science training and consulting company. ASI works with partners across many industries delivering projects in an agile and collaborative way.... Read More.
Mark Samson is a systems engineer at Cloudera.
Business analyst, business developer with a strong analytical mind. Majken has been working with IT, management information, analytics, BI, and DW for 20+ years. Keen on everything data, math, and ‘data driven’ as a management... Read More.
Kevin built up the data science and engineering team at Mind Candy, and with the team created a scalable architecture for mobile game analytics. Before Mind Candy, Kevin headed the data science and back-end services... Read More.
Jim has held positions running operations, engineering, architecture, and QA teams. Jim is the cofounder of the Chicago Hadoop Users Group (CHUG), where he has coordinated the Chicago Hadoop community for the past four... Read More.
Head of EMC Consulting Big Data Practice for UKI & TEEAM. Wearer of wearables. A major fanboy of Big Data and the benefits that it can bring business and individuals.
Jonathan is a solutions architect on the Partner Engineering team at Cloudera. Before joining Cloudera, he was a lead engineer on the Big Data team at Orbitz Worldwide, helping to build out the Hadoop clusters... Read More.
Vinod Shankar leads the Big Data Center of Excellence at Capgemini. He focuses on conceptualizing big data propositions and building great teams to deliver them. Vinod’s goal is to help organizations leverage their data investments... Read More.
Gwen Shapira is a solutions architect at Cloudera and leader of the IOUG Big Data SIG. Gwen studied computer science, statistics, and operations research at the University of Tel Aviv, and then went... Read More.
Richard Shaw is a Software Architect for MapR, introducing companies to the wonders of Hadoop and helping them understand how it can help their business. With a background in DevOps and NoSQL, Richard has held... Read More.
Nathan Shetterley is a senior manager for Accenture’s research into emerging data architectures, analytics, and visualizations. He leads multiple teams of researchers and developers who are building data-driven analytical solutions based on the next generation... Read More.
Siim Sikkut serves as ICT Policy Adviser in the Government Office of Estonia. His role is to coordinate ICT policy planning and execution across the government, and to advise the Prime Minister on e-governance matters and... Read More.
Shashank is a software engineer at Microsoft. Wearing several caps over the past decade, he has been building production pipelines for large scale data processing. Previously, he served as a project lead at HCL... Read More.
Rod Smith is an IBM fellow and vice president of the IBM Emerging Internet Technologies organization, where he leads a group of highly technical innovators who are developing solutions to help organizations realize... Read More.
Yashowardhan is the solutions lead for the Big Data Center of Excellence at Capgemini. Organizations are disrupted by the data explosion in the digital universe and are looking at creating breakthrough opportunities by monetizing these... Read More.
Alessandra Staglianò is a data scientist who has worked on multiple complex projects. In addition to various machine-learning techniques, Alessandra’s expertise is in extracting relevant information from noisy and redundant data. Her former research work... Read More.
Yodit Stanton is CEO and founder of OpenSensors.io, an Internet of Things startup, that makes data from devices easily accessible and reusable.
Yodit is a software developer with a special interest in machine learning,... Read More.
Julie thinks in metaphors and finds beauty in the clear communication of ideas. She is particularly drawn to visual media as a way to understand and transmit information, and is co-author of Beautiful Visualization (O’Reilly... Read More.
David Talby is Atigeo’s senior vice president of engineering, leading the R&D, product management, and operations teams. David has extensive experience in building and operating web-scale analytics and business platforms, as well as building world-class,... Read More.
Ankit Tharwani is proposition manager, Information Business, Personal and Corporate Banking at Barclays Bank PLC.
As senior director of the Analytical Platform Centre of Excellence, Mark Torr leads a distributed team of domain experts in the fields of data management, analytics, reporting, and enterprise architecture.
Mark supports customers in multiple... Read More.
Lars Trieloff is director of product management at Blue Yonder, one of the leading European companies providing platforms for predictive applications. He is responsible for product roadmap, strategy, and marketing. Prior to Blue Yonder, Lars... Read More.
Luis is senior data engineer at Mind Candy, was the first to introduce Spark Streaming at the company, and is responsible for the real-time mobile analytics platform. He has more than 10 years of experience... Read More.
Kai is a senior instructor for Hadoop classes at Cloudera, delivering training classes for developers and administrators worldwide. Before joining Cloudera, Kai had the same role at MySQL/Sun/Oracle, and spoke at various O’Reilly conferences.
Andrew is a software engineer on the HDFS team at Cloudera. Previously, he was a graduate student in the AMPLab at the University of California, Berkeley advised by Prof. Ion Stoica, where he worked... Read More.
Simon Wardley is a researcher for the Leading Edge Forum focused on the intersection of IT strategy and new technologies. Simon is a seasoned executive who has spent the last 15 years defining future IT... Read More.
Marc Warner is the CEO of ASI. Previously, Marc held a research fellowship in physics at Harvard University, where he studied quantum metrology and quantum computing. His PhD research, in the field of... Read More.
Noel has over 15 years of experience in software architecture and development, and over a decade in machine learning and data mining. His current project is Myna, which makes bandit algorithms accessible to all. Previous projects... Read More.
Patrick Wendell is a cofounder of Databricks and committer and PMC member of Apache Spark. He is the release manager of Spark’s 1.0, 1.1, and 1.2 releases. Before helping start Databricks, Patrick was a... Read More.
Tom White has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. He is the author of Hadoop: The Definitive Guide for O’Reilly. Previously he worked as... Read More.
Edd Dumbill is a technology analyst, writer, and entrepreneur based in California. He’s helping drive businesses with data as VP strategy for Silicon Valley Data Science.
Edd was the founding program chair for
Ken is a former systems architect with Hortonworks who has over 15 years of experience working with clients to implement distributed and scalable solutions to meet business needs. He is a Data Science Fellow of the... Read More.
Ann has extensive experience as a UX and UI designer for serious games and virtual worlds, as well as mobile apps. She has served as community and editorial manager for history brand Heritage Key. She... Read More.
Xuefu Zhang has over 10 years of experience in software development. Working for Cloudera since May 2013, he spends much of his effort on Apache Hive and Pig. Prior to joining Cloudera, Xuefu Zhang served... Read More.
Alice is the director of data science at GraphLab, a Seattle-based startup that offers powerful large-scale machine learning and graph analytics tools. She loves playing with data and enabling others to play with data. She... Read More.
Shivon Zilis is a venture capitalist and founding member of Bloomberg Beta. She focuses on early stage data and machine intelligence investments. She recently released a report on the current state of machine intelligence, where... Read More.
©2015, O’Reilly UK Ltd • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners.
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.