New speakers are added continuously. Please check back to see the latest updates to the program.
Mike Abbott joined Kleiner Perkins Caufield & Byers in 2011 and focuses on investments in the firm’s digital practice, helping entrepreneurs in the social, mobile and cloud computing sectors rapidly scale teams and ventures. Mike... Read More.
Joseph Adler has many years of experience in data mining and data analysis at companies including DoubleClick, VeriSign, and LinkedIn. He graduated from MIT with an B.Sc. and M.Eng in Computer Science and Electrical... Read More.
With over 15 years in advanced analytical applications and architecture, John is dedicated to helping organizations become more data-driven. He combines deep expertise in analytics and data science with business acumen and dynamic engineering leadership.... Read More.
Nitesh Ambastha is the Global Head of Data IT, Private Banking & Wealth Management Products at Credit Suisse. He is responsible for spearheading and creating a multi-year roadmap to design the future state Data Platform... Read More.
Anima Anandkumar is a principal scientist at Amazon Web Services. Anima is currently on leave from UC Irvine, where she is an associate professor. Her research interests are in the areas of large-scale machine learning,... Read More.
Jesse is a Creative Engineer with many years of experience in creating products and helping companies improve their software engineering. He strives to provide developers with the resources to learn new technologies and improve their... Read More.
Steve Anderson has been working in the IT industry since 1993 and with Intel since 2000. He has an engineering focus which has been on server and application consolidation then onto general virtualization and
June Andrews is an applied mathematician specializing in social network analysis. She has worked on the Search Algorithm at Yelp and designed algorithms for computing the structure of large networks with Professor John Hopcroft. Currently,... Read More.
Lead Data Sciences Engineer at Sumo Logic and co-organizer of SF Bay Area Machine Learning meetup group.
Michael Armbrust is the lead developer of the Spark SQL project at Databricks. Michael’s interests broadly include distributed systems, large-scale structured storage, and query optimization. Michael holds a PhD from UC Berkeley, where his... Read More.
Matt Asay has been involved with open source since 1998, and is one of the industry’s leading open source business strategists. Asay is a regular columnist for ReadWrite, TechRepublic and InfoWorld. Asay is vice president... Read More.
Rosie Atkins serves at the Director of Product for Breadcrumb POS, Groupon’s point of sale product that primarily serves restaurants, bars and cafes. She has held the position since September, 2014. She leads product... Read More.
Amr Awadallah is cofounder and CTO of Cloudera. Previously, Amr was an entrepreneur in residence at Accel Partners and vice president of engineering at Yahoo, where he led a team that used Apache Hadoop... Read More.
Josh spent six years as a software engineer building infrastructure components at AT&T before discovering the world of ‘Big Data’ in a class at NYU by O’Reilly author Foster Provost. He ‘joined the band’... Read More.
Prith Banerjee is executive vice president and chief technology officer at Schneider Electric, as well as a member of the executive committee, which reports to the chairman and CEO. In this role, Prith is... Read More.
Nenshad Bardoliwalla is an executive and thought leader with a proven track record of success leading product strategy, product management, and development in business analytics. He is the co-author of Driven to Perform: Risk-Aware Performance... Read More.
Results oriented technology executive and recognized thought leader with more than 25 years of experience and demonstrated success assisting large, global entities in driving organizational change through the leveraging of Information. Proven track record of... Read More.
Steven is the Technical Project Officer for all software applications facilitating the cross-domain data exploitation within the belgian Ministry of Defence.
Goutham, a Principal Data Integration and Reporting Practice Leader for Capgemini, leads a global team of over ~300 practitioners responsible for the entire information lifecycle. This starts with the strategy / vision phase continuing to... Read More.
Danielle Ben-Gera is Principal Architect at Quid, where she leads the backend team, managing the algorithms stack, as well as the data acquisition and processing pipeline. Over the past three and a half years, she... Read More.
Aaron is a Data Scientist with Accenture.
Bill Schmarzo, author of “Big Data: Understanding How Data Powers Big Business”, is responsible for setting the strategy and defining the Big Data service line offerings and capabilities for the EMC Global Services Enterprise... Read More.
Ryan Blue is a software engineer at Cloudera, currently working on the Kite SDK team.
Joerg Blumtritt is the founder and CEO of Datarella, a computational social science startup delivering mobile analytics, self-tracking solutions, and data science consulting. After graduating from university with a thesis on machine learning, Joerg... Read More.
Ron Bodkin is CTO Architecture and Services for Teradata. Ron is responsible for leading the global emerging technology team focusing on Artificial Intelligence, GPU and Blockchain. Responsible for leading global consulting teams for... Read More.
Irina is an Applied Researcher at eBay, working on machine translation training and evaluation. Irina holds two Masters degrees in natural language processing and neurolinguistics.
Vinayak Borkar is the CTO of X15 Software, Inc. Previously, he was a PhD candidate at UC Irvine, where he worked on big data and contributed to the Hyracks Open Source Big Data Project.... Read More.
Dr. Kirk Borne is a Transdisciplinary Data Scientist and an Astrophysicist. He is Professor of Astrophysics and Computational Science in the George Mason University School of Physics, Astronomy, and Computational Sciences. He has been at... Read More.
Dave Brewster is the Co-Founder and CTO at Paxata. He is a serial entrepreneur and seasoned enterprise software technology leader with more than 20 years experience in successfully architecting and delivering scalable technology platforms.... Read More.
Kurt leads the Data Platform team at Netflix. His group architects and manages the technical infrastructure underpinning the company’s analytics. The Netflix data infrastructure includes various Big Data technologies (e.g. Hadoop, Hive, and Pig), Netflix... Read More.
Michael Brown was a founding member of comScore, Inc. in 1999. He leads the technology efforts of the company to measure Internet and Digital activities. In this position, he helped the company build the world’s... Read More.
Josh Byrd is the Manager of Data Architecture at GoPro working within the Data Science & Engineering team. Prior to GoPro, Josh led global supply chain operations analytics efforts at Apple . His work focuses... Read More.
Alonzo Canada leads developing new ventures with a particular focus on product strategy and UX development. He mashes up business strategy and human-centered design to craft product vision and execute against it. He uses strategy... Read More.
John F. Canny is a computer scientist and the Paul and Stacy Jacobs Distinguished Professor of Engineering in the Computer Science Department of the University of California, Berkeley. John has made significant contributions in various... Read More.
John Carnahan is the EVP of Data Science at Ticketmaster.
With a strong background in a wide variety of development roles, I’ve now moved to help developers get the most from DataSift’s products.
Òscar Celma is currently Director of Research at Pandora, where he leads a team of scientists to provide the best personalized radio experience.
From 2011 till 2014 Òscar was Senior Research Scientist at Gracenote.... Read More.
As a managing director at Accenture Analytics, now part of Accenture Digital, Arnab Chakraborty speaks analytics fluently. He serves as the Global Lead for Industry Analytics in Accenture’s advanced analytics practice and is also responsible... Read More.
Michele Chambers is an entrepreneurial executive with 25 years of technology experience, and is the President and COO at RapidMiner, which offers a predictive analytics platform. At RapidMiner, she is responsible for marketing, products... Read More.
Winston is a software engineer at RStudio, and holds a Ph.D. in Psychology from Northwestern University. He is a developer for the ggplot2, devtools, shiny, and ggvis packages, and is the author of R Graphics... Read More.
The idea for Captricity came from Kuang’s PhD dissertation at UC Berkeley. His research focused on data-centric approaches to increase the efficiency of low-resource organizations, so they can better serve their disadvantaged clients. While doing... Read More.
Yanpei Chen is a Software Engineer at Cloudera, working on the Performance Engineering team. He regularly participates in competitive performance “bake-offs” that directly drive customer purchasing decisions. His work touches upon Cloudera Search, Impala, Apache... Read More.
Darren Chinen is the Head of Data Science and Engineering at GoPro. He has extensive experience working with all types of “extreme data” having previously led the Big Data and analytics efforts at Apple, Peet’s... Read More.
Alan Choi is a software engineer at Cloudera working on the Impala project. Before joining Cloudera, he worked at Greenplum on the Greenplum-Hadoop integration. Prior to that, Alan worked extensively on PL/SQL and
Jike Chong heads Data Science at Simply Hired, the most comprehensive job search engine that indexes over 10M jobs everyday, attracting more than 30 million monthly unique visitors, and serving hundreds of thousands of employers... Read More.
Miklos Christine is a solutions engineer for Databricks. Miklos was previously a system engineer at Cloudera where he helped strategic customers deploy and use the Apache Hadoop ecosystem in production. He has contributed to several... Read More.
Woody Christy has been lucky enough to be working in distributed systems his entire career. He led system designs and deployments for Video On Demand systems that scaled out to millions of end users. He... Read More.
Cliff Click is the CTO and Co-Founder of 0xdata, a firm dedicated to creating a new way to think about web-scale math and real-time analytics. I wrote my first compiler when I was 15... Read More.
Christopher Colburn is just another data scientist at Netflix.
Stewart Collis is the Chief Technical Officer of AWhere Inc. He has over 15 years’ experience in all phases of design, development and management of software development projects for desktop and web applications. He has... Read More.
Eric Colson is the Chief Algorithms Officer at Stitch Fix, where he specializes in consumer algorithms. He is also an advisor at Big Data incubator Data Elite, and Big Data Platform provider Mortar Data. Previously,... Read More.
Mike Conover builds machine learning technologies that leverage the behavior and relationships of hundreds of millions of people. A senior data scientist at LinkedIn, Mike has a Ph.D. in complex systems analysis with a focus... Read More.
I am one of those ex-Physicists, currently the Program Director for Growth at Insight Data Science. Insight helps PhD’s transition from academia into new careers in industry.
A mathematician and seasoned technology executive, George Corugedo has over 20 years of business and technical expertise. As co-founder and CTO of RedPoint Global, George is responsible for leading the development of the RedPoint... Read More.
Daniel Crankshaw is a second year PhD student working in the UC Berkeley AMPLab with Michael Franklin. Dan’s research focuses on how ideas in distributed database systems can be applied to machine learning and data... Read More.
Poppy Crum leads the Science Group at Dolby Laboratories and is a Consulting Professor at Stanford University in the Center for Computer Research in Music and Acoustics and the Program in Symbolic Systems. At Dolby,... Read More.
Nick Curcuru has been delivering analytics solutions for nearly 20 years in operations and consulting. He is currently principal of the big data analytics practice at MasterCard Advisors, where he works with the executive suite... Read More.
Doug Cutting is the chief architect at Cloudera and the founder of numerous successful open source projects, including Lucene, Nutch, Avro, and Hadoop. Doug joined Cloudera from Yahoo, where he was a key member of... Read More.
Michelangelo D’Agostino is a Senior Data Scientist with Civis Analytics, where he works on statistical models and writes software for data analysis.
As a reformed particle physicist turned data scientist, Michelangelo loves mungeable datasets, machine... Read More.
Jason Dai is currently a Senior Principal Engineer and CTO, Big Data Technologies, at Intel. Prior to that, he was a principle architect in Microsoft, responsible for building the large-scale cloud and big data... Read More.
Kaushik Das is an expert at applying mathematical models to solve business problems. He has more than 10 years of experience designing and deploying analytical software, working for enterprise software companies such as Rapt, Demandtec... Read More.
Tathagata Das is a Apache Spark Committer and a member of the PMC. He is the lead developer of behind Spark Streaming, and currently employed at Databricks. Earlier, he has spent in the AMPLab... Read More.
Prior to joining Amplify as a general partner, Mike Dauber spent over six years at Battery Ventures, where he led early-stage enterprise investments on the West Coast, including Battery’s investment in a stealth security company... Read More.
Gary Davis is Chief Consumer Security Evangelist. Through a consumer lens, he works closely with internal teams to drive strategic alignment of products with the needs of the security space. Gary also oversees McAfee online... Read More.
Allen Day – Principal Data Scientist, MapR Technologies
Allen is the Principal Data Scientist at MapR Technologies, where he leads interdisciplinary teams to deliver results in fast-paced, high-pressure environments across several industry verticals. Previously,... Read More.
Marco DiPlacido is Principal software engineer on the O365 Security Signals team collaborating with researchers and cloud service owners to build intrusion detection systems for cloud scale.
Jonathan Dinu is the Co-founder and CTO of Zipfian Academy, an advanced training program for data scientists and data engineers in San Francisco. His background is in Computer Science and Physics at University of... Read More.
David is the Software Development and Engineering Manager for the Cloud Big Data Platform team at Rackspace. He and his team develop and operate this cloud service which is focused on delivering the Hadoop platform... Read More.
Sheetal is a Principal Architect working with Hortonworks with strong expertise in Hadoop ecosystem and rich field experience. He helps small to large enterprises solve their business problems strategically, functionally as well as at scale... Read More.
Scott is Senior Director at FINRA’s Market Regulation Technology. Scott leads the data and analytics teams responsible for the surveillance of U.S. equities, options and fixed income markets.
Chris DuBois is a data scientist focused on building tools for other data scientists. At Dato, he has helped design and implement tools for creating recommendation systems as well as large-scale text analysis. His current... Read More.
Ted Dunning has been involved with a number of startups with the latest being MapR Technologies where he is Chief Application Architect working on advanced Hadoop-related technologies. He is also a PMC member for... Read More.
Joey Echeverria is the director of engineering at Rocana, where he builds applications for scaling IT operations built on the Apache Hadoop platform. Joey is a committer on the Kite SDK, an Apache-licensed data... Read More.
Jeremy Edberg, the CEO and Founder of MinOps, which makes using the cloud stupid easy. He is an angel investor and advisor for various incubators and startups. Previously, Jeremy was the founding reliability engineer... Read More.
Alexei (Alyosha) Efros joined UC Berkeley in 2013 as associate professor of electrical engineering and computer science. Prior to that, Alyosha spent nine years on the faculty of Carnegie Mellon University. He has also been... Read More.
Daniel Eklund is a software architect and technologist with over 18 years of experience in enterprise software development. As the first employee and Engineering Practice Manager for Think Big Analytics, Daniel has worked with many... Read More.
Ozgun is the CTO and one of the co-founders of Citus Data. Before founding Citus, Ozgun worked as a software developer in the Distributed Systems Engineering team at Amazon. There, he proposed, designed, and... Read More.
Sameer Farooqui is a client services engineer at Databricks, where he works with customers on Apache Spark deployments. Sameer works with the Hadoop ecosystem, Cassandra, Couchbase, and general NoSQL domain. Prior to Databricks, he worked... Read More.
Yuliya Feldman is a Principal Software Engineer at MapR. Since joining MapR, Yuliya has worked on a number of products and features starting from MapR admin infrastructure, Map/Reduce framework and most recently on YARN.
Laura Fennell is senior vice president, general counsel and secretary, leading Intuit’s legal, corporate affairs, information and physical security, privacy, and data services teams.
Before joining Intuit in April 2004, Fennell served as Sun Microsystems’... Read More.
Bruno Fernandez-Ruiz is a Yahoo Senior Fellow and VP of Personalization Platforms, overseeing the development and delivery of Yahoo’s personalization technology, which Bruno’s teams use to harvest deep user insights in order to deliver a... Read More.
Bob Filbin is chief data scientist at Crisis Text Line, the first large-scale 24/7 national crisis line for teens on the medium they use and trust most: texting. Bob specializes in the application of behavioral... Read More.
Shai Fine is a Principal Engineer at the Advanced Analytics group in Intel, focusing on Machine Learning, Business Intelligence, and Big Data. Prior to Intel, Shai worked for the IBM Research Lab in Haifa,... Read More.
Lutz Finger, a director at LinkedIn, is an authority on social media and text analytics. He’s also co-founder and former CEO of Fisheye Analytics, a media data-mining company whose products support governments and various... Read More.
Danyel Fisher is a Senior Researcher in information visualization and human-computer interaction at Microsoft Research’s VIBE group. His research focuses on ways to help users interact with data more easily. His recent work has... Read More.
As Senior Director and General Manager for Cisco’s Data and Analytics Group (IBTG), Mike Flannagan guides Cisco’s Big Data and Analytics business strategies and execution.
Since joining Cisco in 2000, Mike has... Read More.
Jonathan is a full time IPython developer who primarily works on the IPython notebook front-end. In his spare time Jonathan enjoys developing a Python based video game engine and Poster, an open source HTML5 canvas... Read More.
37 years enterprise sales expereince
Dr. Freeman is head of Security Data Science at LinkedIn, where he leads a team charged with detecting and preventing fraud and abuse across the LinkedIn site and ecosystem. He has a Ph.D. in mathematics... Read More.
Chris Fregly is a research scientist at PipelineIO, a San Francisco-based streaming machine learning and artificial intelligence startup. Previously, Chris was a distributed systems engineer at Netflix, a data solutions engineer at Databricks, and a... Read More.
Eric Frenkiel co-founded MemSQL and has served as CEO since inception. Before MemSQL, Eric worked at Facebook on partnership development. He has worked in various engineering and sales engineering capacities at both consumer and... Read More.
Ellen Friedman is a solutions consultant, scientist and author, currently writing about a variety of open source and big data topics including being co-author of Mahout in Action (Manning), the Practical Machine Learning series from... Read More.
Before joining Canaan, Ross was a partner at seed-stage... Read More.
Ajit is the Chief Security Architect at VISA, the worlds largest payment network, which processed $4.5 trillion last year. Areas of expertise include data security, cryptography, and mobile security. He held other senior roles... Read More.
Anil Gadre is the SVP of Product Management at MapR. Prior to MapR, Anil was the EVP of Product Management at Silver Spring Networks, responsible for product strategy, planning and marketing of networking... Read More.
Eddie Garcia is chief security architect at Cloudera, a leader in enterprise analytic data management, where he draws on his more than 20 years of information and data security experience to help Cloudera enterprise customers... Read More.
Alan is a co-founder at Hortonworks and an original member of the engineering team that took Pig from a Yahoo! Labs research project to a successful Apache open source project. Alan also designed HCatalog and... Read More.
A software/systems engineer with a lot of experience building big, real-world systems.
Jonathan is Director of Data Science and Analytics at Intuit. He co-founded Level Up Analytics, a premier data science consulting company focused on data science, big data, and analytics which Intuit acquired in 2013. From... Read More.
Greg Goldsmith is the Chief Product Officer at Attivio. He is responsible for product strategy, product management, and research & development for Attivio’s Enterprise Search and Big Data Discovery & Dexterity businesses. Greg is the... Read More.
Floris Grandvarlet current responsibility at Cisco is Head of Unified Computing in DCV EMEAR Tech Ops. After the merge of European Theater with Emerging Market Theater, he is now leading a team of... Read More.
Brian Granger is an Associate Professor of Physics at Cal Poly State
University in San Luis Obispo, CA. He has a background in theoretical physics, with a Ph.D from the University of Colorado. His... Read More.
Alexander Gray is CTO at Skytree and Associate Professor in the College of Computing at Georgia Tech. His work has focused on algorithmic techniques for making machine learning tractable on massive datasets. He began... Read More.
Scott Gray is a senior architect for IBM’s InfoSphere BigInsights Big SQL solution. Gray has an extensive career in the computer industry focusing heavily on relational database, architecture, design, optimization, and internals. Prior to... Read More.
Michael Greene is vice president of the Software and Services Group and general manager of System Technologies and Optimization at Intel Corporation. Greene is responsible for delivering software and solutions that enable Intel and its... Read More.
Garrett Grolemund is the editor-in-chief of shiny.rstudio.com, the development center for the Shiny R package, and is the author of Hands-On Programming with R as well as Data Science with R, a forthcoming book by... Read More.
Robert Grossman is a faculty member and the Chief Research Informatics Officer in the Biological Sciences Division of the University of Chicago. He is the Director of the Center for Data Intensive Science and a... Read More.
Mark Grover is a committer on Apache Bigtop, a committer and PMC member on Apache Sentry (incubating) and a contributor to
Apache Hadoop, Apache Hive, Apache Spark, Apache Pig, Apache Sqoop and Apache... Read More.
Randy Guck is a Principal Engineer at Dell Software, focused on big data solutions for commercial software applications. He has developed software for over 30 years, building specialized databases including semantic, hybrid object/relational, and NoSQL.... Read More.
Carlos Guestrin is the director of machine learning at Apple and the Amazon Professor of Machine Learning in Computer Science and Engineering at the University of Washington. Carlos was the cofounder and CEO of... Read More.
Gupta runs an R&D group at Google Research focused on designing efficient and transparent statistical learning algorithms. From 2003-2012, she was a professor of electrical engineering at the Univ. of Washington. Gupta received the
Vida is currently a Solutions Engineer at Databricks. In her past, she worked on scaling Square’s Reporting Analytics System. She first began working with distributed computing at Google – where she improved search rankings of... Read More.
John Haddad is Senior Director of Big Data Product Marketing at Informatica Corporation. He has over 25 years’ experience developing and marketing enterprise applications. Today, he advises organizations on Big Data best practices from a... Read More.
Lisa Hammitt is a senior software executive with 25 years of industry experience. Most recently, as vice president of marketing of Salesforce Community Cloud, she is spearheading strategy and is charting out industry-led use cases... Read More.
Ben Hamner is Kaggle’s Chief Scientist and is responsible for the technical side of the business. He is currently focused on applying machine learning to the energy industry, and has previously worked with machine learning... Read More.
Jeff is Trifacta’s Chief Experience Officer and a Professor of Computer Science at the University of Washington, where he directs the Interactive Data Lab. Jeff’s passion is the design of novel user interfaces for exploring,... Read More.
I work with crime data to model patterns and forecast risk; the intersection of geography, data science, and social good.
Keywords: geographic data, raster processing, predictive analysis, spacetime event modeling, weather, demographics, machine learning, early... Read More.
Joseph M. Hellerstein is a Chief Strategy Officer at Trifacta and Chancellor’s Professor of Computer Science at UC Berkeley. His work focuses on data-centric systems and the way they drive computing. He is an
Spencer is a Data Scientist with Accenture.
Craig Hibbeler is principal for big data and security within MasterCard Advisors’ Enterprise Information Management consultancy practice. In his role, Craig leverages practical hands-on experience and broad industry and platform knowledge to develop, execute, secure,... Read More.
Dave Holtz is a data scientist at Airbnb focusing on online reputation and pricing. Previously, he worked as a data science engineer at Yub (acquired by Coupons.com) and as a data scientist and Product Manager... Read More.
Applied biostatistician with research interests in missing data methods, statistical computing, and statistical education
Solomon Hsiang combines data with mathematical models to understand how society and the environment influence one another. In particular, he focuses on how policy can encourage economic development while managing the global climate. His research... Read More.
Leah Hunter writes about the human side of tech for Fast Company, the Guardian, and O’Reilly. She is authoring two upcoming books—one on augmented reality from O’Reilly and the other on the future in five... Read More.
Kurt Hurtado is a Logstash developer based in Los Altos, CA. He has been working with Elasticsearch and Logstash for many years and thrives on building excellent architectures based on the ELK stack for... Read More.
Alysa Z. Hutnik is a partner in the Advertising & Marketing and Privacy & Information Security practices at Kelley Drye & Warren LLP in Washington, D.C. Her practice includes representing clients in all forms... Read More.
Matt is an experienced web architect with a software development background. He has deep expertise in building, scaling and operating global-scale Java, Ruby on Rails and AMP web applications. He has been a contributor... Read More.
Arif joined Lightspeed in 2008 and focuses primarily on investments in the areas of cloud and datacenter technologies, as well as enterprise mobile and SaaS solutions.
Arif is currently working closely with the teams at... Read More.
Dr. Anant Jhingran (PhD Berkeley) joined Apigee from IBM where he was VP and CTO for IBM’s Information Management Division and Co-Chair of IBM wide Cloud Computing Architecture Board. He was responsible... Read More.
Annika is a seasoned leader of analytics initiatives, and came from Pivotal where she built the “Data Science Dream Team” – an industry-leading group of Data Scientists – representing a rich combination of vertical domain... Read More.
Ann Johnson is cofounder and CEO of Interana, the experts in event data analytics, where she has created a community of all-star talent working to make data-informed decisions a natural extension of everyone’s workflow.... Read More.
Anne Johnson is the Head of PBWM Products Risk Technology at Credit Suisse. Her SPEAR+ Investment Risk Technology platform has propelled Credit Suisse into a market leadership position of transparently managing and monitoring the... Read More.
Bobby is co-founder of Interana, where he is building the next generation of tools for analyzing massive amounts of data in real time.
Bobby was Director of Engineering at Facebook where he led the infrastructure... Read More.
Michael I. Jordan is the Pehong Chen Distinguished Professor in the Department of Electrical Engineering and Computer Science and the Department of Statistics at the University of California, Berkeley. His research interests bridge the computational,... Read More.
My passion is helping our clients build management processes and cultures where they are identifying, analyzing and driving their business opportunities through personalized management and analytic solutions. I’ve authored over 10 books on technical and... Read More.
Karthik Kambatla is a Software Engineer at Cloudera in the scheduling and resource management team. He works primarily on MapReduce and YARN, and he is a Committer to Apache Hadoop. He is also a... Read More.
Holden Karau is a Software Development Engineer at Databricks and is active in open source. She the author of a book on Spark and has assisted with Spark workshops. Prior to Databricks she worked on... Read More.
I am an engineer who prides himself on building reliable, scalable infrastructure.
I specialise in maintaining large system infrastructure as demonstrated by work at LinkedIn (applications) and at The University of Queensland (networks). I possess... Read More.
Kyle Kelley was a software developer at Rackspace and a core developer of the IPython/Jupyter project. He wants to help build great environments for collaborative analysis, development, and production workloads for everyone; from small teams... Read More.
Dr. Eamonn Keogh is a professor of computer science at the University of California Riverside, specializing in data mining. He is a highly prolific researcher, as of 2015 he is one of only three people... Read More.
Tigran Khrimian is a Senior Director at FINRA responsible for the development of the organization’s Big Data ingestion and management platform, which processes billions of market order events on daily basis. He oversees technology... Read More.
Jonathan H. King is Vice President, Cloud Strategy and Business Development for CenturyLink Technology Solutions. In this role, Jonathan leads cloud strategy, business development, alliances, M&A and global go to market for CenturyLink. Prior to... Read More.
Jake Klamka is the founder of the Insight Data Science Fellows Program, a post-doctoral fellowship that helps quantitative PhDs transition from academia to careers in data science. Insight Fellows are now data scientists at top... Read More.
Jennifer Klay is an Associate Professor of Physics at Cal Poly San Luis Obispo. She has worked with big data at the CERN Large Hadron Collider’s ALICE experiment for 14 years, unlocking the... Read More.
James Kochuba is a Senior Solution Architecture on the SAS Enterprise Architecture team. He helps strategic SAS customers world-wide designing and building high performance architectures. Focus areas in the past several years have... Read More.
Adam is a Co-founder and CTO of Cloudant, and an IBM Distinguished Engineer. He is an Apache CouchDB developer, joining the project as one of the first ten committers, and the lead architect... Read More.
Marcel Kornacker is the architect and tech lead at Cloudera for Impala. Prior to Cloudera Marcel worked at Google, where he worked on several ads serving and storage infrastructure projects and eventually became the tech... Read More.
Dr. Rado Kotorov is vice president of Product Marketing for Information Builders and works both with the WebFOCUS and the iWay product divisions to provide thought leadership, analyze market and technology trends, aid in the... Read More.
Currently serves as VP of Marketing Strategy at Hortonworks. Previous positions include Director at Red Hat and VP at Cloudera.
John holds a B.S. in computer science from The University of Texas at Austin.
Jay is one of the primary architects for LinkedIn where he focuses on data infrastructure and data-driven products.
He was among the original authors of a number of open source projects in the scalable data... Read More.
Chris was Co-Founder and CEO of ObjectRocket before merging with Rackspace in February 2013. He now leads ObjectRocket by Rackspace in their new digs in downtown Austin. He has 20+ years of experience building... Read More.
Philip Langdale is the Engineering Lead for Cloud at Cloudera. He joined the company in 2010 as one of the first engineers building Cloudera Manager, and served as an engineering lead for that project until... Read More.
Sasha is the founding data scientist and engineer at Polynumeral, a data science consultancy in New York City. She helps clients solve hard data problems and design their data strategy, including the World Bank, New... Read More.
Sylvain joined Havas Media in 2011 to lead the company’s big data efforts. Recognized for his expertise in online marketing and data driven systems, Sylvain develops and implements client solutions for Artemis, Havas Media’s proprietary... Read More.
Julien Le Dem is a Data Systems Engineer at Twitter. Previously he was a Principal Engineer at Yahoo. He contributes to a number of Hadoop-related projects including HCatalog and he’s a PMC member on... Read More.
Costin Leau is an engineer at Elasticsearch, where he leads big data efforts. An open source veteran, Costin led various Spring projects (Spring OSGi, GemFire, Redis, Hadoop) and authored an OSGi spec. He has spoken... Read More.
Denny Lee is a Principal Program Manager at Microsoft. He is a hands-on distributed systems and data sciences engineer with more than 15 years of experience developing internet-scale infrastructure, data platforms, and distributed systems for... Read More.
Cornelia Lévy-Bencheton is a communications strategy consultant and writer whose data-driven marketing and decision support work helps companies optimize their performance.
As Principal of CLB Strategic Consulting, LLC., her focus is on the... Read More.
ChengXiang Li is a software engineer from Intel SSG Big Data Technology team, he is dedicated to enable and improve SQL interfaces in Hadoop ecosystem, and optimize SQL engine performance with IA... Read More.
Dr. Fei-Fei Li is an Associate Professor in the Computer Science Department at Stanford, and the Director of the Stanford Artificial Intelligence Lab and the Stanford Vision Lab. Her research areas are in machine learning, computer... Read More.
As Director of UX Design at New Relic, Etan Lightstone oversees a team of talented designers, leads the user experience design strategy, and on occasion gets the opportunity to contribute to the product codebase. Etan... Read More.
Lucian is director of Data Engineering at Intuit, leading a big data platform and large-scale real time data services group in the US and the EU. Previously, he founded Level Up Analytics, a premier big... Read More.
Bill Loconzolo is the Vice President of Data Engineering for product analytics across Intuit. His team is focused on creating and scaling data products that leverage the collective data of 50 million customers to provide... Read More.
A.J. Loiacono is a co-founder of Truveris. In his current role as the Chief Innovation Officer, his responsibilities include product development, strategic planning and enterprise partnerships. A.J. is a serial entrepreneur and has fifteen years... Read More.
Ben Lorica is the chief data scientist at O’Reilly Media. Ben has applied business intelligence, data mining, machine learning, and statistical analysis in a variety of settings, including direct marketing, consumer and market research, targeted... Read More.
Yucheng Low is a co-founder and Chief Architect of GraphLab Inc. He led the development of the SFrames and SGraphs scalable datastructures underpinning the GraphLab Create Product. He completed his PhD in Machine Learning in... Read More.
Brandon MacKenzie is the Data Science on Hadoop leader on IBM’s Worldwide Technical Sales team for Information Management Software. Brandon is an expert on statistical processing in Hadoop and HPC environments. Brandon earned his... Read More.
Mark Madsen is a research analyst at Third Nature, where he advises companies on data strategy and technology planning. Mark has designed analysis, data collection, and data management infrastructure for companies worldwide. He focuses on... Read More.
Roger Magoulas is the research director at O’Reilly Media and chair of the Strata + Hadoop World conferences. Roger and his team build the analysis infrastructure and provide analytic services and insights on technology-adoption trends... Read More.
Oliver has been working at SAP since 1990, first in Germany, then since 1995 in Palo Alto, California. Oliver worked in various positions, in User Experience design, in Technical Marketing and Knowledge Management, on... Read More.
Ted has worked on close to 60 Clusters over 2-3 dozen clients with over 100’s of use cases. He has 18 years of professional experience working for start-ups, the US government, a number of the... Read More.
Michal is a geek, developer, Java, Linux, programming languages enthusiast developing software for over 10 years.
He obtained PhD from the Charles University in Prague in 2012 and post-doc at Purdue University.
During his... Read More.
Eugene leads data science at Directly, a startup in San Francisco that helps companies scale excellent customer service by letting expert users help other users on demand. He builds augmented intelligence systems that allow humans... Read More.
Nasser Manesh has 25 years of experience in Unix, infrastructure, distributed systems, and backend operations, mostly in DevOps, team lead, and CTO roles. He has founded startups in consumer Internet, mobile, photography and art... Read More.
Tatsiana is a Data Science manager at Stitch Fix. Blending both industrial and academic research, Tatsiana is expert at solving hard business problems. She brings a background in both mathematics and statistics, and has deep... Read More.
Asim is a Senior Data Engineer in Machine Translation team at eBay. He has been at eBay for 7 years working in Data Analytics, Data Engineering and Business Intelligence.
Pankaj manages the growth strategy and sales for Acxiom’s digital products that are offered directly or through distribution partners. He also provides leadership on Acxiom’s SMB digital solutions strategy. Previously, Pankaj managed strategic enterprise... Read More.
Lauri Mazzuchetti is the managing partner of Kelley Drye’s Parsippany office. She represents consumer-facing businesses, including telecommunications carriers, in commercial litigation in federal and state courts, both at the trial and appellate levels. Ms. Mazzuchetti... Read More.
Arianna McClain is a design researcher – data specialist at IDEO. Arianna works at the intersection of technology, data, and human behavior. She leads hybrid research processes that merge quantitative (data) and qualitative (stories)... Read More.
Patrick McFadin is regarded as a foremost expert for Apache Cassandra and data modeling. As Chief Evangelist for Apache Cassandra and consultant working for DataStax, he has been involved in some of the biggest deployments... Read More.
Emma McGrattan is SVP of engineering at Actian, where she leads the Actian Vector, Actian Vector Hadoop Edition, and Actian Matrix development teams. A leading authority in DBMS technologies, Emma has over 20... Read More.
Data systems @ Cloudera. Formerly founder/CEO of DataPad (http://www.datapad.io). Author of “Python for Data Analysis” from O’Reilly Media. Created pandas project.
I have a background in theoretical physics, but now I do fun stuff with data. I enjoy numbers, coding, and learning new tools.
Eden Medina is Associate Professor of Informatics and Computing and Director of the Rob Kling Center for Social Informatics at Indiana University, Bloomington. Her research uses technology as a means to understand historical processes and... Read More.
Chad Meley is the Vice President of Product & Services at Teradata.
Prior to joining Teradata, he led Electronic Arts’ Data Platform organization that supported Financial Analysis, Game Development, Marketing Analysis and CRM. Chad... Read More.
Xiangxiang Meng is a Staff Scientist in the Data Science Technologies department at SAS. Xiangxiang received his PhD and MS from the University of Cincinnati. The current focus of his work is on the... Read More.
Miriah is a USTAR assistant professor in the School of Computing at the University of Utah and a faculty member in the Scientific Computing and Imaging Institute. Her research focuses on the design of... Read More.
With over 20 years experience in deploying mission critical systems, Justin Michaels industry experience covers capacity planning, architecture and industry vertical experience. Justin brings his passion for architecting, implementing and improving Couchbase to the community... Read More.
Ryan is a data scientist in Allstate’s Quantitative Research and Analytics department, where he uses big data to improve the customer experience.
Mostafa Mokhtar is a performance engineer at Cloudera. Previously, he held similar roles at Hortonworks and on the SQL Server team at Microsoft.
Andreas Mueller received his PhD in machine learning from the University of Bonn. After working as a machine learning researcher on computer vision applications at Amazon for a year, he recently joined the Center for... Read More.
Manu has a background in cloud computing and big data, handling billions of transactions per day in real time. He enjoys building and architecting scalable, highly available data solutions, and has extensive experience working in... Read More.
Aaron T. Myers is a Software Engineer at Cloudera and an Apache Hadoop Committer. Aaron’s work is primarily focused on HDFS. Prior to joining Cloudera, Aaron was a Software Engineer and VP of Engineering... Read More.
Jacques Nadeau is MapR’s lead developer on the Apache Drill open source project. He is an industry veteran with over 15 years of big data and analytics experience. Most recently, he was cofounder and
Neha Narkhede is the cofounder and head of engineering at Confluent, a company backing the popular Apache Kafka messaging system. Prior to founding Confluent, Neha led streams infrastructure at LinkedIn, where she was responsible for... Read More.
Chris Neumann is the CEO and Cofounder of DataHero, the leading platform for visualizing data from online services. Chris was previously the first employee at Big Data pioneer Aster Data Systems, where he helped... Read More.
Emi is a Senior Data Scientist at Jawbone. She completed her Ph.D. in Neuroscience at Northwestern University studying memory systems of the brain using functional neuroimaging. She went on to study neuroplasticity in patients with... Read More.
Cait joined Shazam in November 2013 as VP of Product, Music and Platforms. She is responsible for their hugely successful mobile and web products as well as the music roadmap.
Cait joined Shazam from... Read More.
A leading expert on big data architecture and Hadoop, Stephen brings over 20 years of experience creating scalable, high-availability, data and applications solutions. A veteran of WalmartLabs, Sun and Yahoo!, Stephen leads data architecture and... Read More.
Matt Ocko has three decades of experience as a technology entrepreneur and VC, in the U.S. and globally. His prior investments include Cotendo (AKAM), Zynga (ZNGA), Facebook (FB), XenSource (CTRX), UltraDNS (
Vadim Ogievetsky is a frontend developer at Metamarkets. Previously, he was part of the Data Visualization group at Stanford where he contributed to Protovis and D3.js. He is currently focused on Facet, a data visualization... Read More.
Travis Oliphant has a Ph.D. from the Mayo Clinic and B.S. and M.S. degrees in Mathematics and Electrical Engineering from Brigham Young University. Since 1997, he has worked extensively with Python for numerical and scientific... Read More.
Lance Olson is a Partner Group Program Manager responsible for HDInsight, Microsoft’s Hadoop-as-a-Service offering in Azure. Lance has worked extensively on database, business intelligence, and developer technologies for enterprise customers over the last 20 years.... Read More.
Jerry Overton is a data scientist, distinguished engineer, and head of advanced analytics research at CSC. Jerry is also the chief data scientist for Industrial Machine Learning (a strategic alliance between CSC and... Read More.
Andy Palmer is co-founder and CEO of Tamr, Inc., Palmer co-founded Tamr with fellow entrepreneur Michael Stonebraker, PhD, adjunct professor at MIT CSAIL; Ihab Ilyas, professor at the University of Waterloo; and... Read More.
Rahul Pathak runs the Amazon EMR and AWS Data Pipeline businesses for AWS. Amazon EMR is a web service for running frameworks like Hadoop, Spark, and Presto on managed clusters in... Read More.
DJ Patil is the chief data scientist and deputy chief technology officer for data policy at the White House Office of Science and Technology Policy, where he advises on policies and practices to maintain US... Read More.
Pamela Peele, Ph.D., is the Chief Analytics Officer of the UPMC Insurance Services Division. Dr. Peele brings 13 years of patient care experience along with 12 years of academic research experience to her position... Read More.
Dr Srinath Perera, is a Director of Research at WSO2 Inc., where he overlooks the overall WSO2 platform architecture with the CTO. He is a co-founder of Apache Axis2, a member
of... Read More.
Fernando Perez is a research scientist at the UC Berkeley Helen Wills
Neuroscience Institute, and a founding investigator of the Berkeley Institute
for Data Science, created in 2013. He received a PhD in... Read More.
Mike Polcari joined 23andMe in 2008 and works with a talented team of engineers to deliver access, understanding, and benefit from the human genome.
Mike is focused on architecture across 23andMe’s commercial products,... Read More.
Mr. Pollock is an expert data integration technology leader. He is currently Vice President of Product Management for the Oracle Data Integration business unit and previously responsible for all IBM Information Integration products. Prior... Read More.
Chris Pouliot is a real life rocket scientist, who has also spun astronauts until they were motion sick, split atoms to make an aircraft carrier go fast, provided insightful analysis that led to Google to... Read More.
Alexander is the project lead for big data analytics at Lufthansa Airlines and is involved in architecting key analytics transformation initiatives at Lufthansa. An expert in econometrics, Alexander taps into the power of statistical analysis... Read More.
Dr. Priyadarshy is the Chief Data Scientist at Halliburton. He is also the founder of ReIgnite Strategy, an advisory company to CXOs. Dr. Priyadarshy was the VP Data Science at Acxiom. Prior to that he... Read More.
Lisa is a data scientist at Airbnb, where she focuses on search and discovery. Prior to joining Airbnb, Lisa completed a PhD in Applied Physics at Stanford University. Outside of data science, Lisa enjoys playing... Read More.
Fintan is responsible for Kx sales engineering globally. An expert in developing database analytic systems, Fintan joined Kx in 2012 after having worked extensively with quantitative teams at a variety of Wall Street investment banks,... Read More.
Rashmi Raghu has extensive experience in executing complex analytics projects in multiple verticals. She is currently a Principal Data Scientist in the Pivotal Data Labs team at Pivotal with a focus on applications in the... Read More.
Srivatsan Ramanujam is a Principal Data Scientist at Pivotal where he executes Data Sciences labs for their customers, with a special focus on Text Analytics. He brings with him over 8 years of industry experience... Read More.
Jai is the Director of Product Strategy at Cloudera where he is responsible for planning the future roadmap of Cloudera products. Before Cloudera, he spent a decade at VMware, where amongst other things he was... Read More.
Tye Rattenbury is a lead Data Scientist at Trifacta. Tye holds a PhD in Computer Science from UC Berkeley. Prior to joining Trifacta, he was a Data Scientist at Facebook and the Director of Data... Read More.
Christopher (Chris) Re is an assistant professor in the Department of Computer Science at Stanford University. The goal of his work is to enable users and developers to build applications that more deeply understand and... Read More.
Ben Recht is an associate professor in the Department of Electrical Engineering and Computer Sciences and the Department of Statistics at the University of California, Berkeley. Ben’s research focuses on scalable computational tools for large-scale... Read More.
Azarias is the first ever Chief Data Officer of the Republican Party. He earned his PhD in computer science from the University of Michigan, and previously founded Meritful – an enterprise software startup based in... Read More.
Matthew Rocklin is an open source software developer focusing on efficient computation and parallel computing, primarily within the Python ecosystem. He has contributed to many of the PyData libraries and today works on Dask, a... Read More.
Julie Rodriguez is a Boston-based Information Architect with experience in user research, analysis and design for complex systems. Within the global markets domain, Julie has delivered pioneering solutions in such areas as wealth management, investment... Read More.
John Russell is a software developer and technical writer, and he’s currently the documentation lead for Impala. He is the author of the forthcoming book from O’Reilly, “Getting Started with Impala”.
Tara Sainath received her PhD in Electrical Engineering and Computer Science from MIT in 2009. The main focus of her PhD work was in acoustic modeling for noise robust speech recognition. After her PhD,... Read More.
Eric Sammer is the CTO and co-founder of ScalingData. Prior to ScalingData, he was an engineering manager at Cloudera. His background is in the development and operations of distributed, highly concurrent, data ingest and... Read More.
Steve Sarsfield is an author and expert in data quality and data governance. His book “The Data Governance Imperative” is a comprehensive exploration of data governance from the business perspective. Steve draws practical wisdom and... Read More.
As inPowered’s CTO, Nima leads the development of the core technologies at the heart of inPowered, and oversees all aspects of engineering and product development. Once a tenured professor with 50+ peer-reviewed publications, Nima... Read More.
Eric Schmidt is the product management lead for Cloud Dataflow on the Cloud engineering team at Google, where his primary role is to help shape the future of fully managed, large-scale data processing. Eric spends... Read More.
Patrick Schots has been working in the IT industry since 2000 and with Intel since 2000. Previously, before joining Intel, Patrick worked at Alcatel and Lucent Technology where he had different Engineering functions. Since 2007... Read More.
Jim Scott is the director of enterprise strategy and architecture at MapR Technologies, Inc. Across his career, Jim has held positions running operations, engineering, architecture, and QA teams in the consumer packaged goods, digital advertising,... Read More.
Shawn is the VP of Customer Success & Applications at Dato where he helps make it easy to build cool experiences with data. He is data geeky and loves inspired technologies, businesses, and gadgets. His... Read More.
Jonathan is a Solutions Architect on the Partner Engineering team at Cloudera. Before joining Cloudera, he was a Lead Engineer on the Big Data team at Orbitz Worldwide, helping to build out the Hadoop clusters... Read More.
Carter has a firm dislike of mandatory biography fields.
Gwen Shapira is a Solutions Architect at Cloudera and leader of IOUG Big Data SIG. Gwen Shapira studied computer science, statistics and operations research at the University of Tel Aviv, and then went... Read More.
Roman Shaposhnik is a Director of Open Source @Pivotal. He is a member of Apache Software Foundation, committer on Apache Hadoop, founder of Apache Bigtop and, as of late, a man behind Open Data Platform... Read More.
At Intel, Vin Sharma is responsible for strategic ecosystem initiatives driving adoption of end-to-end analytics solutions based on Intel data center platforms. In this role, Vin spearheads technical and marketing engagements partners working with open... Read More.
Clint Sharp is Director of Product Management overseeing all research and development efforts for Splunk’s Big Data & IT product initiatives. Prior to this role, Clint spent 15 years deploying and running complex IT infrastructures... Read More.
Adam Silberstein is a lead software engineer at Trifacta. His main area of interest is large-scale data processing, including in the batch processing and online serving spaces. His work has appeared in top database venues... Read More.
Sumeet Singh is a Senior Director of Products at Yahoo responsible for platforms product management and customer engagements. In this role, he also leads the Hadoop products team responsible for both Apache open source contributions... Read More.
Noelle Sio has a background in mathematics, statistics, and data mining with an emphasis on digital media. She is currently a Principal Data Scientist at Pivotal (formerly Greenplum). Her work has mainly focused on helping... Read More.
Joseph Sirosh is the corporate vice president of the Information Management and Machine Learning (IMML) team in the Cloud and Enterprise group at Microsoft Corp. Sirosh and his IMML team have shipped public... Read More.
Ram Shankar is a security data wrangler in Azure Security Data Science, where he works on the intersection of ML and security. Ram’s work at Microsoft includes a slew of patents in the large intrusion... Read More.
Adam Smith runs sales, marketing and multiple product areas for Automated Insights. He is heavily involved in company strategy, product, and mobile initiatives.
Adam started working with Automated Insights as an advisor in 2008, working... Read More.
Terence Spies has over 19 years of security and systems software development experience, working with leading companies such as Microsoft, Asta Networks and others. He is frequently quoted by business and technology press on today’s... Read More.
Coe Leta is a Design Director at IDEO, Palo Alto where she leads Design Research – IDEO’s human-centered approach of understanding people’s needs, developing insights, and connecting those insights to innovative design solutions for... Read More.
Doug has been working on creating and scaling technology-enabled personalized learning for the better part of the past 20 years. The tools didn’t really exist in 1994 when he established an interactive software simulation business... Read More.
Rick Stellwagen is director for Think Big’s Data Lake program. After years of building highly available MPP relational database systems, he was an Enterprise and Solution architect in Professional Services. Most recently, he served... Read More.
Vijay analyzes big data to drive business decisions across many facets of Rent the Runway – marketing, operations, product and technology, inventory buying and planning, and customer service. He is credited with building the backend... Read More.
Jagane Sundar has extensive big data, cloud, virtualization, and networking experience and joined WANdisco through its acquisition of AltoStor, a Hadoop-as-a-Service platform company. Before AltoStor, Jagane was founder and CEO of AltoScale, a Hadoop... Read More.
India’s 5+ years of industry experience focuses on optimizing data and evaluation to enhance social change, such as community and housing redevelopment and educational achievement programs/initiatives. As Director of Evaluation + Insight at United Way,... Read More.
Doug specializes in the research and development of user interface designs and advanced visualization solutions for complex systems.
He has over 25 years of experience in producing and/or managing the development of award-winning content, visual... Read More.
Cathy has been working with data in one form or another for the past 15 years. She is passionate about bringing data into the culture of startup companies. She has built and run analytics organizations... Read More.
Daniel works in the Cloudera training team building Cloudera’s developer and data science Cloudera Certified Professional certifications. Daniel also has a long history as a software engineer in the high performance computing space and has... Read More.
Data architect, computational scientist, and technical leader. Andy is the CTO of Bold Metrics, where he is bringing his experience building smart scalable data systems to the fashion industry. You will also find him... Read More.
Anuranjita Tewary is Director of Product Management at Intuit. She was a founder at Level Up Analytics, which was acquired by Intuit. Her previous roles have been data scientist at LinkedIn, and product management at... Read More.
Thiruvel Thirumoolan is a developer in the Hive and HCatalog team at Yahoo!. In this role he is responsible for deployment of Hive, HiveServer2 and HCatalog across all the Hadoop clusters at Yahoo! and ensuring... Read More.
Hello, I am James. I’m 29, live in San Francisco, and work in tech. I grew up pretty modestly in small towns in FL and WV, read a lot of books on astrophysics and human... Read More.
Kathleen Ting is a technical account manager at Cloudera where she helps strategic customers deploy and use the Apache Hadoop ecosystem in production. She’s a frequent conference speaker, has contributed to several projects in the... Read More.
Reena Tiwari is an IT leader who delivers technology solutions to solve complex business problems. At Cisco, she is leading a Marketing IT team that is responsible for Data, Analytics, Lead Management, Marketing Operations Automation.... Read More.
Anirudh Todi – Senior Software Engineer at Twitter
At Twitter, Anirudh works on the Data Platform team. Anirudh and his team are chartered with processing and understanding the vast body of data that is generated... Read More.
Omer brings fifteen years of front-line business and technical experience, having been responsible for some of today’s largest modern database management system deployments. He most recently served as Vice President of Field Operations at WibiData,... Read More.
Catherine Truxillo, Ph.D. has been with the advanced analytics education team at SAS since 2000 and has written or co-written SAS training courses for advanced statistical methods including: multivariate statistics, linear and generalized... Read More.
Doug has been engrossed in programming since his parents first bought him an Apple IIe computer in 4th grade. Throughout his early career, Doug proved his flexibility and ingenuity in crafting solutions in a variety... Read More.
Brian Ulicny is a founding member of the new Data Innovation Lab recently stood up by Thomson Reuters in Boston. The Data Innovation Lab will partner with internal teams, customers and third parties, such as... Read More.
Stéfan van der Walt is a senior lecturer in applied mathematics at
Stellenbosch University, South Africa, and an associate project
scientist in the astronomy department at UC Berkeley. He has been
involved... Read More.
Shankar Vederaman leads the Payment Analytics Data science and Engineering team at Netflix. His team is responsible for providing analytical solutions for Payments, Fraud and Retail gift analytics. The solutions include data engineering, BI engineering... Read More.
Sunil Venkayala, Senior Technical Product Manager at HP Vertica in Cambridge, Mass. He leads the Distributed R open-source technology initiative and advanced analytics features of the HP Vertica platform. Prior to joining HP, he was... Read More.
Anand Venugopal has been working with Fortune 1000 Enterprises to deliver real business benefits and ROI from Big Data Solutions since 5 plus years at Impetus. And before that since 1995 – has been... Read More.
LianHui Wang is a Software Engineer from Tencent’s TEG Big Data Department. He is also a contributor to Spark, Hadoop, and Hive. He has been instrumental in Tencent’s Hadoop and Spark applications, with focus... Read More.
Peter Wang is the cofounder and CTO of Continuum Analytics, where he leads the product engineering team for the Anaconda platform and open source projects including Bokeh and Blaze. Peter has been developing commercial... Read More.
Tech Expert Sponsor – Evangelizing IT Tech Careers at Chevron
Chip Wells has over 15 years of experience in implementing theoretical and applied econometrics using the SAS programming language and SAS Solutions. He is a Statistical Services Specialist in the SAS Education... Read More.
Patrick Wendell is an engineer at Databricks as well as a Spark
Committer and PMC member. In the Spark project, Patrick has acted as
release manager for several Spark releases, including Spark... Read More.
John Myles White is one of the main developers of statistical libraries for
Julia. He currently works at Facebook, where he is a member of the Core Data
Science team. John is also... Read More.
Tom White has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. He is the author of “Hadoop: The Definitive Guide” for O’Reilly. Previously he worked as... Read More.
Tom White is one of the foremost experts on Hadoop. He has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. His book Hadoop: The Definitive Guide... Read More.
Paula Wiles Sigmon is Program Director for marketing of the InfoSphere Information Integration and Governance portfolio at IBM.
She has held various marketing management positions at IBM, leading teams responsible for demand-generation programs,... Read More.
Cack Wilhelm is a principal at Scale Venture Partners, where she focuses on investments in early-stage software companies, with an eye toward those helping businesses better utilize data, automate workflows, incorporate AI, and build more... Read More.
Richard has been at the cutting edge of big data since its inception, leading multiple efforts to build multi-petabyte Hadoop platforms, maximizing business value by combining data science with big data. He has extensive experience... Read More.
Dr. Woodfield is a Statistical Services Specialist in the Education Division at SAS.He provides training and mentoring services in the areas of statistical forecasting, predictive modeling, and data mining.Terry was Chief Statistician at
As Senior Director of Product Management and Strategy at US R&D Center of Huawei, Bing oversees new product R&D, open-source initiatives, and leads a global growing team to drive vertical big data-centric solutions. Prior to... Read More.
Reynold Xin is a committer on Apache Spark. He is also a co-founder of Databricks. Before Databricks, he was pursuing a PhD in the UC Berkeley AMPLab.
Fangjin is one of the main committers to the open source Druid project and one first developers at Metamarkets, a San Francisco based data startup. Fangjin previously worked on diagnostic optimization algorithms at Cisco Systems.... Read More.
Sungwook is a Data Scientist at MapR. Sungwook’s data experience includes malware detection algorithms for packet stream analysis, mobile network signaling analysis, social network analysis, job title analysis as well as call center data analysis.... Read More.
Consulting professor at Stanford within ICME, conducting research and teaching courses targeting doctorate students. Technical Advisor at Databricks. I focus on Discrete Applied Mathematics, Machine Learning Theory and Applications, and Large-Scale Distributed Computing.
Matei Zaharia is an assistant professor of computer science at MIT, and the initial creator of Apache Spark. He is currently on industry leave to start Databricks, a company commercializing Spark, where he is... Read More.
Philip Zeyliger is a Software Engineer at Cloudera. He came to Cloudera from Google, where he worked on scalable storage for user-facing applications. Before that, he worked in finance, at D.E. Shaw. Philip holds a... Read More.
Xuefu Zhang has over 10 year’s experience in software development. Working for Cloudera since May 2013, he spends a lot of his efforts on Apache Hive and Pig. He also worked in the Hadoop team... Read More.
Alice Zheng manages the optimization team on Amazon’s Ad Platform. Alice specializes in research and development of machine-learning methods, tools, and applications. Outside of work, she is writing a book, Mastering Feature Engineering. Previously, Alice... Read More.
As VP of Products, Wei combines her passion for technology with experience in Enterprise Software to define and shape Trifacta’s product offerings. Having founded several startups of her own, Wei believes strongly in innovative technology... Read More.
Adriana Zubiri is a Program Director in the Information Management Big Data group at the IBM Toronto Software Lab. In her current role, Zubiri is responsible for driving the software development execution for Big... Read More.
Monte Zweben is the CEO and co-founder of Splice Machine, provider of the only Hadoop RDBMS. A SQL-on-Hadoop solution, Splice Machine has helped many companies scale real-time applications using commodity hardware without... Read More.
For exhibition and sponsorship opportunities, email email@example.com
For information on trade opportunities with O'Reilly conferences, email firstname.lastname@example.org
For media-related inquiries, contact Maureen Jennings at email@example.com
View a complete list of Strata + Hadoop World contacts
©2015, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.