Skip to main content

Strata + Hadoop World Speakers

New speakers are added continuously. Please check back to see the latest updates to the program.

Search Speakers:
Daniel Abadi (Yale University), @daniel_abadi

Daniel is a recognized expert in database systems and one of the inventors of Hadapt’s patent-pending Adaptive Analytical Platform™.

Daniel received his PhD from Massachusetts Institute of Technology, where his dissertation on column-store database systems led to the founding of Vertica (recently acquired by Hewlett Packard). He is a recipient of a Churchill Scholarship, an NSF CAREER Award, a Sloan Research Fellowship, the 2008 SIGMOD Jim Gray Doctoral Dissertation Award, and the 2007 VLDB best paper award.

In addition to serving as a faculty member in Yale University’s Computer Science department, he also serves as Chief Scientist at Hadapt. He blogs at DBMS Musings and tweets at @daniel_abadi.

James Abley
James Abley (Government Digital Service), @jabley

Working for Government Digital Service, providing the next generation of digital services @gdsteam for citizens

Sameer Agarwal
Sameer Agarwal (UC Berkeley)

Sameer Agarwal is a final year Ph.D. student in the AMPLab at Berkeley working on large-scale approximate query processing frameworks. His research interests are at the intersection of distributed systems, databases and machine learning, and he has published over 10 articles in various top-tier conferences including NSDI, EUROSYS, SIGMOD, VLDB and KDD. He received his B.Tech in Computer Science and Engineering from the Indian Institute of Technology and was awarded the President of India Gold Medal in 2009. He was supported by the Qualcomm Innovation Fellowship during 2012-13 and is supported by the Facebook Graduate Fellowship during 2013-14.

Dr. Vijay Srinivas Agneeswaran

Dr. Vijay Srinivas Agneeswaran has a Bachelor’s degree in Computer Science & Engineering from SVCE, Madras University (1998), an MS (By Research) from IIT Madras in 2001 and a PhD from IIT Madras (2008). He was a post-doctoral research fellow in the LSIR Labs, Swiss Federal Institute of Technology, Lausanne (EPFL) for a year. He has done an internship in Siemens Corporate Research in Bangalore and was with another product development company – Oracle for three years, He subsequently spent a year as principal architect position with GTO, the research arm of Cognizant in Chennai, where he led the Extreme Processing group within the High Performance Computing Centre of Excellence and created Intellectual property in the Big-Data space. He... Read More.

John Akred
John Akred (Silicon Valley Data Science), @BigDataAnalysis

With over 15 years in advanced analytical applications and architecture, John is dedicated to helping organizations become more data-driven. He combines deep expertise in analytics and data science with business acumen and dynamic engineering leadership.

Srisatish Ambati (0xdata Inc)

Sri is co-founder and ceo of 0xdata (@hexadata), the builders of H2O. H2O democratizes bigdata science and makes hadoop do math for better predictions. Before 0xdata, Sri spent time scaling R over bigdata with researchers at Purdue and Stanford. Prior to that Sri co-founded Platfora and was the Director of Engineering at DataStax. Before that Sri was Partner & Performance engineer at java multi-core startup, Azul Systems, tinkering with the entire ecosystem of enterprise apps at scale. Before that Sri was at sabbatical pursuing Theoretical Neuroscience at Berkeley. Prior to that Sri worked on nosql trie based index for semistructured data at in-memory index startup RightOrder.

Sri is known for his knack for envisioning killer apps in fast evolving spaces and assembling stellar teams towards... Read More.

Matt Asay
Matt Asay (Adobe), @mjasay

Matt Asay is vice president of corporate strategy at 10gen, the MongoDB company. With more than a decade in open source, Asay has served as VP of business development at real-time analytics company Nodeable (acquired by Appcelerator in October 2012); VP of business development at mobile HTML5 start-up Strobe (now part of Facebook); chief operating officer at Canonical, the Ubuntu Linux company; GM, Americas and VP of business development at Alfresco; and part of the team that helped put Novell on its open source track. Asay is an emeritus board member of the Open Source Initiative (OSI) and earned his juris doctorate at Stanford, where he focused on open source and other IP licensing issues.

Bahman Bahmani
Bahman Bahmani (Stanford University)

Bahman did his PhD at Stanford University, supported by William R. Hewlett Stanford Graduate Fellowship, and focused on the topic of algorithms for big data applications, in which he is a well-published author in some of the best conferences and journals, including PVLDB, SIGMOD, WWW, and KDD. He was the last PhD student of the legendary late Rajeev Motwani, and was also advised by Ashish Goel and Prabhakar Raghavan (formerly Yahoo Chief Strategy Officer, currently Google VP of Engineering). His industry experience during his PhD studies spans several internships and collaborations with some of the best researchers and practitioners from Twitter, Microsoft Research, Yahoo Research, AOL, and Google. A recipient of the Yahoo Key Scientific Challenges Award, he was dubbed... Read More.

Brandon Ballinger (Cardiogram)

Brandon is applying machine learning to cardiology at Cardiogram. Previously, he was part of the healthcare.gov rescue team, a co-founder at Sift Science, and a software engineer at Google working on speech recognition for Android phones, ads anti-spam, and more. He holds a B.S in Computer Science from the University of Washington.

Luca Barone (Cisco)

Luca Barone, Sr. Director, Strategy and Marketing at Cisco Systems, is responsible for monitoring mega-industry and market trends and their implications for Cisco’ strategy. As part of this effort, Luca looks at new and innovative business models, with particular focus on Cisco’s software, cloud, and SDN opportunities. Luca has an extensive background in strategy, P&L management and technology, with deep expertise in software, hardware, IT services and cloud computing. His passion for customer-insight driven strategy development drove his success within Microsoft’s support business, where he turned the business around from losing tens of millions of dollars to a 40% operating margin. Luca also held strategic positions at Bain & Company and Procter & Gamble.

Gabe Batstone

As CEO, Gabe leads NGRAIN in its commitment to deliver innovative 3D simulation software and solutions that exceed customer expectations, address industry trends, and focus on results-driven performance. With more than 15 years of experience working for organizations in the aerospace and defense and automotive industries, Gabe has the proven ability to guide companies through the emerging growth phase of development by focusing on strategic market initiatives and operational effectiveness.

In 2011, Gabe returned to NGRAIN as Chief Operating Officer from his position as Managing Director at CAE Flightscape, the leader in the science of flight safety, specializing in flight data analysis. At Flightscape, Gabe leveraged his more than 15 years of product management, business development, and marketing experience to drive... Read More.

Benjamin Bengfort
Benjamin Bengfort (District Data Labs and University of Maryland), @bbengfort

Benjamin Bengfort is a full stack data scientist with a passion for massive machine learning involving gigantic training data sets. A founding partner and CTO at Unbound Concepts, he lead the development of Meridian, the company’s textual complexity ranking algorithm designed to parse and determine the reading level of educational content for K-6 readers. With a professional background in military and intelligence and an academic background in economics and computer science, he brings a unique set of skills and insights to his work, and is currently pursuing a PhD in computer science at UMBC.

Anjul Bhambhri

Anjul Bhambhri is the Vice President of Big Data Products at IBM. She was previously the Director of IBM Optim application and data life cycle management tools. She is a seasoned professional with over twenty-two years in the database industry. Over this time, Anjul has held various engineering and management positions at IBM, Informix and Sybase. Prior to her assignment in tools, Anjul spearheaded the development of XML capabilities in IBM’s DB2 database server. She is a recipient of the YWCA of Silicon Valley’s “Tribute to Women in Technology” award for 2009. Anjul holds a degree in Electrical Engineering.

Oreilly_BSchmarzo Bill
Oreilly_BSchmarzo Bill (EMC Consulting), @schmarzo

Bill Schmarzo, author of the upcoming Big Data: Understanding How Data Powers Big Business, to be published by Wiley, is responsible for setting the strategy and defining the service line offerings and capabilities for the EMC Consulting Enterprise Information Management and Analytics service line. He’s written several white papers and is a frequent speaker on the use of big data and advanced analytics to power an organization’s key business initiatives.

Bill has more than two decades of experience in data warehousing, BI, and analytics applications. Bill authored the Business Benefits Analysis methodology that links an organization’s strategic business initiatives with their supporting data and analytic requirements, and co-authored with Ralph Kimball a series of articles on analytic applications. Bill has served on The Data... Read More.

Ron Bodkin
Ron Bodkin (Teradata), @ronbodkin

Ron founded Think Big to help companies realize measurable value from Big Data. Previously, Ron was VP Engineering at Quantcast where he led the data science and engineer teams that pioneered the use of Hadoop and NoSQL for batch and real-time decision making. Prior to that, Ron was Founder of New Aspects, which provided enterprise consulting for Aspect-oriented programming. Ron was also Co-Founder and CTO of B2B applications provider C-Bridge, which he led to a staff of 900 service consultants and a successful IPO. Ron graduated with honors from McGill University with a B.S. in Math and Computer Science. Ron also earned his Master’s Degree in Computer Science from MIT, leaving his PhD program after presenting the idea for C-bridge and placing... Read More.

Matt Brandwein

Matt Brandwein is Director of Product Marketing at Cloudera. Prior to Cloudera, he delivered enterprise search, business intelligence, and data discovery products at Endeca and Oracle. Matt holds B.S. degrees in Computer Science and Mathematics from the University of Massachusetts Amherst.

Richard Brath
Richard Brath (Uncharted Software), @oculusbusiness

I’ve been designing and building innovative information visualizations for 20 years, ranging from one of the first interactive 3D financial visualizations on the web in 1995 to visualizations embedded in financial data systems used every day by thousands of market professionals.

Stephen Brobst (Teradata Corporation)

Stephen Brobst is the Chief Technology Officer for Teradata Corporation. His expertise is in the identification and development of opportunities for the strategic use of technology in competitive business environments. Over the past sixteen years Stephen has been involved in numerous engagements in which he has been called upon to apply his combined expertise in business strategy and high-end parallel systems to develop frameworks for data warehousing and data mining to leverage information for strategic advantage. Clients with whom he has worked include leaders such as Office Depot, Fidelity Investments, General Motors Corporation, Kroger Company, Wells Fargo Bank, Wal*Mart, MCI Telecommunications, Aetna Health Plans, Metropolitan Life Insurance, VISA International, BSkyB, Blockbuster Entertainment, Standard & Poor’s, Ameritech, Blue Cross Blue Shield, Nationwide Insurance, American... Read More.

Micheline Casey
Micheline Casey (Federal Reserve Board), @michelinecasey

Ms. Casey is Principal at CDO, LLC, a boutique consultancy supporting the development of large-scale enterprise information management, data governance, and data security strategic plans and implementation efforts. CDO, LLC helps both private and public sector organizations create customer-centric approaches to information and service delivery; improve information accessibility; streamline and link cross-departmental business processes; reduce information fragmentation; maximize the return on funding investments in information architecture and application development; produce accurate and actionable business intelligence and performance metrics; and, improve risk management and cyber security postures. Ms. Casey also authors the dataTrending blog, at www.dataTrending.wordpress.com .

Prior to CDO, LLC, Ms. Casey was the first state Chief Data Officer in the country, and part of the Governor’s Office in... Read More.

Kelly Caylor (Princeton University)

Kelly Caylor is an Associate Professor in Princeton’s Department of Civil & Environmental Engineering. Dr Caylor has worked extensively on water-related issues throughout Africa for over 15 years. Currently he supervises an effort to bring realtime drought forecasts to Africa using cell-based sensors.

John Choi
John Choi (IBM)

John Choi is Director of Product Management at IBM Software Group where he is responsible for product direction and strategy for the Big Data software portfolio. Prior to this role, he has held various product/portfolio management and strategy responsibilities in IBM including Information Management and WebSphere with a focusing on portfolio strategy and emerging technologies. John received his BA and MBA from Yale University.

Michael Chui
Michael Chui (McKinsey Global Institute), @mchui

Michael Chui is a San Francisco-based partner in the McKinsey Global Institute, where he directs research on the impact of disruptive technologies, such as big data, social media, and the internet of things, on business and the economy. Previously, as a McKinsey consultant, Michael served clients in the high-tech, media, and telecom industries on multiple topics. Prior to joining McKinsey, he was the first chief information officer of the City of Bloomington, Indiana, and was the founder and executive director of HoosierNet, a regional internet service provider. Michael is a frequent speaker at major global conferences and his research has been cited in leading publications around the world. He holds a BS in symbolic systems from Stanford University and a PhD in computer science and... Read More.

Quentin Clark
Quentin Clark (Microsoft)

As corporate vice president of program management for the Microsoft Data Platform Group, Quentin Clark oversees the design and delivery of the entire family of SQL Server products as well as the Azure Data Platform services. The Azure Data Platform is a complete end-to-end platform serving data management and processing capability, data integration and refinement, and business analytics as Microsoft Azure services and Microsoft Office and Office 365 offerings. Leading a team of technical engineers, his responsibilities include product direction and definition through program management, user experience and design, and customer engagement programs. This spans SQL Server’s work in all workloads – databases, integration and business intelligence, as well as the release forms of the product – software, appliances and the cloud services.

... Read More.

Community Manager within Google Diversity Programs focusing on CS Education in NY

Cliff Click
Cliff Click (0xdata)

Cliff Click is the CTO and Co-Founder of 0xdata, a firm dedicated to creating a new way to think about web-scale data munging and real-time analytics. He wrote his first compiler when he was 15 (Pascal to TRS Z-80!), although Cliff’s most famous compiler is the HotSpot Server Compiler (the Sea of Nodes IR). Cliff helped Azul Systems build an 864 core pure-Java mainframe that keeps GC pauses on 500Gb heaps to under 10ms, and worked on all aspects of that JVM. Before that he worked on HotSpot at Sun Microsystems, and was at least partially responsible for bringing Java into the mainstream.

Cliff is invited to speak regularly at industry and academic conferences and has published many papers about HotSpot technology.... Read More.

Gabriel Commeau
Gabriel Commeau (Comcast)

Gabriel Commeau is a principal software engineer at Comcast, working on Big Data and real-time analytics for the next generation of video players. He holds a Master’s degree in computer engineering from UTC, France. Since then, he has been working in the telecom and defense industries, both in France and in the US where he moved a decade ago. Throughout his career, he has had the opportunity to put his analytical mind in action, working on a wide variety of complex systems ranging from web conferencing to JTAG. Gabriel is a Flume contributor.

Michael Cote
Michael Cote (Motorola Solutions, Inc.)

Mike is a software architect with the Global Solutions & Services department of Motorola Solutions, Inc. Motorola is working to bring cross-functional solutions to improve the end customer experience at brick and mortar retail stores. Real time analysis and response is crucial to maintaining good customer relations is a highly competitive environment.

With over 25 years of software development experience in both big companies and small start-ups, Mike has delivered solutions throughout the range of software environments. He started with embedded development for Digital Equipment disk drives and adapters and has delivered software in network appliances, hosted web applications and enterprise security infrastructure components.

His latest projects are focused on large scale enterprise solutions deployed in a horizontally scaled event architecture with both real-time and... Read More.

Alistair Croll
Alistair Croll (Solve For Interesting), @acroll

Alistair Croll is an entrepreneur with a background in web performance, analytics, cloud computing, and business strategy. In 2001, he cofounded Coradiant (acquired by BMC in 2011) and has since helped launch Rednod, CloudOps, Bitcurrent, Year One Labs, and several other early-stage companies. He works with startups on business acceleration and advises a number of larger companies on innovation and technology. A sought-after public speaker on data-driven innovation and the impact of technology on society, Alistair has founded and run a variety of conferences, including Cloud Connect, Bitnorth, and the International Startup Festival, and is the chair of O’Reilly’s Strata Data Conference. He has written several books on technology and business, including the best-selling Lean Analytics. Alistair tries to mitigate his chronic ADD by writing... Read More.

Doug Cutting
Doug Cutting (Cloudera), @cutting

Doug Cutting is the chief architect at Cloudera and the founder of numerous successful open source projects, including Lucene, Nutch, Avro, and Hadoop. Doug joined Cloudera from Yahoo, where he was a key member of the team that built and deployed a production Hadoop storage-and-analysis cluster for mission-critical business analytics. Doug holds a bachelor’s degree from Stanford University and sits on the board of the Apache Software Foundation.

Jason (Jinquan) Dai

Dai is currently an Engineering Director and Principal Engineer in Intel SSG (Software and Services Group), leading the SW engineering efforts on advanced big data technology development in Intel. Prior to that, he was the lead architect and engineering manager for building the 1st auto-partitioning and parallelizing compiler product for many-core many-thread processors (Intel Network Processor) in the industry. He received M.S. from National University of Singapore, and BSc from Fudan University, both in computer science.

Brian Dalessandro

Brian is a practicing data scientist with 12 years of modeling experience, ranging from high energy particle physics to online advertising. Brian is currently VP of Data Science at M6D, an online advertising firm. Over the past 4 years, Brian has been a lead researcher in developing media6degree’s patent pending machine learning technology. Brian’s current R&D interests include building autonomous machine learning systems over big data architectures, causal inference and transfer learning. Brian has published multiple papers on these topics in top academic journals and conferences. Brian is also a co-chair of the annual KDD Cup Data Mining competition.

Prior to joining media6degrees, Brian was a Senior Research Analyst at Meetup.com and a risk modeler for American Express. Brian holds an MBA with... Read More.

Tathagata Das
Tathagata Das (Databricks)

Tathagata Das is a third-year Ph.D. student in the AMP Lab in UC Berkeley, working Scott Shenker and Ion Stoica. He leads the development of the Spark Streaming project. His research interests include datacenter networks and frameworks for large scale data processing. Before graduate school, he has worked as an Assistant Researcher in Microsoft Research Lab India.

Jeff  Denworth

Jeff Denworth has over a decade of experience with advanced computation and massively scalable big data storage technologies. Before joining DDN in 2006, Mr. Denworth ran worldwide sales and marketing at Cluster File Systems, Inc. – where he managed all aspects of sales & business development associated with the Lustre® file system. Prior to Cluster File Systems, Jeff managed the HPC and enterprise server memory business development at Dataram Corporation.

Ravi Devireddy

As a senior business leader at Visa Inc, Ravi heads the design, engineering and implementation of advanced analytics and data services. He is currently implementing petabyte scale data platforms leveraging data mining, visualization, and machine learning for advanced cyber security intelligence. Prior to that, Ravi has led several large scale data warehouse and business intelligence implementations putting on various hats as a director, architect, DBA, and developer

nick dimiduk
nick dimiduk (Hortonworks, Inc), @xefyr

Nick stumbled into HBase in 2008 when his nightly ETL jobs started taking 20+ hours to complete. Since then, he has applied Hadoop and HBase to projects in social media, social gaming, click-stream analysis, climatology, and geographic data. Nick also helped establish Seattle’s Scalability Meetup and tried his hand at entrepreneurship. Most recently, he is an author on HBase in Action, available now. He has a passion for distributed, scalable, online access to scientific data.

David Dobbins
David Dobbins (Rackspace Hosting)

David is a seasoned professional with areas of expertise in development and platform design spanning 25 years. Previous roles included CTO at Apex Software and Dado Technologies. Currently David is managing the product development for Rackspace Cloud Big Data, a HDP based cloud service focused on delivering Hadoop based instances through the Openstack platform. Specific areas of focus in his current role include process management, technical architecture, and software development.

Michael Dobrovolsky (Morgan Stanley Wealth Management), @MorganStanley

Michael Dobrovolsky is the Lead Architect of Enterprise Big Data Solutions and Advanced Analytics Services for the Wealth Management business at Morgan Stanley.

Michael brings a wealth of experience in the field of banking and capital markets, data mining, NLP, text mining advanced business analytics and technology strategy.

At his previous job as the Senior Principle and Thought leader in the business intelligence consulting services of the Capital Markets division at Hewlett-Packard, he established a reputation for developing innovative, advanced and strategic solutions in the Financial Services industry.

Michael worked at the capacity of a Strategic Advisor to CTO and Chief Architect for Fortune 500 and Major Information Provider Companies advancing Information Management framework and discipline into realm of new cutting edge technologies.

... Read More.
Howard Dresner
Howard Dresner (sandhill.com), @howarddresner

Howard Dresner is Chief Research Officer of Dresner Advisory Services, LLC, an independent advisory firm and a well known authority and author in the areas of Business Intelligence and Performance Management.

Howard has over 30 years of IT industry experience with 24 years in the Business Intelligence market.

He spent 13 years at Gartner, where he was a Research Fellow and Lead Analyst for BI. He also served as Chief Strategy Officer at Hyperion Solutions prior to forming Dresner Advisory Services in 2007.

Howard is a frequent speaker around the globe and has published two books on the subject – including: Profiles in Performance – Business Intelligence Journeys and the Roadmap for Change (John Wiley & Sons, November 2009) and The Performance Management Revolution:... Read More.

Ted Dunning
Ted Dunning (MapR Technologies), @ted_dunning

Ted Dunning has been involved with a number of startups—the latest is MapR Technologies, where he is chief application architect working on advanced Hadoop-related technologies. Ted is also a PMC member for the Apache Zookeeper and Mahout projects and contributed to the Mahout clustering, classification, and matrix decomposition algorithms. He was the chief architect behind the MusicMatch (now Yahoo Music) and Veoh recommendation systems and built fraud-detection systems for ID Analytics. Opinionated about software and data-mining and passionate about open source, he is an active participant of Hadoop and related communities and loves helping projects get going with new technologies.

Joey Echeverria

Joey Echeverria is a Senior Solutions Architect at Cloudera where he works directly with customers to deploy production Hadoop clusters and solve a diverse range of business and technical problems. Joey joined Cloudera from the NSA where he worked on data mining, network security, and clustered data processing using Hadoop. Prior to working full time for NSA, Joey attended Carnegie Mellon University where he attained an M.S. and a B.S. in Electrical and Computer Engineering.

Israel Ekpo
Israel Ekpo (Walt Disney Parks and Resorts), @israelekpo

Israel Ekpo is a seasoned and experienced software engineer, computer scientist, big data enthusiast and data science practitioner. He uses and/or contributes to a variety of open source projects including but not limited to Apache Lucene, Apache Solr, ElasticSearch, Apache Flume, Mahout, Hadoop, HBase , MongoDB, CouchBase, Neo4j, and Apache Hive.

Amie Elcan
Amie Elcan (CenturyLink)

Amie Elcan is a principal architect in CenturyLink’s Data Network Strategies organization, where her current areas of focus are traffic modeling, application traffic analytics, and data science. Amie has worked in the telecommunications industry for over 20 years delivering traffic-based assessments that drive optimal network architecture and engineering design decisions.

Jim Englert (Gilt)

Jim Englert is a Lead Software Engineer at Gilt, where he focuses on improving email- and Facebook-related initiatives and personalized shopping experiences for the company’s millions of customers. Since joining Gilt in 2010 he has helped to launch Insider, Gilt’s ground-breaking customer loyalty program; brand-new SEO initiatives; and a host of other projects. He codes primarily in Scala but also works with various front-end and user-interface technologies. Prior to joining Gilt, Jim was a software engineer at Quality Health in Jersey City, N.J. and Technology Services Group in Chicago, Ill. He received his undergraduate degree in computer science from the University of Illinois in 2006.

Lyndon Estes (Princeton University)

Lyndon’s primary interest lies in understanding how global change (climate change and food demand) shapes agricultural land uses, and how these in turn impact terrestrial ecosystems and species. He uses remote sensing and spatially-explicit simulation models as his primary tools of investigation. Further details on his research are here.

Deborah Estrin
Deborah Estrin (Cornell NYC Tech)

Deborah Estrin, PhD, is Professor of Computer Science at UCLA, and co-Director of a new non-profit, openmhealth.org. Estrin is known as a thought leader in the innovative application of wireless and mobile technologies. Ongoing projects include self-monitoring applications in support of health and wellness and Participatory Sensing campaigns for community data gathering, citizen science, and STEM education in her career, she was an active member of the Internet research and open standards community. Estrin is a member of the National Academy of Engineering and the American Academy of Arts and Sciences.

Deborah Estrin (Cornell NYC Tech)

Deborah Estrin is a Professor of Computer Science at Cornell Tech in New York City and a Professor of Public Health at Weill Cornell Medical College. She is co-founder of the non-profit open-source software startup, Open mHealth. Previously, she was on the UCLA faculty where she was the Founding Director of the NSF Center for Embedded Networked Sensing (CENS).

Estrin is a pioneer in networked sensing, which uses mobile and wireless systems to collect and analyze real time data about the physical world and the people who occupy it. Her current projects focus on innovative small-data consumer apps and mobile health (mhealth), leveraging the programmability, proximity, and pervasiveness of mobile devices and the cloud for health and life management.

Susan Etlinger
Susan Etlinger (Altimeter Group), @setlinger

Susan Etlinger is an industry analyst at Altimeter. Her research focuses on the impact of artificial intelligence, data and advanced technologies on business and culture and is used in university curricula around the world. Susan’s TED talk, “What Do We Do With All This Big Data?,” has been translated into 25 languages and has been viewed more than 1.2 million times. She is a sought-after keynote speaker and has been quoted in such media outlets as the Wall Street Journal, the BBC, and the New York Times.

Zack Exley
Zack Exley (Brand New Congress), @zackexley

Zack Exley is the Chief Revenue Officer at the Wikimedia Foundation. He has recently worked at ThoughtWorks, a global IT consultancy and OMP, a U.S. fundraising and communications firm. He is the Co-Founder and President of the New Organizing Institute, a technology and strategy training center for social justice organizations.

Zack helped bring U.S. politics online as MoveOn.org’s first Organizing Director, advisor to the Howard Dean campaign, and Director of Online Communications and Organizing at John Kerry’s presidential campaign. He often appeared as a commentator on national and international media and his work was frequently profiled.

He began developing tools for online political organizing in the 2000 U.S. election controversy, when he used a website to allow citizens to self-organize more than 100 rallies... Read More.

Jun Fang
Jun Fang (Facebook)

Jun Fang has spent 10+ years in Microsoft SQL Server engine team, worked
on both relational and storage engine, led a dev team to build various
features in language, runtime, storage, and management. Later he joined
Bing Platform team, worked in Cosmos, leading a team to build large scale
distributed table storage system with transactions. Since joining
Facebook in 2012, Jun worked in Data Infrastructure, built technologies that
transformed the data ETL pipeline from daily batch to incremental and
realtime.

John Foreman
John Foreman (MailChimp), @John4man

John Foreman is the Chief Data Scientist for MailChimp.com where he leads MailChimp’s data product development effort called the Email Genome Project. He also runs the Data Science for Managers course at Analytics Made Skeezy.

John holds a graduate degree in Operations Research from MIT and has worked as an analytics consultant for the Department of Defense, Coca-Cola, Royal Caribbean International, and Intercontinental Hotels Group. His expertise is in optimization modeling, revenue management, and predictive modeling.

Adam Fuchs

Adam Fuchs is the Chief Technology Officer and co-founder of Sqrrl. Previously at the National Security Agency, Adam was an innovator and technical director for several database projects, handling some of the world’s largest and most diverse data sets. He is a co-founder of the Apache Accumulo project. Adam has a BS in Computer Science from the University of Washington and has completed extensive graduate-level course work at the University of Maryland. In his spare time, Adam enjoys racing sailboats, trail running, and getting lost in the woods.

John Garris (UBS)

John Garris, Executive Director at UBS, is a recognized visionary with a combination of technology and marketing expertise. At UBS, John is focused on corporate innovation and the deployment of emerging technology solutions to optimize financial services capabilities delivered to private, corporate, and instutitional clients. Prior to UBS, John has served in a variety of roles in the technology industry, including Technology Director at frog design, competitive and market strategy at salesforce.com, founding team member at eConsultancy Scient, and product management roles at Oracle and Open Media Network.

Amy Gaskins

Amy Gaskins has over 10 years of military and civilian intelligence experience in both strategic and tactical roles, including deployments to Afghanistan and Guantanamo Bay. Her previous assignments include the U.S. Army Intelligence Center, the Office of Military Commissions – Defense, JFCC-ISR, U.S. Army INSCOM, and Defense HUMINT. She holds a Bachelor of Science degree in Political Science from Florida Southern College and a Master of Arts degree in International Commerce & Policy from George Mason University.

Alan Gates
Alan Gates (Hortonworks)

Alan is a co-founder at Hortonworks and an original member of the engineering team that took Pig from a Yahoo! Labs research project to a successful Apache open source project. Alan also designed HCatalog and guided its adoption as an Apache Incubator project. Alan has a BS in Mathematics from Oregon State University and a MA in Theology from Fuller Theological Seminary. He is also the author of Programming Pig, a book from O’Reilly Press.

Amanda Gerdes
Amanda Gerdes (Blizzard Entertainment)

Amanda Gerdes is a data engineer with Blizzard Entertainment, focusing on the Blizzard Data Warehouse and its many supporting data pipelines. With eight years of experience in ETL development and management, she currently rounds out a team responsible for providing fast, accurate data at scale to Blizzard’s Business Intelligence team. Amanda holds a B.A. from UC Berkeley as well as an MBA and M.S. in Systems Engineering from Loyola Marymount University. Whenever she catches herself thinking that maybe she’s not doing too badly at this “life” thing, she looks up her World of Warcraft playtime in the data warehouse and is humbled once again.

Ari Gesher
Ari Gesher (Palantir Technologies), @alephbass

A software/systems engineer with a lot of experience building big, real-world systems.

Brett Goldstein
Brett Goldstein (University of Chicago), @bjgol

Brett Goldstein is a leader in Government Technology, Big Data/Analytics, and Enterprise Architecture. He has 15 years of experience in operations, management and leadership in technical environments in both the public and private sector.

Currently, Brett is the Commissioner and Chief Information Officer of the Chicago Department of Innovation and Technology (DoIT), appointed by Mayor Rahm Emanuel in June of 2012 to accelerate Chicago’s growth as a global hub of innovation and technology. During his tenure as Chicago’s CIO, Brett has assertively worked toward a citywide vision of consolidated technology while rapidly accelerating the role of innovation in government. His achievements have included changing Chicago’s technology strategy to include cloud environments and reshaping the IT portfolio to include advanced analytics with a focus on... Read More.

Joseph Gonzalez
Joseph Gonzalez (UC Berkeley), @mejoeyg

I am a postdoc in the AMPLab at UC Berkeley where I am continuing work on large-scale systems for machine learning as well as the GraphLab project. As a graduate student I worked with Carlos Guestrin in the Machine Learning Department at Carnegie Mellon University (CMU). My research addresses the challenges of designing and building large-scale machine learning algorithms and systems. In particular, my thesis focuses on large-scale structured machine learning using probabilistic graphical models that are capable of reasoning about billions of related random variables. The resulting algorithms and systems have achieved state-of-the-art performance in tasks ranging from predicting ad preferences in social networks to solving complex protein modeling tasks. As part of my thesis work we created GraphLab , a framework that... Read More.

Patricia Gorla
Patricia Gorla (The Last Pickle), @patriciagorla

Patricia is a software consultant with OpenSource Connections. Starting with Python application development, Patricia moved to data analysis after becoming fascinated with machine learning.

From there, she has worked on many full-stack data projects: gathering and scrubbing the data, running analyses, and developing custom visualizations to lay out the information.

She is passionate about information retrieval, and loves tackling the challenges companies face with fast-growing datasets.

Matthew Grace (Objective Logistics)

Matt has been a programmer since the age of 13. After starting a marketing aggregation software company in his teens he attended the University of Massachusetts for computer science. He has built financial software for State Street Global Advisors, schedule optimization software for ClickSoftware, and most recently spent 4 years as a director of product management at Oracle. Matt likes building stuff (recently things of the culinary sort) and is interested in mathematics, machine learning, non-linear optimization, literature, and studying new languages.

Lisa Green (Domino Data Lab), @boudicca

Lisa is motivated by a strong belief in the power of open systems to drive innovation in education, arts and research. Over the last several years she has been active in the areas of Open Access publishing, Open Science, Open Data, Data for Good, digital rights, and internet policy. Lisa is currently the head of the social impact and public policy initiative, Domino for Good, at Domino Data Lab. Previously, Lisa was Executive Director of Common Crawl and Chief of Staff at Creative Commons. She holds a PhD in physical chemistry from
the University of California Berkeley.

Brian Griffith
Brian Griffith (Blizzard Entertainment)

Brian Griffith’s career spans more than twelve years in the software, financial, and entertainment industries. Prior to Blizzard, he was the lead DBA and data warehouse engineer for Eastern Bank, integrating disparate systems into a single, secure enterprise data system. Currently at Blizzard, he works passionately with vast amounts of data to help game designers make their games even more epic. He holds a B.S. and an M.S. from Northeastern University, specializing in neuroanatomy and statistics. His Blood Elf paladin wears pink armor.

Paul Groom
Paul Groom (Kognitio), @kognitio

Paul Groom is a Big Data Advocate (he hates the word “Guru”) with 20+ years working with MPP Database Design, Business Intelligence, Data Warehousing, and what was called simply ‘VLDBs’ (Very Large Databases) since the time of Briton Lee, a foundational technology that became part of Teradata.

A cartographer by education, he has worked for the British Government and has broken the Amazon Cloud (no major damage). Paul brings practical client experience from tens of thousands of hours building new analytical solutions with MPP and in-memory DBs plus ingenuity to his role as Chief Innovation Officer at Kognitio.

Robert Grossman
Robert Grossman (Open Data Group)

Robert Grossman is the Founder and a Partner of Open Data Group, which specializes in building predictive models over big data. He is also a Senior Fellow at the Institute for Genomics and Systems Biology (IGSB) and Computation Institute (CI) at the University of Chicago and a Professor in the Biological Sciences Division. His areas of research include: big data, bioinformatics, and data intensive computing. He has led the development of new open source software tools for analyzing big data, cloud computing and high performance networking. Prior to starting Open Data Group, he founded Magnify, Inc. in 1996, which provides data mining solutions to the insurance industry. Grossman was Magnify’s CEO until 2001 and its Chairman until it was sold... Read More.

Carlos Guestrin
Carlos Guestrin (Apple | University of Washington )

Carlos is the CEO of GraphLab, and the Amazon Professor of Machine Learning in Computer Science & Engineering at the University of Washington. A world-recognized leader in the field of Machine Learning, Carlos was named one of the 2008 “Brilliant 10″ by Popular Science Magazine, received the 2009 IJCAI Computers and Thought Award for his contributions to Artificial Intelligence, and a Presidential Early Career Award for Scientists and Engineers (PECASE).

Amir Halfon
Amir Halfon (ScalingData)

Amir oversees the development of technology strategy to address Big Data challenges across the industry. Before joining Marklogic, Amir was the CTO for Global Financial Services at Oracle, as well as Sun Microsystems. He possesses a wealth of technical and industry experience, and is a frequent speaker at conferences and contributed to several publications.

Harold Hannon
Harold Hannon (SoftLayer)

Harold Hannon has been working in the field of software development as both an Architect and Developer for over 15 years, with a focus on workflow, integration and distributed systems. He is currently a Sr. Software Architect at SoftLayer Technologies working within their Product Innovation team. Harold has a passion for leveraging open source solutions to bring real value to the Enterprise space, and has implemented open source solutions with many companies across the globe. Harold is also active in mobile application development, with multiple published applications.

Leah Hanson

Leah is a software engineer at Google.
She spent the Spring of 2013 at Hacker School1, where she started learning Julia2 and wrote WebSockets.jl3.
The rest of her year was spent writing Julia code, including TypeCheck.jl4.

She has presented 3 hour Intro to Julia workshops at three conferences: LambdaJam, StrangeLoop, and Strata NYC.

  1. http://www.hackerschool.com/
  2. http://julialang.org/
  3. https://github.com/JuliaLang/WebSockets.jl
  4. https://github.com/astrieanna/TypeCheck.jl
Matt Harrison

Matt is a Python fanatic. He has authored best selling books on the subject, spoken at OSCON, PyCON, SCALE, and local user groups in Utah and the Bay Area, and taught many. He has over a dozen years of Python experience analyzing and optimizing manufacturing yields, writing websites, tooling and scripts for search engines, build systems, and business intelligence.

Shawndra Hill
Shawndra Hill (University of Pennsylvania)

Shawndra Hill is an Assistant Professor in Operations and Information Management at the Wharton School of the University of Pennsylvania. Generally, she studies data mining, machine learning and statistical relational learning and their alignment with business problems. Specifically, she researches the value to companies of mining data on how consumers interact with each other — for targeted marketing, advertising and fraud detection. Here current research focusses on the interactions between TV content and Social Media. Her past and present industry partners include AT&T Labs Research, ClearForest, and Siemens Energy & Automation. Her recent work appears in IEEE Transactions on Data and Knowledge Engineering, Journal of Computational and Graphical Statistics, SIGKDD Explorations, and Statistical Science. Her research is funded in part by the Office... Read More.

Gary Hoberman (Unqork Inc.)

Gary Hoberman is CEO and Founder of Unqork Inc. Gary was formerly Executive Vice President and CIO of MetLife. In this role, Hoberman managed technology for MetLife’s businesses throughout Americas, EMEA, and APAC. Hoberman focuses on driving emerging technologies to leapfrog the insurance industry ahead.

Hoberman has more than 19 years of experience in leading application development within financial services. Prior to MetLife, Hoberman was Managing Director within Citigroup’s Operations and Technology group where he was an award-winning innovator, leading multiple patents across mobile, e-Commerce and social media.

Hoberman graduated from the Stern School of Business at NYU and also served as an NYU adjunct instructor in advanced computer sciences.

Erich Hochmuth
Erich Hochmuth (Monsanto)

Erich Hochmuth is lead of the Big Data and Analytics IT team that supports R&D at Monsanto. For the past three years Erich has been working on a cross functional team to build a heterogeneous computing stack to meet the compute and analytic needs of Monsanto’s scientific community.

Mike Hoskins (Actian Corporation), @MikeHSays

Actian CTO Michael Hoskins directs Actian’s technology innovation strategies and evangelizes accelerating trends in big data, and cloud-based and on-premises data management and integration. Mike, a Distinguished and Centennial Alumnus of Ohio’s Bowling Green State University, is a respected technology thought leader who has been featured in TechCrunch, Forbes.com, Datanami, The Register and Scobleizer. He speaks at events worldwide including recent participation as the keynoter at DeployCon, the “Open Standards and Cloud Computing” panel at the Annual Conference on Knowledge Discovery and Data Mining, the “Scaling the Database in the Cloud” panel at Structure 2010, and the “Many Faces of Map Reduce – Hadoop and Beyond” panel at Structure Big Data 2011. Mike received the AITP Austin chapter’s 2007 Information Technologist of the... Read More.

Juliet Hougland
Juliet Hougland (Cloudera), @j_houg

Juliet Hougland answers complex business problems using statistics to tame multiterabyte datasets. She succeeds in applying and explaining the results of mathematical models across a variety of industries including software, industrial energy, retail, and consumer packaged goods. Juliet is currently the head of data science, engineering at Cloudera, where she focuses on using data to help engineering build high-quality products. Juliet’s been sought after by Cloudera’s customers as a field-facing data scientist advising on which tools to use, teaching how to use them, recommending the best approach to bring together the right data to answer the business problem at hand, and building production machine-learning models. For many years, Juliet has been a contributor in the open source community working on projects such as Apache Spark,... Read More.

Jonathan Hsieh
Jonathan Hsieh (Cloudera, Inc), @jmhsieh

Software Engineer @ Cloudera. Apache HBase Commiter, Apache Flume Founder.

Ravi Hubbly
Ravi Hubbly (Leidos)

“Ravi Hubbly has 20+ years of Information Technology experience in supporting various businesses. As a Principal Architect and Big Data evangelist he is responsible for providing Big Data solutions to Lockheed Martin’s customers. Ravi Hubbly currently leads Lockheed’s Big Data Integrated Project team to provide consistent and proven Big Data solutions to its customers, leads Big Data implementation projects, and supports multiple internal research initiatives.

Trevor  Hughes
Trevor Hughes (International Association of Privacy Professionals)

J. Trevor Hughes is the President and CEO of the International Association of Privacy Professionals (IAPP). In this role, Hughes leads the world’s largest association of privacy professionals.

Hughes is an experienced attorney in privacy, technology and marketing law. He has provided testimony before the U.S. Congress Commerce Committee, the Senate Commerce Committee, the Federal Trade Commission, the Home Affairs Committee of the British Parliament and the EU Parliament on issues of privacy, surveillance, spam and privacy-sensitive technologies. He is a member of the first class of Certified Information Privacy Professionals (CIPP) and recently completed, with co-author Reed Freeman, Privacy Law in Marketing, published by CCH.

Hughes has previously served as the executive director of the Network Advertising Initiative, a leading... Read More.

Noah Iliinsky
Noah Iliinsky (Amazon Web Services), @noahi

Noah Iliinsky strongly believes in the power of intentionally crafted communication. He has spent the last several years thinking, writing, and speaking about best practices for designing visualizations, informed by his graduate work in user experience and interaction design. He is a frequent speaker in both industry and academic contexts.

Noah works as a Visualization Expert at IBM’s Center for Advanced Visualization. He has a master’s in Technical Communication from the University of Washington, and a bachelor’s in Physics from Reed College.

Noah Iliinsky is the co-author of Designing Data Visualizations and technical editor of, and a contributor to, Beautiful Visualization, published By O’Reilly Media.

Albert Isern
Albert Isern (BISmart)

Albert Isern is the CEO of Bismart. Bismart is a Business Intelligence specialist company. Bismart help decision making by transforming data into knowledge, through Big Data, Datawarehousing and Data Mining.
Albert founded Bismart 5 years ago, as the result of his passion for innovation, his ambition to achieve new challenges and his experience of over 15 years.
Ever since its creation, Bismart has won awards, such as the Barcelona Entrepreneurial Capital award and recently the Microsoft Worldwide Partner of the Year 2013 award in Business Intelligence.

Albert is an entrepreneur with wide experience in strategy and organizational management also in marketing and customer-orientation. Capable of extracting maximum team performance through the transmission of positive energy. Quick and efficient, his working... Read More.

Ravi Iyer
Ravi Iyer (Ranker.com), @ravi_polipsych

Ravi is the Principal Data Scientist at Ranker.com. He combines 15 years of hands-on experience with database technology with a PhD in social psychology from the University of Southern California. He publishes regularly in top scholarly journals and his research, which focuses on the psychology of values, has been featured in the New York Times, Wall St. Journal, The Atlantic, and at SXSW Interactive.

Annika Jimenez
Annika Jimenez (Pivotal)

Annika is a seasoned leader of analytics initiatives, and came from Greenplum where she built the “Data Science Dream Team” – an industry-leading group of Data Scientists – representing a rich combination of vertical domain and horizontal analytical expertise – who are facilitating Data Science-driven transformations for Greenplum customers.

Before Greenplum, Annika worked for Yahoo! for six years in data leadership roles: She led Audience and International data solutions for Yahoo!’s central data organization, Strategic Data Solutions, and led Insights Services – comprised of a team of 40 researchers covering Web analytics, satisfaction/brand health metrics, and audience/ad measurement. Annika is a recognized evangelist for “applied data” and well known for her acute focus on action-enablement.

Robert Johnson (Interana)

Bobby is co-founder of Interana, where he is building the next generation of tools for analyzing massive amounts of data in real time.

Bobby was Director of Engineering at Facebook where he led the infrastructure engineering team for 6 years. During that time the team scaled the site from just a few million users to almost a billion. He personally wrote a number of the early systems including Scribe and the first version of Haystack. His team built Facebook’s hadoop/hive analytics stack and the memcache/mysql stack – a database that served trillions objects and answered billions of queries per second. He received a B.S. in Engineering and Applied Science from Caltech.

Posts by Bobby:
http://www.quora.com/Facebook-Engineering/What-have-been-Facebooks-greatest-technical-accomplishments/answer/Robert-Johnson

Scaling Facebook to 500 Million Users and Beyond (Facebook... Read More.

David Jonker
David Jonker (Uncharted Software Inc.)

David Jonker is EVP and a founder of Uncharted Software Inc. (formerly Oculus Info Inc). He is a visual analytics designer and technical architect with twenty years experience. David is interested in the visual elegance of information and the underlying visual and technical design to create powerful solutions.

Ritu Kama
Ritu Kama (Intel)

Ritu Kama is the Director of Product Management for Big Data at Intel. She has over 15 years of experience in building software solutions for enterprises. She has led Engineering, QA and Solution Delivery organizations within Datacenter Software Division for Security and Identity products. Last year she led the Product and Program management responsibilities for Intel’s Distribution of Hadoop and Big Data solutions.
Prior to joining Intel, she led technical and architecture teams at IBM and Ascom. She has a MBA degree from University of Chicago and a Bachelor’s degree in Computer Science.

Sean Kandel
Sean Kandel (Trifacta)

Sean Kandel is the founder and chief technical officer at Trifacta. Sean holds a PhD from Stanford University, where his research focused on new interactive tools for data transformation and discovery, such as Data Wrangler. Prior to Stanford, Sean worked as a data analyst at Citadel Investment Group.

Jim Kaskade
Jim Kaskade (Infochimps), @jimkaskade

Jim Kaskade is CEO of Infochimps, a Big Data subsidiary of CSC. The Infochimps Big Data Platform is an open-standards based analytics platform for private cloud deployments used by enterprise global 2000. It is recognized as the fastest way to deploy big data analytic environments.

Prior to Infochimps Jim was an Entrepreneur-in-Residence at PARC, a Xerox company, where he established PARC’s Big Data program. His work helped PARC understand how to best integrate its in-memory data processing technologies and high-performance data graph analytics to the burgeoning online services ecosystem, with a focus on predictive analytics for the retail sector. Jim also helped build PARC’s Private Cloud platform.

Jim also served as the SVP, General Manager and Chief of Cloud at... Read More.

Adam Kawa
Adam Kawa (GetInData), @adam_kawa

Adam Kawa works as Data Engineer at Spotify and Hadoop instructor at Compendium (Authorized Cloudera Training Partner).

He is a frequent speaker at HUGs, and the coorganizer of Warsaw and Stockholm HUGs. He blogs about Hadoop at HakunaMapData.com.

Jacqueline Kazil

Jackie is a lead software engineer working on python and data related projects for CACI on a contract at The Library of Congress. She is also a Ph. D student at George Mason University. Previously, she worked for The Washington Post on news driven data applications.

She is a lover of all things data, a pythonista, a djangonaut, and a computational social scientist in training. Jackie is a journalist by degree and a self-taught programmer. She is the founder of the PyLadies DC chapter and co-organizer for Django District. She lives in D.C. with her fiance and three super awesome dogs.

Eron Kelly
Eron Kelly (Microsoft Corporation)

Eron Kelly is a 12 year veteran of Microsoft with a broad set of experiences in product management across the Microsoft Office and Server and Tools Divisions. He is currently the General Manager responsible for product marketing for Microsoft SQL Server and the Data Platform. In this capacity he leads core product marketing, technical readiness, and value proposition for the Data Platform. Before leading the SQL product marketing team, he was a General Manager in the Windows Azure group leading an incubation effort focused on Microsoft’s commerce platform.

Prior to that, he led the product management team for Office 365, the fastest growing business in Microsoft’s history. As the founding member of the product management team, he led the development of the initial... Read More.

Paul Kent

Paul Kent is Vice President of Big Data initiatives at SAS. He spends his time between Customers, Partners and the Research & Development teams discussing, evangelizing and developing software at the confluence of big data and high performance computing. A datacenter rack full of current-generation 64bit x86 processors represents a very large aggregate memory space, thousands of threads and plentiful IO that can be harnessed to solve problems at a much larger scale than we have traditionally been accustomed to.

Amandeep Khurana
Amandeep Khurana (Cloudera)

Amandeep is a Solutions Architect at Cloudera where he’s involved in the entire lifecycle of Hadoop adoption for customers – from use case discovery to taking systems to production. Amandeep is also a co-author of HBase In Action, a book geared towards building applications using HBase. Prior to Cloudera, Amandeep was at Amazon Web Services, where he was a part of the Elastic MapReduce team and built the first version of EMR’s HBase offering.

Robert Kirkpatrick (UN Global Pulse), @unglobalpulse

Robert Kirkpatrick is Director of the Global Pulse initiative of the United Nations Secretary-General, which aims to harness big data for a real-time understanding of human well-being.

Robert’s activities in government, academia, UN agencies, NGOs and the private sector have focused on developing innovative solutions at the intersection of technology, policy and social change. Robert has more than 15 years of experience in the design and use of advanced technology tools in business, public health, disaster relief, security coordination, citizen journalism, telemedicine, crisis monitoring, conflict mediation, and peace building activities. His work has focused on strengthening public policy, enhancing crisis resilience and catalyzing organizational change. Robert advocates open data, open standards, open source software, and participatory development.

He co-founded and led software development for two... Read More.

Josh Klahr
Josh Klahr (Pivotal)

Josh has been working with data and analytics since 2000, including being the product manager for the first “Datamart in a Box” (Broadbase) and running product management for one of the largest Data and Analytics operations in the world (Yahoo!). Josh is now applying these learnings at Pivotal, where he is building the industry’s first unified Big Data and Analytics Platform.

Greg Kleiman (Red Hat)

Greg Kleiman leads business strategy for the Storage Business Unit at Red Hat. With more than 25 years as a technology innovator, he is responsible for the overall direction of Red Hat Storage open source storage product portfolio with a focus on storage, big data and cloud architectures for enterprises.

With a broad technology background in hardware, software, and services, Greg has led key company initiatives across product management, product marketing, business development, and sales organizations. Prior to Red Hat, he held management positions at leading IT companies in product management, marketing, and business strategy, including NetApp and HP. He is a frequent speaker at IT conferences worldwide and covers topics such as storage, big data, and cloud.

Greg holds a masters in business administration... Read More.

Donna Knutt (Black Girls Code), @donnaknutt

Donna Knutt is a mom, host, and serial entrepreneur. She is the founder of LuxieLab.com, a Marketing & Web Design studio that specializes in creating professional websites for companies looking to plan, launch, or grow their business. She is also the Co-Tech Lead of the NY Chapter of Back Girls Code, a nonprofit dedicated to empowering young girls by teaching them to be innovators and leaders in STEM fields. When Donna isn’t brainstorming with other passionate techies and entrepreneurs, she’s usually travelling, exercising, or spending time with family and friends. You can find Donna on twitter (@donnaknutt) where she shares motivational messages, tips, & advice on living a full life through entrepreneurship & service.

Samuel Kommu
Samuel Kommu (Cisco Systems)

Samuel currently works at Cisco Systems, Data Center Group focusing on profiling network and compute patterns of various applications. His focus also includes low latency networks, SDN, monitoring, management and automation.

Samuel is a certified Hadoop Developer and Administrator. In his spare time, he enjoys learning and playing various musical instruments.

Danielle Kramer (Palantir Technologies)

Danielle Kramer leads the Backend Infrastructure Engineering Team for the Palantir Gotham Data Fusion Platform. An accomplished software engineer in her own right, Danielle herds the cats that build the low-level data plumbing that makes the magic Palantir’s platforms possible.

Julien Le Dem
Julien Le Dem (Apache Parquet), @J_

Julien is the lead for Parquet’s java implementation. He also leads Pig development at Twitter and is the Apache Pig PMC Chair. His French accent makes his talks attractive.

Yann LeCun

Yann LeCun is a computer science researcher with contributions in machine learning, computer vision, mobile robotics and computational neuroscience. He is well known for his work on optical character recognition and computer vision using convolutional neural networks. He is also one of the main creators of the DjVu image compression technology (together with Léon Bottou and Patrick Haffner). He co-developed the Lush programming language with Léon Bottou.

Haoyuan Li
Haoyuan Li (Alluxio), @haoyuan

Haoyuan Li is founder and CEO of Alluxio (formerly Tachyon Nexus), a memory-speed virtual distributed storage system. Before founding the company, Haoyuan was working on his PhD at UC Berkeley’s AMPLab, where he cocreated Alluxio. He is also a founding committer of Apache Spark. Previously, he worked at Conviva and Google. Haoyuan holds an MS from Cornell University and a BS from Peking University.

Nong Li
Nong Li (Cloudera)

Nong Li is a software engineer at Cloudera working on the RecordService and Impala projects. Before joining Cloudera, he worked at Microsoft developing new APIs for the Windows graphics system (DirectX). Nong holds a Sc.B. in computer science from Brown University.

Chris Lintz (Comcast)

Chris Lintz is currently a Principal Engineer and team lead for the Comcast VIPER (Video IP Engineering and Research) Big Data team. His 18 years in software development and architecture span many different industries including Entertainment, Telecommunication and Defense. Prior to Comcast he was an Application team lead at OneRiot (acquired by Walmart) helping build a mobile real-time social ad platform. He holds a degree in Computer Science from the University of Colorado at Colorado Springs.

Barry Livingston (Riot Games)

Barry Livingston has a long, grizzled history with scalable, back-end service development. A foundational member of Riot Games’ “Platform” engineering team, Barry co-architected and led the implementation of much of the online service structure that powers League of Legends’ 32 million monthly active users today. He is also the only “ginger” known to have survived the initial League of Legends launch. In addition to serving as the Director of Engineering for Riot’s Big Data group, Barry is working on a recently declassified project with the US government aimed at officially determining how many yordles are killed in action during a typical day on the Fields of Justice.

Steve  Lohr
Steve Lohr (The New York Times | Brown Institute for Media Innovation at Columbia University), @SteveLohr

Steve Lohr reports on technology, business and economics. He was a foreign correspondent for the Times for a decade and served brief stints as an editor, before covering technology, starting in the early 1990s.

In 2013, he was part of the team awarded the Pulitzer Prize for Explanatory Reporting “for its penetrating look into business practices by Apple and other technology companies that illustrates the darker side of a changing global economy for workers and consumers.”

He has written for magazines including The New York Times Magazine, The Atlantic Monthly and The Washington Monthly. He is the author of a history of computer programming, “Go To: The Story of the Math Majors, Bridge Players, Engineers, Chess Wizards, Maverick Scientists and Iconoclasts — The Programmers Who... Read More.

Jorge A. Lopez
Jorge A. Lopez (Amazon Web Services), @zanilli

Jorge A. Lopez
Director, Product Marketing – Data Integration, Syncsort

With over 13 years of experience in Business Intelligence and Data Integration, Jorge A. Lopez is responsible for data integration product marketing and strategy at Syncsort. Prior to joining Syncsort, Mr. Lopez was a Senior Product Manager at MicroStrategy, where he oversaw the technical direction of key business intelligence products and lead performance and scalability initiatives. Mr. Lopez holds a Master of Business Administration degree from Virginia Polytechnic Institute and State University, and bachelor’s degree in electronic systems engineering from the Monterrey Institute of Technology in Mexico City.

Giorgia Lupi

Giorgia Lupi is an information designer and researcher.
Merging her past design experiences, in 2011 she co-founded Accurat, where she is workinig as a design director. Accurat is an information design company based in Milan and New York city. Accurat researches and analyses data and context, and design visual tools and narratives to bring information alive, raise awareness, understanding and engagement. By using an iterative design process of perpetual prototyping to develop data visualisations, interactive applications and multimedia environments, Accurat helps new stories and phenomena emerge from an increasingly complex world.

Since 2011 Giorgia is also researching as a PhD student at Politecnico di Milano, at the Design Faculty as well within Density Design Lab, which focuses on the visual representation of complex social,... Read More.

Mikhail Lyukmanov

As the Vice President of Data Services for Adconion, Mikhail is responsible for the leading the company’s data technology, data engineering, and machine learning algorithms, creating an industry leading distributed data warehouse and data reporting platforms processing over 40 billion audience events monthly. Mikhail Lyukmanov has over a decade of experience in both engineering and technology leadership positions, having proven track of successful implementations in Publishing, Healthcare, and Internet Media industries. Prior to joining Adconion, Mikhail held a leadership role within the Data Architecture department of one of the most innovative publishing companies, Edmunds.com. Mikhail graduated from Nizhni Novgorod State University in Russia with an MS in Automation Engineering.

Mikhail helps organizing regular discussion forums at Adconion, where Los Angeles’ top data scientists share their... Read More.

Roger Magoulas
Roger Magoulas (O'Reilly Media), @rogerm

Roger Magoulas is the research director at O’Reilly Media and chair of the Strata + Hadoop World conferences. Roger and his team build the analysis infrastructure and provide analytic services and insights on technology-adoption trends to business decision makers at O’Reilly and beyond. He and his team find what excites key innovators and use those insights to gather and analyze faint signals from various sources to make sense of what others may adopt and why.​

Justin Makeig
Justin Makeig (MarkLogic)

Justin Makeig is a Director of Product Management at MarkLogic where he oversees the suite of applications, tools, and APIs built around MarkLogic’s enterprise NoSQL database. He manages the company’s Hadoop strategy along with front-end application development and administration tools. Justin has over 10 years of experience designing, developing, and bringing to market data-driven applications for start-ups and large organizations using web and Big Data technologies. He holds an MBA from the University of California, Berkeley.

Colin Marc

Colin Marc is a developer at Stripe, where he’s recently been spending his building analytics and modeling infrastructure. Besides programming all the things, Colin is also interested in Tuvan throat-singing and iambic tetrameter.

Will Marshall
Will Marshall (Planet Labs), @wsm1

Will is responsible for setting the company’s vision and and for architecting the company strategy. Previously, Will was a Scientist at NASA/USRA where he served as Co-Investigator for PhoneSat, Science Team member on the LCROSS and LADEE lunar missions. He led research projects in orbital space debris remediation. Will has published over 30 articles in scientific publications. Will received his Ph.D. in Physics from the University of Oxford and was a Postdoctoral Fellow at Harvard University.

Sahar Massachi (Independent), @sayhar

Sahar is a startup founder, online organizer, and software developer recently out of Brandeis with a BS and MA in Computer Science. After founding a start up to provide community organizations with better mobile tools, he joined up with the Wikipedia fundraising team to help complete a review of their AB testing methods, analyzing and preparing for publication hundreds of past AB tests.

Q McCallum
Q McCallum (@qethanm)

Q. Ethan McCallum works as a professional-services consultant. He is eager to help businesses improve their standing – in terms of reduced risk, increased profit, and smarter decisions – through practical applications of data and technology.

His written work has appeared online and in print, including Parallel R: Data Analysis in the Distributed World and Bad Data Handbook: Mapping the World of Data Problems. He also speaks at conferences and user groups on business, data, and technology.

Dan McClary

Dr. Dan McClary serves as Principal Product Manager for Big Data and Hadoop at Oracle. Prior to joining Oracle he served as Director of Business Intelligence at Red Robot Labs in Palo Alto, CA. He previously was as a Visiting Scholar at Northwestern University where his research focused on applying Hadoop to large scale graph problems. Dr. McClary holds a PhD in Computer Science from Arizona State University, where his work centered on adaptive optimization of mobile and ad hoc networks. He holds an M.S. in Computer Science from Arizona State focused on real-time scheduling in distributed systems.

Stephen McDaniel
Stephen McDaniel (Freakalytics)

Stephen McDaniel is Chief Data Officer Advisor at Freakalytics, LLC. He advises Chief Data Officers and CIO’s on how to develop and implement successful strategies for all aspects of data utilization, from collection to integration to storage to effective analytics that lead to greater profits and happier customers. He has led and provided vision for data warehousing, business intelligence and advanced analytic teams at over two hundred companies including Loudcloud, Tableau Software, SAS Institute, Brio Technology and Netflix. Stephen is co-author of The Accidental Analyst, SAS for Dummies, Rapid Graphs with Tableau, and the Rapid Dashboards Reference Card.

Richard McDougall

Richard McDougall is the Application Infrastructure CTO and Principal Engineer in the Office of the CTO at VMware. He is responsible for driving advanced development and strategy for VMware’s application platform architecture – including the performance and integration of applications, runtimes, middleware, and application encapsulation technologies.

Richard’s is known as an expert in the areas of performance measurement and optimization, and in application deployment architectures.

Before the CTO office, as the Chief Performance architect Richard drove the performance strategy and initiatives to enable virtualization of high-end mission critical applications on VMware products.

Prior to joining VMware, Richard was a Distinguished Engineer at Sun Microsystems. During his 14 years at Sun, he was responsible for driving high performance and scalability initiatives for Solaris... Read More.

Wes McKinney
Wes McKinney (Two Sigma Investments), @wesmckinn

Innovating analytics and data visualization tools. Author of “Python for Data Analysis” from O’Reilly Media. Author of pandas library, contributor to statsmodels and other Python projects.

Stephanie McReynolds (ClearStory Data), @slangenfeld

Stephanie McReynolds is Vice President of Marketing at ClearStory Data, which helps businesses analyze and explore Big Data from public and private sources. Previously, Stephanie was Senior Director of Product & Technical Marketing at Teradata Aster, where she brought to market the Teradata Aster MapReduce Platform. Stephanie joined Teradata through the acquisition of Aster Data.

Stephanie has more than a decade of experience in product management and marketing for business intelligence, data warehouse and complex event processing products at organizations including, Oracle, PeopleSoft, and Business Objects.

Douglas Merrill

Dr. Douglas C. Merrill is the founder and CEO of ZestFinance, a financial services technology startup dedicated to serving the needs of the underbanked. He is also the author of Getting Organized in the Google Era, a book on personal and workplace organization published by Random House. Previously, Merrill was CIO and VP of Engineering of Google Inc. where he oversaw all aspects of internal engineering, including Google’s 2004 IPO. He most recently served as COO of New Music and President of Digital Business at EMI Music. Merrill holds an MA and Ph.D. in Psychology from Princeton University, and a BA from the University of Tulsa in Social and Political Organization.

Mark Mims
Mark Mims (Silicon Valley Data Science), @m_3

Mark’s a physicist by training and programmer by trade.

He’s architected data-driven solutions, on both bare metal and clouds, across a
variety of industries including Energy, Education, and Commercial Modeling and Simulation.

Mark received a doctorate in Mathematical Physics from UT Austin for research
simulating quantum algorithms. He is interested in what it takes to train data
scientists and is working to add “data science” tracks to various degree programs
at Stanford and Utah State University.

Mark’s passion is Data Plumbing, where Data Science meets the real world of
DevOps and Infrastructure Engineering. He is currently employed by Canonical
building DevOps tools for Ubuntu Server and making sure that the Ubuntu Server operating system meets the
needs of... Read More.

Sharmila Mulligan
Sharmila Mulligan (ClearStory Data), @ShahaniMulligan

Sharmila is CEO and founder of ClearStory Data. She has spent 18+ years building game-changing software companies in a variety of markets. Sharmila has been EVP & CMO at numerous software companies, including Netscape, Kiva Software, AOL, Opsware, and Aster Data. She drove the creation of several multi-billion dollar market categories, including application servers, data center automation and big data analytics. She is on the board of Hadapt and Lattice Engines, advisor to numerous companies, large and small, and an active investor in early stage companies.

Sean Murphy
Sean Murphy (PingThings), @sayhitosean

Sean Patrick Murphy, with degrees in math, electrical engineering, and biomedical engineering and an MBA from Oxford, has served as a senior scientist at Johns Hopkins University for over a decade, advises several startups, and provides learning analytics consulting for EverFi. Previously, he served as the Chief Data Scientist at a series A funded health care analytics firm, and the Director of Research at a boutique graduate educational company. He has also cofounded a big data startup and Data Community DC, a 2,000 member organization of data professionals.

Arun Murthy (Hortonworks), @acmurthy

Arun is the lead of the MapReduce project in Apache Hadoop where he has been a full-time contributor to Apache Hadoop since its inception in 2006. He is a long-time committer and member of the Apache Hadoop PMC and jointly holds the current world sorting record using Apache Hadoop. Prior to co-founding Hortonworks, Arun was responsible for all MapReduce code and configuration deployed across the 42,000+ servers at Yahoo!. In essence, he was responsible for running Apache Hadoop?s MapReduce as a service for Yahoo!. Twitter: @acmurthy. He is directly responsible for every bit of code and configuration of Map-Reduce deployed at over 40,000 machines running Apache Hadoop.

Aaron Myers
Aaron Myers (Cloudera, Inc.), @atm

Aaron T. Myers is a Software Engineer at Cloudera and an Apache Hadoop Committer. Aaron’s work is primarily focused on HDFS. Prior to joining Cloudera, Aaron was a Software Engineer and VP of Engineering at Amie Street, where he worked on all components of the software stack, including operations, infrastructure, and customer-facing feature development. Aaron holds both an Sc.B. and Sc.M. in Computer Science from Brown University.

Prakash Nanduri (Paxata)

Prakash Nanduri, co-founder and CEO of Paxata, is a seasoned entrepreneur and enterprise software executive with over 20 years of experience in both start-ups and large companies. As co-founder and Vice President of Velosel Corporation, Prakash pioneered Master Data Management (MDM) in 1999, a segment which Gartner estimates to be a $3.2 billion market by 2015. Velosel was acquired by TIBCO in 2005 and has been a significant contributor to the overall growth of the company in recent years. Prakash also spent three years leading Product and Technology Strategy within the Office of the CEO at SAP, where he worked on key strategic initiatives including the SAP Cloud and Big Data (Hana) business strategy.

Jonathan Natkins

Jonathan Natkins is a Member of Technical Staff on the Field Engineering team at WibiData. He helps customers use their data to create better application experiences. Prior to WibiData, Jonathan was an engineer at Cloudera, working primarily on Cloudera Manager and contributing to various Hadoop related projects. Before joining Cloudera, Jonathan worked both as an engineer and a field engineer at Vertica, first building core database features and then working closely with customers to help them move their systems into production. Jonathan holds an Sc.B in Math-Computer Science from Brown University.

Vaibhav Nivargi
Vaibhav Nivargi (ClearStory Data), @vnivargi

Vaibhav is an expert in query optimization, high performance data processing and distributed systems. He was one of the first engineers at Aster Data, developing key areas of the Aster MapReduce Platform. At NetApp, he contributed to patent-pending technology for the WAFL filesystem, significantly boosting boot times. His prior experience includes optimizing the Linux kernel for multi-core processors and distributed file systems. Vaibhav graduated from Stanford University with a Masters in Computer Science and a specialization in distributed systems and machine learning.

Jack Norris
Jack Norris (MapR Technologies), @Norrisjack

Jack has over 20 years of enterprise software marketing experience. He has demonstrated success from defining new markets for small companies to increasing sales of new products for large public companies. Jack’s broad experience includes launching and establishing analytic, virtualization, and storage companies and leading marketing and business development for an early-stage cloud storage software provider. Jack has also held senior executive roles with EMC, Rainfinity, Brio Technology, SQRIBE, and Bain and Company. Jack earned an MBA from UCLA Anderson and a BA in economics with honors and distinction from Stanford University.

Owen O'Malley
Owen O'Malley (HortonWorks), @owen_omalley
Owen has been contributing to Apache Hadoop since before it was first called Hadoop. He was the first committer added to the project and has provided technical leadership on MapReduce, and security. Using Hadoop in 2008 he set the world record for sorting a terabyte of data in 3.5 minutes and in 2009 he sorted a petabyte in 16.25 hours. In 2011, Own co-founded Hortonworks, which commercially supports and trains users of the Hadoop ecosystem. Prior to Hortonworks, Owen worked on Yahoo! Search’s WebMap project, which built the know web. Once ported to Apache Hadoop, it became the single largest low Hadoop application.
Marie O'Reilly
Marie O'Reilly (International Peace Institute)

Marie O’Reilly is the Associate Editor at the International Peace Institute (IPI) and a co-author of the IPI report New Technology and the Prevention of Violence and Conflict. Prior to joining IPI, Marie conducted research on the impact of natural resources on peacebuilding in South Sudan for the United Nations Department of Peacekeeping Operations and developed conflict-prevention strategies targeting youth for the United Nations Development Programme in Lebanon. She has also worked in management consulting with Accenture, on human rights and video advocacy with Witness, and on event planning and communications at the Center for International Studies and Research (CERI) in Paris.

Stephen O'Sullivan
Stephen O'Sullivan (Silicon Valley Data Science), @steveos

A leading expert on big data architecture and Hadoop, Stephen brings over 20 years of experience creating scalable, high-availability, data and applications solutions. A veteran of WalmartLabs, Sun and Yahoo!, Stephen leads data architecture and infrastructure.

Mike Olson
Mike Olson (Cloudera), @mikeolson

Mike Olson cofounded Cloudera in 2008 and served as its CEO until 2013, when he took on his current role of chief strategy officer. As CSO, Mike is responsible for Cloudera’s product strategy, open source leadership, engineering alignment, and direct engagement with customers. Previously, Mike was CEO of Sleepycat Software, makers of Berkeley DB, the open source embedded database engine, and he spent two years at Oracle Corporation as vice president for embedded technologies after Oracle’s acquisition of Sleepycat. Prior to joining Sleepycat, Mike held technical and business positions at database vendors Britton Lee, Illustra Information Technologies, and Informix Software. Mike holds a bachelor’s and a master’s degree in computer science from the University of California, Berkeley.

Richard Park (SkipFlag)

Richard Park is a software engineer at LinkedIn and has been a member of their Hadoop Developer group since 2009. He has been an instrumental part of developing LinkedIn’s Hadoop infrastructure. He is the lead developer on Azkaban and has contributed to open-source projects including Apache Kafka. He has previously worked at PayPal in the fraud detection group.

Yong Ik Park
Yong Ik Park (LG CNS )

Yong Ik Park is a senior director of Advanced Analytics Business Unit at LG CNS, a global IT service company delivering values through big data solutions such as hadoop-based big data platform and advanced analytics services. He played a main role to establish this business in LG CNS since 2010 by leading the Smart Big Data Platform and Appliance development. He is now developing the vertical solutions which contain the expertise captured from the big data business and analysis experiences so far. Yong Ik’s career also spans as strategy and business consultant in various industries. Prior to joining LG CNS, he was a manager and director in Arthur Anderson and BearingPoint. Yong Ik holds Master’s and Ph.D in Economics from Yonsei University,... Read More.

David Parker
David Parker (SAP)

David is responsible for the solution packaging and pricing, definition and execution of the Big Data organization go-to-market strategy, education and enablement. David is a seasoned IT professional with over 28 years within the finance and banking industry, covering both business and technical areas.

David has also held several senior management positions in other industries including retail, telecommunications and academia, providing architectural solutions and consultative services for real-time analytics and data warehouse projects.

In early 2000, David joined Aleri as vice president of services to drive its Complex Event Processing (CEP) product into new markets in the UK, and subsequently moving to New York to help grow the company. David was responsible for customer adoption and success, the internal IT infrastructure and management of... Read More.

Feng Peng (LinkTime Cloud)

Feng Peng is the tech lead of Analytics Data Pipeline at Twitter. His current work focuses on ETL/workflow tools and data pipeline management. Prior to Twitter, he was a Principal Software Engineer and Director of Analytics at Ask.com, where he led the analytics team to build the Hadoop analytics infrastructure and successfully migrated the legacy analytics applications to the new platform. Feng has a Ph.D. in Computer Science from University of Maryland, College Park.

Claudia Perlich

Prior to joining Dstillery (former Media6Degrees), Claudia spent five years working at the Data Analytics Research group at the IBM T.J. Watson Research Center, concentrating on research in data analytics and machine learning for complex real-world domains and applications. She has been published in over 30 scientific publications and holds multiple patents in the area of machine learning. Claudia has won many data mining competitions, including the prestigious 2007 KDD CUP on movie ratings, the 2008 KDD CUP on breast-cancer detection, and the 2009 KDD CUP on churn and propensity predictions for telecommunication customers. Claudia received her Ph.D. in Information Systems from Stern School of Business, New York University in 2005 and holds a Master of Computer Science... Read More.

Chris Perry
Chris Perry (International Peace Institute)

Chris joined IPI as a Policy Analyst with the Coping with Crisis team in July 2008. His current work deals with a range of international peace and security issues through the lens of multilateralism. His research focuses on applying data science to problems of sustainable development, conflict mediation and prevetion, and peacekeeping. His most recent work involved mining UN archival documents to create a database of UN Peacekeeping contributions over time, which can be used to test hypotheses regarding the motivations for contributing to UN peacekeeping missions.
Chris holds a master’s in public administration from the Daniel J. Evans School (University of Washington) with a concentration in international development policy and a BA in philosophy from the University of Washington.

Vaclav Petricek

Vaclav Petricek is a Principal Data Scientist at Santa Monica based eHarmony where he is responsible for optimization, and machine learning applications for eHarmony core matchmaking algorithms. He also runs a series of invited ML talks at eHarmony, part of the Los Angeles Machine Learning Meetup. Prior to eHarmony, Vaclav was visiting Researcher at University College London where his research spanned recommender systems, social networks, web structure and online auctions. Prior to that he has worked at several Czech startups as a developer and sysadmin. He earned his PhD in Computer Science from Charles University in Prague as well as his Masters in Distributed Systems.

http://www.linkedin.com/in/petricek

Antonio piccolbo@gmail.com

Antonio Piccolboni is a data scientist with both industrial and academic experience. His recent work includes the design and implementation of a big data analysis package in R, social network analysis for a top 20 global web site and web analytics for a major web ratings company. He is currently an independent consultant with clients including Dataspora and Revolution Analytics. He blogs at blog.piccolboni.info about big data and analytics. His papers have received more than 4000 citations and his Erdős number is 3.

Tanel Poder

Tanel Poder is one of the leading Oracle Database performance specialists in the world, having helped solve complex problems for customers in 25 countries on five continents. He works for Enkitec as a consultant and technology evangelist, focusing on advanced Linux, Oracle and Hadoop performaance topics, including optimization of Oracle’s Big Data Appliance and Exadata systems. Tanel has received the “Oracle ACE Director” recognition for his community contribution, research and publications, he is also a co-author of the Expert Oracle Exadata book.

Foster Provost
Foster Provost ( NYU | Stern ), @FakeFoster

Foster Provost is coauthor of the O’Reilly best-selling book, Data Science for Business (http://data-science-for-biz.com). He has designed data science solutions for businesses for over two decades, and has co-founded several successful companies focusing on data science for advertising (incl., Dstillery & Integral Ad Science). In his current job as Professor and NEC Faculty Fellow at the NYU Stern School of Business, Foster teaches in the MS in Data Science, MS in Business Analytics, MBA, and PhD programs. His data science research has won many awards and is broadly cited. He served as Program Chair for the ACM SIGKDD Conference and for many years as Editor-in-Chief for the journal Machine Learning.

Ahmed Radwan
Ahmed Radwan (Google's Motorola Mobility), @a_radwan

I have a diversity of interests focused around areas of cloud computing, data/metadata management, semantics and data integration, with emphasis on using advancements in these areas to build solutions that are useful for customers. I am an Apache Sqoop PMC member and committer, Apache Flume PMC member and committer.

Currently I am working at Google’s Motorola Mobility as part of the Cloud team. Before that, and after finishing my PhD, I worked at Yahoo and Cloudera.

Greg Rahn
Greg Rahn (Cloudera), @gregrahn

Greg Rahn is a Solutions Architect in the Partner Engineering group at Cloudera. He focuses on helping Cloudera’s hardware partners optimize their platforms for Hadoop. In addition, he works on performance engineering and benchmarking for the Impala project. Before joining Cloudera, Greg worked for eight years as a database performance engineer at Oracle in the esteemed Real-World Performance Group in the Server Technologies organization.

Anand Raman
Anand Raman (Impetus Technologies, Inc.)

As Vice President of Sales and Account Management, Anand Raman is responsible for business growth at Impetus—from sales strategy ideation and execution of Big Data strategy for Fortune 500 companies.

With over 20 years of experience in the IT industry, Anand has a successful track record in working with large clients and come up with a solution based approach, improve business strategy and strengthen the creation of engagement models.

In his previous roles, Anand led Business Development and Global outsourcing. He competently built the IT sales team and our Big Data practice from scratch, demonstrating monumental entrepreneurial drive to make new projects successful.

Anand holds an MBA and Bachelors degree in Mathematics. He began his career in 1993 as a programmer, developing enterprise applications... Read More.

Nirmal Ranganathan
Nirmal Ranganathan (Rackspace)

Nirmal Ranganathan is a Principal Engineer working on the Data Stores Platform at Rackspace. He constantly works with various teams within Rackspace and customers alike, directing them on how best to take advantage of Big Data technologies. Nirmal plays an active role in the local Austin tech scene by volunteering for organizing meetups and other events in the Austin area. Nirmal was one of the founding members of Trove (Openstack’s Database as a Service) and has contributed to various Openstack initiatives, Cassandra, Alluxio and Thrift.

Syed Rasheed
Syed Rasheed (Red Hat)

As Solution Marketing Manager at Red Hat, Syed Rasheed coordinates marketing, evangelism, and consulting activities. In addition to helping customers address Integration challenges today, Syed is responsible for working with customers, partners, and industry analysts to ensure the next generation of Red Hat technology meets customer’s requirements for building Business Process Automation and Integration solutions. Syed is 18 years veteran of IT industry with extensive experience in the Business Process Management Systems, Business Intelligence and Data Management technologies areas. His work spanned several industries including financial services, banking and telecommunication.

lover of travel & photography, math + art & written word nerd, designer @IDEO & digital lead @cstiming.

Nelson Ray
Nelson Ray (Metamarkets)

Nelson Ray develops data-analytical algorithms as a software engineer at Metamarkets. He holds a B.S., M.S., and Ph.D. in statistics from Stanford University, where he wrote his thesis on the use of machine learning techniques for statistical inference. He has prior experience running large-scale, adaptive experiments at Facebook.

Joseph Rickert
Joseph Rickert (Revolution Analytics)

I am a marketing manager at Revolution Analytics with a passion for analyzing data. I have worked a number of successful Silicon Valley start-ups including Sytek, Alantec, Parallan Computer and Scotts-Valley Instruments. I have graduate degrees in both the Humanities and Statistics. I taught statistics briefly at SJSU and I blog at blog.revolutionanalytics.com

Naomi Robbins

Naomi B. Robbins is the author of Creating More Effective Graphs and blogged on effective graphs for Forbes. She is a consultant, keynote speaker, and seminar leader who specializes in the graphical display of data. She trains employees of corporations and organizations on the effective presentation of data with customized programs. She also reviews documents and presentations for clients, suggesting improvements or alternative presentations as appropriate. Naomi received her Ph.D. in mathematical statistics from Columbia University, M.A. from Cornell University, and A.B. from Bryn Mawr College. She had a long career at Bell Laboratories before forming NBR, her consulting practice.

Henry Robinson
Henry Robinson (Cloudera), @henryr

Henry Robinson is a software engineer at Cloudera, where he has worked for several years on a variety of distributed systems. Most recently he helped build the first versions of Cloudera Impala, a low-latency SQL query engine for data stored inside HDFS.

Julie Rodriguez
Julie Rodriguez (Eagle Investment Systems), @juliargentinag

Julie Rodriguez is a Boston-based Information Architect with experience in user research, analysis and design for complex systems. Within the global markets domain, Julie has delivered pioneering solutions in such areas as wealth management, investment research, securities lending, commodities and retail and institutional trading platforms. She’s patented her work in commodities trading and data visualizations for MATLAB.

Julie holds a BA in Industrial Design from Carnegie Mellon University.

Scott Rose (Think Big Analytics), @scottmrose

Scott Rose leads Think Big’s services practice, helping clients and strategic partners to successfully deliver Big Analytics solutions that drive value. Scott brings nearly 25 years of technology consulting experience having held executive positions at established and start-up companies. He specializes in creating business innovation utilizing new technologies and techniques. Most recently, Scott served as VP Strategy and Business Development for Inkling Markets, a predictive analytics software provider. Prior to Inkling, Mr. Rose was a Partner at Accenture (Technology Practice) where he ran consulting engagements in the US, Europe and Asia. Scott holds a MS in Computer Science from Northwestern University and a BS in Electrical Engineering from the University of Illinois at Urbana-Champaign.

Rob Rosen
Rob Rosen (Pentaho)

Rob Rosen leads Big Data Go-to-Market for industry-leading analytics and data integration software supplier Pentaho. He has led Big Data initiatives and field technical teams for a number of software and infrastructure vendors, most recently Hadoop distributor MapR Technologies. Prior to MapR, Rob led a variety of pre- and post-sales teams, most notably for storage infrastructure leader NetApp, enterprise security leader Check Point Software Technologies and Unix pioneer Sun Microsystems. He holds a B.S. in Electrical Engineering and Computer Science from the University of California, Berkeley.

Ulrich Rueckert
Ulrich Rueckert (Datameer)

Ulrich Rueckert is Data Scientist at Datameer. Prior to Datameer he worked as a research scholar at UC Berkeley and the International Computer Science Institute. His research on machine learning and data mining has been published in renowned journals and has won awards at international conferences. Ulrich serves on the program committees of the main machine learning conferences and he has organized workshops and held tutorials on his research.

Matthew Russell
Matthew Russell (Digital Reasoning), @ptwobrussell

Matthew Russell (@ptwobrussell) is Chief Technology Officer at Digital Reasoning where he is responsible for the company’s technology vision, strategy and direction. He is also the author of several books on technology including Mining the Social Web (O’Reilly, 2013), now in its second edition. He is passionate about open source software development, data mining, and creating Web technologies to amplify human intelligence. Matthew studied computer science and jumped out of airplanes at the United States Air Force Academy. When not solving hard problems, he enjoys practicing Bikram Hot Yoga, CrossFitting and participating in triathlons.

Tony Salvador
Tony Salvador (Intel Corporation )

Dr. Tony Salvador, Senior Principal Engineer, currently directs research in the Experience Insights Lab within Intel Corporation. His team’s role is to identify new, strategic opportunities for technology based on an understanding of fluctuating, global socio-cultural values. Tony leads a team of social scientists and business analysts to look for, find and develop viable opportunities to create local, sustainable value with new high tech products, services and infrastructures. His ongoing research interests concern disruptive innovation practice, development and new market creation with an ethnographic perspective.

Previously, he directed research for the Emerging Markets Platforms Group and was instrumental in the research and design of the Intel powered classmate PC. Prior to that he was a research scientist and co-founder of Intel’s People & Practices Group.

... Read More.
Eric Sammer
Eric Sammer (Rocana), @esammer

Eric Sammer is currently a Principal Solution Architect at Cloudera where he helps customers plan, deploy, develop for, and use Hadoop and the related projects at scale. His background is in the development and operations of distributed, highly concurrent, data ingest and processing systems. He’s been involved in the open source community and has contributed to a large number of projects over the last decade.

Eddie Satterly
Eddie Satterly (Splunk)

Eddie has played a key role in big data adoption at his former employers and was a big data guy back in Telco during the 90’s (before it was cool). He plays a key role with Splunk in adoption of the product as well as in partnerships with the big data community.

Peter Schlampp

Pete is the VP of Products at Platfora. He is responsible for early development, product design, and the roadmap for Platfora’s products. Pete spends most of his time working with enterprise customers to solve their big data BI needs. Prior to Platfora, he was VP of Products and Marketing at Solera Networks, building network security analytics products to store, search, and analyze every packet traversing 10 GigE networks. Early in his career he lead Products for IronPort Systems – the first company to solve spam at Internet scale – which was acquired by Cisco Systems in 2007.

Matt Schumpert
Matt Schumpert (Datameer)
Matt has been working in enterprise software of over 10 years in various capacities, including sales engineering, strategic alliances and consulting. Matt currently runs the pre-sales engineering team at Datameer, supporting all technical aspects of customer engagement through roll-out of customers into production.Matt holds a BS in Computer Science from the University of Virginia.
Baron Schwartz
Baron Schwartz (VividCortex), @xaprb

Baron Schwartz is founder and CEO of VividCortex, the best way to see what your production database servers are doing. He is the lead author of High Performance MySQL and a variety of open source software.

Russell Sears (Microsoft)

Russell Sears is one of the core developers of REEF in Microsoft’s new Cloud and Information Services Laboratory, and was previously a member of Yahoo! Research. He obtained has PhD at UC Berkeley, where he was advised by Eric Brewer. He works on scalable storage systems for analytical processing and log-structured indexing for online and low-latency applications.

Chris Selland
Chris Selland (HP Vertica), @cselland

Chris Selland is vice president of Marketing for HP Vertica. In this role, he leads global marketing for the HP Vertica Analytics platform.
Selland has more than 20 years of experience in online, search and inbound marketing programs. He also has led strategic alliance and corporate development initiatives for entrepreneurial, high-growth companies.
Selland is an established thought leader, speaker and author on customer strategy-related topics, including social media analytics and marketing, customer relationship management, customer analytics, metrics and loyalty.
Earlier in his career, Selland was vice president of CRM and Internet Research at the Yankee Group, and later he founded Reservoir Partners, a customer strategy research firm that merged with Aberdeen Group.
He received a bachelor’s degree in operations research... Read More.

Giovanni Seni (Intuit)

Giovanni Seni is currently a Senior Data Scientist with Intuit where he leads the Applied Data Sciences team. As an active data mining practitioner in Silicon Valley, he has over 15 years R&D experience in statistical pattern recognition and data mining applications. He has been a member of the technical staff at large technology companies, and a contributor at smaller organizations. He holds five US patents and has published over twenty conference and journal articles. His book with John Elder, “Ensemble Methods in Data Mining – Improving accuracy through combining predictions”, was published in February 2010 by Morgan & Claypool. Giovanni is also an adjunct faculty at the Computer Engineering Department of Santa Clara University, where he teaches an Introduction to Pattern Recognition and Data... Read More.

Siddharth Seth (Hortonworks Inc)

Siddharth Seth works as a software engineer at Hortonwork, where he works on the Apache Tez project and the Apache Hadoop project – with a focus on YARN and MapReduce. He is a member of the Apache Tez PPMC and the Apache Hadoop PMC. Prior to this he spent several years working on search platforms and Oozie at Yahoo.

Hitesh Shah (Hortonworks Inc)

Hitesh Shah currently works on various things related to Apache Hadoop at Hortonworks. He is an Apache Hadoop committer as well as a PPMC member of the Apache Ambari and Apache Tez projects. Earlier to that, he spent 8+ years at Yahoo! building various frameworks all the way from data storage platforms for social content to a multi-threaded event-driven framework for building high throughput advertising serving platforms.

Vin Sharma
Vin Sharma (Intel)

At Intel, Vin Sharma is responsible for strategic ecosystem initiatives driving adoption of open source solutions based on Intel datacenter technologies. In this role, Vin spearheads technical and marketing engagements with OEM and OSV partners working with open source projects such as Hadoop, OpenStack, KVM, and Linux. Before Intel, Vin worked at HP for 15 years, most recently as the business strategist for open source and Linux on HP servers. Vin has an academic background in history of technology and electrical engineering.

Jayant Shekhar
Jayant Shekhar (Sparkflows Inc.), @jshekhar

Jayant is Sr. Solutions Architect at Cloudera working with various large and small companies in various Verticals on their Big Data Use Cases, Architecture, Algorithms and Deployments. Prior to Cloudera Jayant worked at Yahoo where he was instrumental in building out the large scale Content/Listings Platform using Hadoop & Big Data technologies and working with various Yahoo Properties, Real Estate, Autos, Local, News, Movies etc. Prior to Yahoo, Jayant worked at eBay building out a new Shopping Platform (K2) using Nutch/Hadoop, Search Intelligence Platform, among others. Jayant has Bachelor’s degree in Computer Science from IIT Kharagpur and Master’s degree in Computer Engineering from San Jose State University

I am a data scientist in the Nordstrom Data Lab and a specialist in statistical computing. I am interested in the development of scalable machine learning methods with applications in recommendation systems, market segmentation and customer engagement. I have a master of science degree in biostatistics and a PhD in Bioinformatics, both from the University of Michigan in Ann Arbor. Big data problems are my passion.

Brett Sheppard
Brett Sheppard (Splunk)

Brett Sheppard is director of big data product marketing at Splunk Inc. Brett’s career combines roles as a data analyst at Gartner and U.S. Department of Defense and as a product and solution marketer at both large technology vendors and startups. Brett is a certified Hadoop system administrator. Based six years overseas, in Europe and Asia, Brett has enjoyed working onsite across the U.S. and in 35+ countries to help enterprises and public sector accounts evolve their data architectures to manage and benefit from big data.

Sumeet  Singh

Sumeet Singh is Head of Products for Cloud Services and Hadoop at Yahoo! responsible for Product Management, Customer Engagements, Evangelism and Community Development, and Program Management for the Cloud Engineering Group. In this role, he also leads the Hadoop products team responsible for both Apache open source contributions and Yahoo! projects. Sumeet has over 14 years of Product Management, Product Development, and Strategy Consulting experience in the technology industry. Sumeet earned his MBA from UCLA Anderson School of Management and MS from Rensselaer Polytechnic Institute, NY.

Mark Slusar
Mark Slusar (Allstate), @markslusar

Mark Slusar is currently a Research Fellow at Allstate where he is responsible for the research and prototyping in the corporate portfolio of big data projects. Mark has a diverse background in software development, operations, research, data science, leadership, design, and prototyping. He has worked across an extremely wide variety of disciplines and subjects of data joining many ideas, sources, and view points. This blending has positioned Mark as an inventor and extremely active contributor to many organizations intellectual property portfolios and disruptive product development.

Randy Smerik
Randy Smerik (Osunatech, Inc.), @randysmerik

Randy Smerik is an experienced serial entrepreneur, with over 20 years’ experience in the high technology fields of mobile, networking, database systems, and semi-conductors as well as businesses in the spirits and restaurant markets.

Randy’s background and experience includes: Currently Founder/CEO of Osunatech, Inc. Before Osunatech, Randy was Senior VP/GM of LSI Corp’s Networking business which he led after LSI acquired his company Tarari, Inc. Randy was the Founder and President/CEO of Tarari which focused on software and silicon solutions for deep packet inspection. Before Tarari, Randy was General Manager of Intel’s Network Equipment Division, VP of Product Development/Marketing at iPivot, and AVP running the Teradata Data Warehousing Division. Randy has also held executive and senior leadership positions at Read More.

Rod Smith
Rod Smith (IBM Emerging Internet Technologies ), @IBM

Rod Smith is an IBM fellow and Vice President of the IBM Emerging Internet Technologies organization, where he leads a group of highly technical innovators who are developing solutions to help organizations realize the value of big data. His early advocacy in the industry has played an important role in the adoption of technologies such as J2EE, Linux, Web services, XML, rich Internet applications, and various wireless standards.

As an IBM Fellow, Rod is helping lead IBM’s efforts around big data analytics and the application of IBM Watson like technologies to business solutions, helping companies make better decisions more quickly for improved business outcomes. His early identification of emerging technologies has led to a sustained record of achievement in the... Read More.

Zach Snyder
Zach Snyder (The Walt Disney Company)

Staff Systems Engineer with The Walt Disney Company. Over 8 years supporting large scale Internet operations delivering millions of page views per hour in a highly dynamic, and user-content driven environment.

Scott Sorensen (Ancestry.com)

Scott Sorensen has served as Ancestry.com’sChief Technology Officer since April 2013. Since joining the family history search giant in 2002, Scott has held multiple positions including Senior Vice President of Engineering, Vice President of Search and Vice President of Commerce and the first piece of code Scott wrote for the company is still used today. Prior to joining Ancestry.com, Scott was co-founder and Vice President of Engineering and then President at Coresoft Technologies. Scott was an engineering manager at WordPerfect / Novell and a software engineer at IBM. He holds a B.S in Computer Science from Brigham Young University.

Srini Srinivasan
Srini Srinivasan (Aerospike Inc.), @aerospikedb

Srini V. Srinivasan, Aerospike founder and vice president of engineering and operations brings 20-plus years of experience in designing, developing and operating Web-scale infrastructures, including Aerospike customers. He holds over a dozen patents in database, Internet, mobile, and distributed system technologies. Srini co-founded Aerospike to solve the scaling problems he experienced with Oracle databases at Yahoo! where, as senior director of engineering, he had global responsibility for the development, deployment and 24×7 operations of Yahoo!’s mobile products, in use by tens of millions of users. Srini joined Yahoo! as part of the Verdisoft acquisition, where as vice president of engineering, he oversaw the development of high-performance data synchronization products for mobile users. Srini also was chief architect of IBM’s DB2 Internet products, and he served... Read More.

M. C. Srivas

Srivas is CTO and Founder of MapR Technologies. Previously at Google, he ran one of the major search infrastructure teams where GFS, BigTable and MapReduce were used extensively. He wanted to provide that powerful capability to everyone, and started MapR on his vision to build the next-generation platform for semi-structured big data. His strategy was to evolve Hadoop and bring simplicity of use, extreme speed and complete reliability to Hadoop users everywhere, and make it seamlessly easy for enterprises to use this powerful new way to get deep insights. Srivas brings to MapR his experiences at Google, Spinnaker Networks (now NTAP), and Transarc (now IBM) in building game-changing products that advance the state of the art.

James Stewart
James Stewart (jystewart.net), @jystewart

Technical Architect of UK Government single-domain project, GOV.UK. East Londoner. Husband. Father.

Jim Stogdill
Jim Stogdill (O'Reilly Media, Inc.)

Jim Stogdill heads up O’Reilly’s Radar and Strata businesses. A lifelong technology practitioner he’s finding this media thing ridiculously fun. In a previous life he traveled the world with the U.S. Navy. Unfortunately from his vantage point it all looked like the inside of a submarine. He spends his free time hacking silver halides with decidedly low-tech gear. @jstogdill.

Ion Stoica (UC Berkeley)

Ion Stoica is a Professor of Computer Science at UC Berkeley, where he does research on cloud computing and networked computer systems. Past work includes the Dynamic Packet State (DPS), Chord DHT, Internet Indirection Infrastructure (i3), declarative networks, replay-debugging, and multi-layer tracing in distributed systems. His current research includes resource management and scheduling for data centers, cluster computing frameworks, and network architectures. He is the recipient of a SIGCOMM Test of Time Award, the CoNEXT Rising Star Award, the PECASE Award, and the ACM doctoral dissertation award. Ion also co-founded Conviva, a startup to commercialize technologies for large scale video distribution.

Dave Stokes
Dave Stokes (MySQL Community Team)

MySQL Community Manager

Tsz-Wo Sze (Hortonworks Inc.)

Dr. Tsz-Wo Nicholas Sze is a Member of Technical Staff at Hortonworks and also a Member of the Project Management Committee at Apache Hadoop. His interests include distributed computing, algorithms and mathematical analysis. Two of his recent Hadoop contributions were HDFS Snapshots and WebHDFS. He used Hadoop with Yahoo’s clusters to accomplish a new computation world record of Pi in 2010. He received his Ph.D. degree in Computer Science from the University of Maryland College Park in 2007, and his M.Phil. and B.Eng. degrees from the Hong Kong University of Science and Technology respectively in 2001 and 1999.

Anurag Tandon is responsible for product marketing and strategy of MicroStrategy’s big data and analytics capabilities. Previously, Mr. Tandon has led MicroStrategy’s product management and marketing efforts in leading technology areas such as social, mobile and high performance, in addition to spending over 10 years providing business intelligence solutions to numerous Fortune 1000 customers of MicroStrategy. Mr. Tandon holds a BS and MS in Mechanical Engineering from Indian Institute of Technology, Bombay and University of Maryland, College Park respectively, and an MBA from University of Michigan.

David Thompson
David Thompson (Western Union)

John David Thompson
Executive Vice President, Global Operations and Technology, Chief Information Officer

As Executive Vice President, Global Operations and Chief Information Officer, John “David” Thompson is responsible for building and developing the IT infrastructure to support the next generation of payment products and services. He also is responsible for maximizing efficiency, quality and customer delivery for the company’s global Agent network of more than 510,000 locations in 200 countries and territories. He is based in Englewood, Colo., at Western Union’s global headquarters.

Thompson has more than 20 years of experience in the high-technology industry, most recently as Group President, Services and Support and Global CIO of Symantec Corporation. Prior to this role, he served as Symantec’s Executive Vice President and Chief Information... Read More.

Bonnie Tijerina
Bonnie Tijerina (Data & Society), @bonlth

Bonnie Tijerina is a researcher at Data & Society, a think tank in NYC. She is founder of ER&L (Electronic Resources & Libraries) conference and organization, created to facilitate communication and foster collaboration among information management professionals working to manage electronic content in libraries.

Volkmar Uhlig
Volkmar Uhlig (Adello)

Volkmar is the founder of Adello, a leading real-time media-buying platform built on Hadoop. Before, Volkmar worked on massive scale low-latency stream processing in High Frequency Trading and at IBM Research. Volkmar was also the lead architect of the L4 microkernel which is commercially deployed in over two billion cell phones. Volkmar has published numerous papers, holds multiple patents, and is actively participating in the research community. He holds a PhD in Computer Science from the University Karlsruhe, Germany.

Milan Vaclavik
Milan Vaclavik (CenturyLink Technology Solutions), @Savvis

Milan Vaclavik is Sr. Director and Solution Lead for CenturyLink Technology Solution’s big data solutions. For more than 20 years, he has been bringing innovative software solutions to market in a variety of industries including enterprise messaging and collaboration, digital rights management, document automation, supply chain management, and physical security. He has held senior product management, marketing and business development positions with startup software firms, as well as larger organizations such as Lotus Development/IBM, GE and LexisNexis. Milan holds a bachelors degree in Regional Science from the University of Pennsylvania and an MBA in Finance and Management of the Organization from Columbia Business School.

Shreepadma Venugopalan

Shreepadma Venugopalan is a software engineer in the Platform Team at Cloudera. Prior to Cloudera, Shreepadma was a member of the Server Technologies group at Oracle where she focused on the relational engine, query optimizer, and unstructured data management. She holds a Master’s degree in Computer Science from the University of Wisconsin-Madison.

Mona Vernon
Mona Vernon (Thomson Reuters Labs), @monavernon

Mona Vernon is vice president of Thomson Reuters Labs, which partners with customers and third parties, such as startups and academics, on new data-driven innovations. Previously at Thomson Reuters, Mona ran the Emerging Technology group and launched an Open Innovation Challenge program across the enterprise. Prior to joining Thomson Reuters, she held product development and management roles in technology startups. Mona is an executive board member of the FinTech Sandbox in Boston, an advisory board member of the Commonwealth of Massachusetts Big Data Advisory Committee, and winner of the Boston 50 on Fire. Mona holds a BS and an MS in mechanical engineering from Tufts University and an SM in engineering and management from MIT, where her research focused on the role of customer... Read More.

David is a full stack developer with 15 years experience building B2B and B2C applications. He spent 6.5 years as a Sr. Application Architect leading the design and implementation of many customer facing features of Nordstrom.com. Recently he joined the Innovation Lab team which focuses on eliminating uncertainty around high risk technology endeavors. Currently he is focusing on partnering with the newly formed Nordstrom advanced analytics team to tackle some of the organizations toughest data challenges.

Heather Wasserlein is a Senior Product Manager at Intuit, where she partners with Data Science to create data-driven New Business Initiatives. Prior to Intuit, Heather worked on web content classification at Yahoo! and personalization at Travelocity. Heather holds a Master’s degree in Mechanical Engineering from MIT.

Timothy Weaver (Dannon)

Timothy Weaver is the CIO for Danone’s North American business units, where the focus is to bring health through food to as many consumers as possible. Its Dannon subsidiary is the market leader in the USA’s fast growing yogurt category. Timothy has a proven track record for defining, aligning, and delivering IT strategies that focus on top line growth across Danone’s North American markets. His focus is on the delivery and use of leading edge tools and capabilities around pattern-based analytics and predictive modeling to enhance connections with shoppers and retailers while growing the overall yogurt category. Prior to joining Danone, Timothy was a top IT executive with Red Bull Energy Drink where he served various roles across both its North American business and... Read More.

Ben Werther
Ben Werther (Platfora), @bwerther

Ben Werther is the Founder and Executive Chairman of Platfora. Ben launched Platfora, and was the founding CEO for four years, with the goal of transforming how ‘citizen data scientists’ in every company make sense and drive action through direct and effortless use of big data. Before founding Platfora, Ben was vice president of products for DataStax, where he shaped the company’s enterprise and Hadoop strategy, and was also head of products at Greenplum through its acquisition by EMC. Ben has a B.S. in Computer Science from Monash University (Australia) and an M.S. in Computer Science from Stanford University.

Tom White
Tom White (Cloudera), @tom_e_white

Tom White is one of the foremost experts on Hadoop. He has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. His book Hadoop: The Definitive Guide (O’Reilly) is recognized as the leading reference on the subject. In 2011, Whirr, the project he founded to run Hadoop and other distributed systems in the cloud, became a top-level Apache project.

Tom is a software engineer at Cloudera, where he has worked since its foundation, on the core distributions from Cloudera and Apache. Previously he was an independent Hadoop consultant, working with companies to set up, use, and extend Hadoop. He has written numerous articles for O’Reilly, java.net and IBM’s developerWorks, and has spoken at several conferences, most recently... Read More.

Chris Wiggins (hackNY/Columbia), @chrishwiggins

Chris Wiggins is co-founder of hackNY and an associate professor of applied mathematics at Columbia University.
and is also a founding member of Columbia’s Center for Computational Biology and Bioinformatics (C2B2).

His research focuses on applications of machine learning to real-world data,His research is at the intersection of data science and the natural sciences, particularly biology.

This includes inference, analysis, and organization of naturally-occurring networks; statistical inference applied to time-series data; applications of information theory and optimization in biological networks; and large-scale sequence informatics in computational biology.

He originally moved to NYC in 1989 to attend Columbia.

Prior to joining the faculty at Columbia he was a Courant Instructor at NYU and earned his PhD at Princeton University.

Since 2001 he has... Read More.

Edd Wilder-James
Edd Wilder-James (Silicon Valley Data Science), @edd

Edd Wilder-James is a technology analyst, writer, and entrepreneur based in California. He’s helping transform businesses with data as VP of strategy for Silicon Valley Data Science. Formerly Edd Dumbill, Edd was the founding program chair for the O’Reilly Strata conferences and chaired the Open Source Convention for six years. He was also the founding editor of the peer-reviewed journal Big Data. A startup veteran, Edd was the founder and creator of the Expectnation conference-management system and a cofounder of the Pharmalicensing.com online intellectual-property exchange. An advocate and contributor to open source software, Edd has contributed to various projects such as Debian and GNOME and created the DOAP vocabulary for describing software projects. Edd has written four... Read More.

Richard Williamson
Richard Williamson (Silicon Valley Data Science)

Richard has been at the cutting edge of big data since its inception, leading multiple efforts to build multi-petabyte Hadoop platforms, maximizing business value by combining data science with big data. He has extensive experience creating advanced analytic systems using data warehousing and data mining technologies

David Winters
David Winters (Teradata)

Dave is the Senior Data Architect in Teradata’s Big Data Center of Excellence. In this role he is responsible for architecture, best practices, and fielding of Teradata’s UDA appliance. He is a member of the Hadoop SWAT and Whitecoat teams and has cross responsibility for technical product development and professional services delivery. His account list contains light-house and major logo accounts in Teradata’s portfolio. He is regularly called to brief senior leadership teams and to work directly with Silicon Valley entrepreneurs and leading technologists. His specialty is solving tough business problems with state of the art technology. He enjoys working across all levels of technology from executive to developer roles.

Before joining Teradata, Mr. Winters specialized in database engineering and architectures for Informatica.... Read More.

Aaron Wolf (Datascope Analytics)

Problem solver at heart with a focus on maximizing business value from complex data sets. Vast leadership experience with an extensive technical background provides unique perspective on business needs.

Adam Wolf (Pulsepod)

Adam Wolf is a postdoctoral researcher at Princeton’s Department of Ecology and Evolutionary Biology and lecturer in Civil and Environmental Engineering, teaching a class on environmental sensor design and implementation. Dr Wolf has published on turbulence, Siberian mammoths, and trees in outer space.

Alex Wright
Alex Wright (www.alexwright.org)

Alex Wright is a writer, researcher and designer who has led user experience initiatives for The New York Times, Etsy, The Long Now Foundation, Yahoo!, IBM, and Harvard University, among others. He is the author of the forthcoming book Cataloging the World: Paul Otlet and the Birth of the Information Age.

Reynold Xin
Reynold Xin (Databricks)

Reynold Xin is an Apache Spark committer and the lead developer for Shark and GraphX, two computation frameworks built on top of Spark. He is also a co-founder of Databricks. Before Databricks, he was pursuing a PhD focusing on large scale data systems in the UC Berkeley AMPLab.

Fangjin Yang
Fangjin Yang (Imply)

One of Metamarkets’ first developers on Druid, their database platform, Fangjin is responsible for core infrastructure development including real-time data ingestion. He joined Metamarkets from Cisco where he optimized packet diagnostic algorithms for Cisco’s flagship Cat6k router. Previous to this Fangjin held various engineering and architecture roles at Ericsson and Barclay’s Capital. He holds a BASc in Electrical Engineering and a MASc in Computer Engineering from the University of Waterloo, Canada.

Charles Zedlewski

Charles leads Cloudera’s product function, setting the strategic direction for Cloudera’s Distribution for Apache Hadoop and Cloudera Enterprise. Prior to Cloudera Charles held a number of leadership roles at SAP, BEA Systems as well as venture backed startups.

Charles holds a bachelor’s degree from Carleton College and an MBA from MIT.

Philip Zeyliger
Philip Zeyliger (Cloudera)

At Cloudera, Philip Zeyliger started and leads the Cloudera Manager product. He came to Cloudera from Google, where he worked on scalable storage for user-facing applications. Before that, he worked in finance, at D.E. Shaw. Philip holds a bachelor’s degree in mathematics from Harvard University. His interests include systems and databases. He’s a committer on the Apache Avro project.

Jing Zhao (Hortonworks, Inc.), @Jing9

Jing Zhao is a software engineer at Hortonworks. Currently he is working on HDFS. Before joining Hortonworks, he got his B.E. from Tsinghua University, China, and Ph.D. from University of Southern California, USA.

Ari Zilka
Ari Zilka (Hortonworks), @ikarzali

Ari Zilka has more than 20 years of software development expertise and a deep understanding of open source, enterprise software, and the execution required to build successful products. Ari was previously founder and CTO at Terracotta. Previously, Ari was an Entrepreneur-in-Residence at Accel Partners. Before joining Accel, Ari was the Chief Architect at Walmart.com, where he led the innovation and development of the company’s new engineering initiatives. Prior to Walmart.com, Ari worked as a consultant at Sapient and at PriceWaterhouseCoopers. Ari holds a B.S. in Electrical Engineering Computer Science as well as in Mechanical Engineering from University of California, Berkeley. Follow Ari on Twitter: @ikarzali

Brian Zotter
Brian Zotter (ClearStory Data)

Brian Zotter has 15+ years of engineering and product management experience building and delivering enterprise-class products. Most recently, Brian was Vice President of Engineering at Salesforce.com. Prior to Salesforce, Brian held senior engineering positions at EMC, BEA Systems and MobiTV. He is also a two-time founder and entrepreneur having founded Connectize and Boxcar Software (acquired by Documentum). With 8 patents in PaaS, SaaS and multiple other fields, he’s passionate about building and leading innovative, high-performance engineering teams.

Sponsors

Sponsorship Opportunities

For exhibition and sponsorship opportunities, contact Susan Stewart at sstewart@oreilly.com

Media Partner Opportunities

For information on trade opportunities with O'Reilly conferences email mediapartners
@oreilly.com

Press & Media

For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com

Contact Us

View a complete list of Strata + Hadoop World 2013 contacts