Strata Speakers

New speakers are added regularly. Please check back to see the latest updates to the agenda.

Search Speakers

Peter Adolphs
Peter Adolphs (Neofonie), @padolphs

R&D Project Manager at Neofonie, a Berlin based full service company providing text and data mining software and services. Peter studied computer science and linguistics at the Humboldt University in Berlin and worked for 6 years in the German Research Centre for Artificial Intelligence (DFKI). His focus of work is the development and improvement of information access and knowledge management solutions by leveraging linguistic analyses at various levels and applying them to large amounts of data, for arriving at an application-specific semantics with minimalized human efforts.

Abed Ajraou
Abed Ajraou (Solocal)

Abed Ajraou is the Director of the BI & Big Data in Solocal Group (YellowPages in France). He also Professor of BI in University in Paris. He is the VP Education for DAMA France and he is an author in media.

John Akred
John Akred (Silicon Valley Data Science), @BigDataAnalysis

With over 15 years in advanced analytical applications and architecture, John Akred is dedicated to helping organizations become more data driven. As CTO of Silicon Valley Data Science, John combines deep expertise in analytics and data science with business acumen and dynamic engineering leadership.

Assaf Araki
Assaf Araki (Intel)

Assaf Araki is responsible for big data analytics path findings in a group within Intel Information Technology that delivers advanced analytics and big data solutions across Intel.
He drives the overall work with the academy and industry for Big Data Analytics and merge new technologies inside Intel Information Technology.
Assaf has over 10 years of experience in Data Warehousing, Decision Support solutions and applied analytics within Intel.

carme  artigas
carme artigas (Synergic Partners), @carmeartigas

Carme Artigas is the founder and CEO of Synergic Partners, a strategic and technological consulting firm specializing in big data and data science (acquired by Telefónica in 2015). She has more than 20 years of extensive expertise in the telecommunications and IT fields and has held several executive roles in both private companies and governmental institutions. Carme is a member of the Innovation Board of CEOE and the Industry Affiliate Partners at Columbia University’s Data Science Institute. An in-demand speaker on big data, she has given talks at several international forums, including Strata Data Conference, and collaborates as a professor in various master’s programs on new technologies, big data, and innovation. Carme was recently recognized as the only Spanish woman among the 30... Read More.

Alicia Asin
Alicia Asin (Libelium)

Alicia Asín is CEO and co-founder of Libelium, the wireless sensor hardware
provider, creators of Waspmote, a modular, open source sensor platform for the
Internet of Things (IoT). Alicia is a computer engineer focused on how IoT can
change our world, starting with Smart Cities and smart agriculture. She is a
frequent speaker at international conferences on issues related to Smart Cities,
wireless sensor networks and IoT.
Alicia is recognized for achievements in business and technology: she was
named best business manager in Foreign Trade (ADEA 2011); she was awarded
the highest recognition for her paper on Computer Architecture (WCAE 2007,
San Diego). Alicia... Read More.

Shay Banon
Shay Banon (Elasticsearch)

Shay Banon is the creator of Elasticsearch, the open source, distributed, RESTful, search and analytics engine. Before creating Elasticsearch, Shay spent more than a decade building complex distributed systems and open source search solutions.

Claudiu Barbura

Claudiu is Atigeo’s Senior Director of Engineering, Platform Services, and oversees agile engineering teams in the US and Romania while also acting as Lead Architect in building and operating xPatterns, an enterprise-class, Big Data Analytics platform. Claudiu has 17 years of industry experience in various roles, with a strong passion for Software Architecture leveraging industry best patterns and practices and contributing with a significant level of innovation. His experience spans across the Open Source, Big Data and Microsoft’s Windows/.Net technology stacks

Jodok Batlogg
Jodok Batlogg (CRATE Technology GmbH), @jodok

Jodok has deep experience and wide recognition for his expertise in open source and big data. As early innovator he started with cloud services in 2006 and entered the world of billions of records many years ago. He is co-founder and CEO of Crate Technology and the former CTO of StudiVZ (Germany’s Facebook), the CTO of Sevenload and the CEO of Lovely Systems. Additionaly, he was a director of the Plone & Zope foundations. Loves his skis, his chainsaw and is an adventurous eater. @jodok @cratedata

Hadley Beeman
Hadley Beeman (U.K. Government | W3C)

Hadley is an architect for open data and open standards. She is an advisor on technology policy and is active in the U.K. Government and W3C. She is also the co-founder of @300_Seconds.

Francine Bennett
Francine Bennett (Mastodon C), @fhr

Francine Bennett is a data scientist and the CEO and cofounder of Mastodon C, a group of Agile big data specialists who offer the open source Hadoop-powered technology and the technical and analytical skills to help organizations to realize the potential of their data. Before founding Mastodon C, Francine spent a number of years working on big data analysis for search engines, helping them to turn lots of data into even more money. She enjoys good coffee, running, sleeping as much as possible, and exploring large datasets.

Tim Berglund
Tim Berglund (Confluent), @tlberglund

Tim is a teacher, author, and technology leader with DataStax. He is a conference speaker internationally and in the United States, and contributes to the Denver, Colorado tech community as president of the Denver Open Source User Group. He is the co-presenter of various O’Reilly training videos on topics ranging from Git to Mac OS X Productivity Tips to Apache Cassandra, and is the author of Gradle Beyond the Basics. He blogs very occasionally at, and lives in Littleton, CO, USA with the wife of his youth and their three children.

Marcello Bianchetti

Post DevOps Engineer at UniCredit.
Area of interest: APM, Tuning, Troubleshooting, Monitoring, Capacity Planning, BigData

Sharon Biggar
Sharon Biggar (Social Point)

Sharon is the Head of Analytics at Social Point, one of the world’s leading developers of games for mobile with a fan base of more than 50 million active monthly players across the world. As Head of Analytics Sharon’s role involves providing the necessary Big Data architecture required to process 1 billion events per day, creating predictive analytics to customize the player experience and building text analytics to ensure that Social Point listens to and responds to the ideas of its players.
Prior to joining Social Point Sharon was co-founder and CEO at Path Intelligence. Path Intelligence is a retail analytics firm that uses proprietary mobile location technology to understand and predict the way that shoppers behave inside shopping malls. The firm... Read More.

Joerg Blumtritt
Joerg Blumtritt (Datarella), @jbenno

Joerg Blumtritt (*1970) is founder and CEO of Datarella™, a Big Data Development Lab based in Munich, Germany.

After having graduated in statistics and political sciences with a thesis on machine learning, he started working as a researcher in behavioural sciences, focused on nonverbal communication. Projects had been funded by EU commission, German federal government and the Max-Planck-Society. After deep diving into insurance mathematics, he subsequently ran marketing and research teams for TV-channels ProSiebenSat.1, RTL II and magazine publisher Hubert Burda Media, introducing new qualitative methods like Netnography (< interNET ethNOGRAPHY) into media research. As European Operations Officer at Tremor Media, Joerg was in charge of building this New York based video ad-network’s European enterprises. Recently he had been working as managing director... Read More.

Anne-Lise Bouyer is the project manager and the COO of Journalism++, an agency for data-driven stories
After studying computer engineering, she worked 3 years as project manager in a web agency and for

She is a specialist in user experience and committed to applying her skills to innovative journalism. Her technical expertise, her skills in marketing and customer management enable her to have a 360-degree vision on all of the projects of Journalism++.

David Boyle
David Boyle (Audience Strategies), @beglen

David Boyle leads the work of the Insight team at BBC Worldwide, the commercial and global wing of the BBC. His task is to help to transform the relationship that BBC Worldwide has with its audience by building premium, industry-leading insight capabilities that are understood, valued and used by the organisation. Whether that’s Insight into consumers, BBC brands and the market, or Insight into what connects with audiences emotionally and what inspires them. This understanding will help teams to grow by identifying opportunities that will resonate with audiences and drive returns through understanding what consumers want in-market.

David has spent the last 7 years constructing global insight capabilities for the publishing and music industries that were widely acknowledged as having helped... Read More.

Mikio Braun

Mikio Braun is co-founder of Streamdrill, a startup focused on approximative approaches for real-time big data. He holds a Ph.D. in Machine Learning and has worked in research for a number of years, before becoming interested in putting research results into good use in the industry. His current interests focus on anything to do with real-time data analysis, in particular using approximative approaches beyond scaling.

Matt Casters (Pentaho)

Matt Casters is founder and primary sponsor of the Kettle project and is responsible for leading the architecture and delivery of Pentaho’s Data Integration solutions. Matt has more than 12 years experience in data warehousing and business intelligence. His background as a Unix support professional, a programmer, and a database consultant, combined with the business knowledge he gained from leading BI implementations lead him to the create the Kettle (now Pentaho Data Integration) software platform.

Alistair Croll
Alistair Croll (Solve For Interesting), @acroll

Alistair Croll is an entrepreneur with a background in web performance, analytics, cloud computing, and business strategy. In 2001, he cofounded Coradiant (acquired by BMC in 2011) and has since helped launch Rednod, CloudOps, Bitcurrent, Year One Labs, and several other early-stage companies. He works with startups on business acceleration and advises a number of larger companies on innovation and technology. A sought-after public speaker on data-driven innovation and the impact of technology on society, Alistair has founded and run a variety of conferences, including Cloud Connect, Bitnorth, and the International Startup Festival, and is the chair of O’Reilly’s Strata Data Conference. He has written several books on technology and business, including the best-selling Lean Analytics. Alistair tries to mitigate his chronic ADD by writing... Read More.

Doug Cutting
Doug Cutting (Cloudera), @cutting

Doug Cutting is the chief architect at Cloudera and the founder of numerous successful open source projects, including Lucene, Nutch, Avro, and Hadoop. Doug joined Cloudera from Yahoo, where he was a key member of the team that built and deployed a production Hadoop storage-and-analysis cluster for mission-critical business analytics. Doug holds a bachelor’s degree from Stanford University and sits on the board of the Apache Software Foundation.

Piet Daas
Piet Daas (Statistics Netherlands), @pietdaas

Piet is involved in and coordinates the research in the area of the use of secondary data, such as administrative data, internet data and other Big Data sources, for official statistics.

Vince Darley (King Digital Entertainment)

Vince Darley is the Vice President Data Analytics and BI at King, a leading interactive entertainment company for the mobile world and creator of games such as Candy Crush Saga. King listed on the NYSE in March 2014 (NYSE: KING).

With more than 17 years’ experience in data science, at King, Vince oversees a team working on all aspects of analytics and business intelligence, including downstream and upstream data pipelines, data warehousing, real-time and batch reporting, segmentation and real-time analytics – information which is used across the business to improve the player experience.

Prior to his role at King, Vince was Head of Analytics and Optimisation, Supply Chain and Trading Systems for Ocado, where he was responsible for analytics, modeling,
... Read More.

Aaron Davidson (Databricks)

Aaron Davidson is an Apache Spark committer and software engineer at Databricks. He has implemented Spark standalone cluster fault tolerance and shuffle file consolidation, and has helped in the design, implementation, and testing of Spark’s external sorting and driver fault tolerance.

tod davis
tod davis (Children's Healthcare of Atlanta), @todwith1d

Tod Davis is the Manager of BI and Data Warehousing for Children’s Healthcare of Atlanta, a leading pediatric healthcare facility in the US. In his 17 years at Children’s, Tod has implemented web and database solutions using a variety of technology platforms. Most recently, Tod has been charged with new technology research and implementations, beginning with Apache Hadoop.

Gianmarco De Francisci Morales

Gianmarco De Francisci Morales is a Research Scientist at Yahoo Labs Barcelona.
He received his Ph.D. in Computer Science and Engineering from the IMT Institute for Advanced Studies of Lucca in 2012. His research focuses on large scale data mining and big data, with a particular emphasis on Web mining and Data Intensive Scalable Computing systems. He is an active member of the open source community of the Apache Software Foundation working on the Hadoop ecosystem (Giraph, S4), and a committer for the Apache Pig project. He is a co-organizer of the workshop series on Social News on the Web (SNOW) co-located with the WWW conference. He is one of the lead developers of SAMOA, an open-source platform for mining... Read More.

Edwin De Jonge
Edwin De Jonge (Statistics Netherlands)

Edwin de Jonge is a statistical consultant and data scientist at Statistics Netherlands: the Dutch government agency that is responsible for producing official demographic, economic, social and environmental statistics. His expertise is statistical computing, data visualisation and exploratory techniques. He well versed in several programming languages including R and Python. Edwin is author of several R packages and book on using RStudio. Currently he is writing a book on data cleaning with applications in R.

Yves-Alexandre de Montjoye
Yves-Alexandre de Montjoye (Imperial College London | MIT Media Lab), @yvesalexandre

Yves-Alexandre de Montjoye is researcher at the MIT Media Lab where he is engineering stochastic tools to harness the power of rich behavioral datasets, such as human movement data and communication patterns in networks. He is also interested in how the unicity of human behavior and the richness of these datasets impact individual’s privacy. His research has been covered in BBC News, CNN, The New York Times, MIT Technology Review, Wired, and The Huffington Post. Before coming to MIT, he was a researcher at the Santa Fe Institute where he used cell phone data to model the dynamics of social support. Over a period of 6 years, he obtained an MSc in applied mathematics and his BSC in engineering... Read More.

nick dimiduk
nick dimiduk (Hortonworks, Inc), @xefyr

Nick is an HBase committer and an author on HBase in Action. He works on the HBase team at Hortonworks where he focuses on usability and performance. Nick got started with HBase in 2008 when his nightly ETL jobs started taking 20+ hours to complete. Since then, he has applied Hadoop and HBase to projects in social media, social gaming, click-stream analysis, climatology, and geographic data.

Alex Dorman
Alex Dorman (Magnetic)

Alex Dorman, CTO at Magnetic, holds more than twenty years of technology experience and fifteen years of experience managing engineering and data science teams. Alex is using Hadoop technologies for last 7+ years. Magnetic is an online advertising company and is leader in search retargeting. Before joining Magnetic, Alex built Big Data platforms and teams at Proclivity Media and ContextWeb/PulsePoint. Alex began his career at Intel Software Labs in Israel.

Ted Dunning

Ted Dunning is Chief Application Architect at MapR Technologies and committer and PMC member for the Apache Mahout project. He contributing to the Mahout clustering, classification and matrix decomposition algorithms. He was the chief architect behind the MusicMatch, (now Yahoo Music) and Veoh recommendation systems and built fraud detection systems for ID Analytics.

Guy Ernest
Guy Ernest (Amazon Web Services), @guyernest

Guy Ernest is part of the solutions architecture team of Amazon Web Services, where he helps customers with their first and advanced steps in the Amazon cloud.
Guy specialises in Mobile, Big Data, Analytics and Machine Learning, thanks to his background in these fields prior to joining AWS. Guy founded a couple of start up companies in mobile search, personalisation, and big data analytics.

Hossein Falaki
Hossein Falaki (Databricks Inc.)

Hossein Falaki is a software engineer at Databricks working on the next big thing. Prior to that he was a data scientist at Apple’s personal assistant, Siri. He graduated with Ph.D. in Computer Science from UCLA, where he was a member of the Center for Embedded Networked Sensing (CENS).

Sameer Farooqui

Sameer Farooqui is a client services engineer at Databricks, where he works with customers on Apache Spark deployments. Sameer works with the Hadoop ecosystem, Cassandra, Couchbase, and general NoSQL domain. Prior to Databricks, he worked as a freelance big data consultant and trainer globally and taught big data courses. Before that, Sameer was a systems architect at Hortonworks, an emerging data platforms consultant at Accenture R&D, and an enterprise consultant for Symantec/Veritas (specializing in VCS, VVR, and SF-HA).

Camille Fournier
Camille Fournier (Independent), @skamille

Camille Fournier is the former head of engineering at Rent the Runway. She was previously a vice president at Goldman Sachs. Camille is an Apache ZooKeeper committer and PMC member and a Dropwizard framework PMC member.

Aaron Frazer
Aaron Frazer (Seeking Alpha)

I am a hands-on data science leader excited about the potential of data mining and analysis tools to make a real difference in business, public health, medical research, and clinical practice. I am interested in working in organizations that leverage these technologies to succeed in business and improve global health.

I am passionate about the technologies I use, but also about the people I work with. I have been privileged to recruit, train and manage an exceptional team, and to be managed by insightful mentors. Collaborating with them is a constant source of excitement and inspiration for me.

Michael Freeman
Michael Freeman (University of Washington), @mf_viz

Michael Freeman is a senior lecturer at the Information School at the University of Washington, where he teaches courses on data science, data visualization, and web development. With a background in public health, Michael works alongside research teams to design and build interactive data visualizations to explore and communicate complex relationships in large datasets. Previously, he was a data visualization specialist and research fellow at the Institute for Health Metrics and Evaluation, where he performed quantitative global health research and built a variety of interactive visualization systems to help researchers and the public explore global health trends. Michael is interested in applications of data visualization to social change. He holds a master’s degree in public health from the University of Washington. You can find samples... Read More.

Lars Marius Garshol

I’m a consultant, switching between the roles of developer, architect, and advisor, focusing mostly on semantic technology and data integration. Looking at getting into Big Data analytics to derive more value from the data we’ve integrated. Developer of an open source data linking tool called Duke .

Lars George
Lars George (Cloudera), @larsgeorge

Lars George has been involved with HBase since 2007, and became a full HBase committer in 2009. He has spoken at many Hadoop User Group meetings, and conferences such as ApacheCon, FOSDEM, QCon, or Hadoop World and Hadoop Summit. He also started the Munich OpenHUG meetings. Lars now works for Cloudera, as the EMEA Chief Architect, acting as a liaison between the Cloudera professional services team and customers as well as partners in and around Europe, building the next data driven solution. He is also the author or O’Reilly’s “HBase – The Definitive Guide”.

Olivier Girardot
Olivier Girardot (Lateral Thoughts)

Olivier Girardot is a software engineer and co-founder of Lateral Thoughts working on Machine Learning, Big Data and DevOps solutions with clients to help them tackle problems that require both expertise and experience. In order to become more efficient both as a company and as a team.

Joe Goldberg
Joe Goldberg (BMC Software), @GoldbergJoe

Joe is an IT professional with over 35 years of experience in the design, development, implementation, sales and marketing of enterprise solutions to Global 2000 organizations. Joe has been active in helping BMC products leverage new technology to deliver market-leading solutions with a focus on Workload Automation.

Jesús  Gorriti

Jesús may have 12 years of experience in interaction design, but he’s secretly a magician. He’s passionate about finding the most innovative design solution: one that captures user needs, strengthens a client’s business and pushes technology until it seems to have disappeared, thus becoming indistinguishable from—that’s right—magic.

As Group Director of the Madrid studio, he leads a growing team to ensure outstanding work and the overall success of Fjord’s clients. Jesús has a few tricks up his sleeve to help Fjord achieve their best work: transparency, efficiency, and happiness. He believes in open dialogue, using the right tools to achieve timely problem-solving.

Jesús has worked on everything from strategy to product development, providing a unique perspective of how digital products are made and delivered to... Read More.

Floris Grandvarlet current responsibility at Cisco is Head of Unified Computing in DCV EMEAR Tech Ops. After the merge of European Theater with Emerging Market Theater, he is now leading a team of experts for EMEAR, enabling market adoption of new technologies : Cloud, SDN, BigData through Technical resources/partners and supporting strategic account major deals in close business partnership, as well as driving both internal community and standards body development (IEEE, SNIA).
Before the merge (2012), he was CTO DCV for Emerging Market, as well as responsible for the Enterprise Architect Team for Cisco Large Opportunity Team (deals >100M$), across Emerging Market.
In 2008-2009, Floris was responsible for coordinating Cisco’s technology and product strategies in... Read More.

Lisa Green
Lisa Green (Common Crawl), @boudicca

Lisa Green is the Director at the Common Crawl Foundation where she
oversees the foundation’s mission of building, maintaining and openly
disseminating a comprehensive crawl of the web. Prior to joining
Common Crawl, she was the Chief of Staff at Creative Commons. Lisa
holds a PhD in physical chemistry from the University of California
Berkeley, lives in San Francisco and is passionate about open systems.

Garrett Grolemund
Garrett Grolemund (RStudio)

I specialize in teaching people how to use R – and especially Hadley Wickham’s R packages – to do insightful, reliable data science. Hadley was my dissertation advisor at Rice University, where I gained a first-hand understanding of his R libraries. While at Rice, I taught (and helped developed) the courses “Statistics 405: Introduction to Data Analysis,” and “Visualization in R with ggplot2”. Before that, I taught introductory statistics as a Teaching Fellow at Harvard University.

I’m very passionate about helping people analyze data better. I have travelled as far as New Zealand, where R was born, to learn new ways to teach data science. I worked alongside some of the original developers of R to hone my programming skills, and I collaborated with the... Read More.

Mark Grover

Mark Grover is a committer on Apache Bigtop, a committer and PMC member on Apache Sentry (incubating) and a contributor to
Apache Hadoop, Apache Spark, Apache Hive, Apache Sqoop and Apache Flume. He is currently co-authoring O’Reilly’s Hadoop Application Architectures title and is a section author of O’Reilly’s book on Apache Hive – Programming Hive. He has written a few guest blog posts and spoken at many conferences about technologies in the hadoop ecosystem.

Sebastian Gutierrez
Sebastian Gutierrez (, @dashingd3js

Sebastian Gutierrez is a data entrepreneur who focuses on data-driven companies. Sebastian founded to provide online and corporate training in data visualization and D3.js to a diverse client base, including corporations like the New York Stock Exchange, American Express, Intel, General Dynamics, Salesforce, Thomson Reuters, Oracle, Bloomberg Businessweek, universities, and dozens of startups. More than 1,000 people have attended his live training sessions, and many more have succeeded with his online D3.js training. Sebastian also cofounded, which provides news, analysis, and commentary in data science. Its Data Science Weekly newsletter reaches tens of thousands of aspiring and professional data scientists. He is also the author of Data Scientist at Work, a collection of interviews with many of the world’s most influential and interesting... Read More.

Amir Halfon
Amir Halfon (ScalingData)

Amir Halfon is the Director of Product at ScalingData, focusing on high volume machine data analytics at the intersection of Big Data and IT operations.

Before joining ScalingData, Amir was Chief Technologist for Financial Services at MarkLogic, Oracle, and Sun Microsystems. He is a frequent speaker at industry conferences and contributed to several publications.

Michael Hausenblas

Big Data geek, developer and advocate

Michael works at MapR Technologies in the role of Chief Data Engineer EMEA, where he helps people to tap the potential of Big Data. He has a background in large-scale data integration, the Internet of Things, and Web applications and is experienced in advocacy and standardisation. Michael has been using NoSQL datastores and Hadoop in a number of use cases and he shares his experiences with polyglot persistence at public events and via blogs. Michael contributes to Apache Drill, a distributed system for interactive, ad-hoc analysis and query
of large-scale datasets.

Jeremy Heffner
Jeremy Heffner (Azavea)

I’m the Senior Data Scientist at Azavea, a geospatial software firm located in Philadelphia. My primary focus is working with crime data to model patterns and forecast risk — the intersection of geography, data science, and social good.

Keywords: geographic data, raster processing, predictive analysis, spacetime event modeling, weather, demographics, machine learning, early warning systems, R, Scala, Python

Amy Heineike

Amy Heineike is the vice president of product engineering at Primer, where she leads teams to build machines that read and write text leveraging natural language processing (NLP), natural language generation (NLG_, and a host of other algorithms to augment human analysts. Previously, she built out technology for visualizing large document sets as network maps at Quid. A Cambridge mathematician who previously worked in London modeling cities, Amy is fascinated by complex human systems and the algorithms and data that help us understand them.

Amanda Hill (BBC Worldwide)

Amanda developed and oversees the overarching brand strategy for all BBC Worldwide’s businesses and formats. This includes maximising the commercial value of our current portfolio of 14 global brands – including Doctor Who and Dancing with the Stars – and also developing new BBC brands to build future brand revenue growth.

Amanda Hill joined BBC Worldwide in 2003. Before becoming Chief Brands Officer, she was responsible for BBC Earth, a global guardian brand representing the BBC’s natural history content and Walking with Dinosaurs.

Amanda has driven the growth of these brands leading them into theatric features with the creation of BBC Earth Films, exciting live events and exhibitions and creating a new digital home for BBC Earth content with... Read More.

Jonathan Hsieh
Jonathan Hsieh (Cloudera, Inc), @jmhsieh

Software Engineer @ Cloudera. Apache HBase Commiter, Apache Flume Founder.

Dan Jabry (CrowdEmotion)

Dan is a PhD student and data scientist. He has worked with clients like PWC and LSE to make sense of messy data. Dan brings a unique ability to translate product development into business value.

Jeroen Janssens
Jeroen Janssens (Data Science Workshops), @jeroenhjanssens

Jeroen is a Senior Data Scientist at YPlan in New York City. He has an M.Sc. in Artificial Intelligence and a Ph.D. in Machine Learning. He is authoring a book titled “Data Science at the Command Line”, which will be published by O’Reilly in summer 2014. Jeroen enjoys biking the Brooklyn Bridge, building tools, and eating stroopwafels. He tweets at @jeroenhjanssens.

Hajkan Jonsson
Hajkan Jonsson (Sony Mobile Communications), @hajons

Håkan started out in research in genetic programming for financial forecasting and artificial life, but has since then been developing telco and mobile applications and software for 15 years as programmer, product manager, CTO in startups as well as established companies. For the past few years he has returned to research at Sony Mobile Communications, enrolling as a PhD student at Lund University, with mobile sensing and pervasive social computing as his subjects.

Ameya Kantikar (Groupon)

Ameya Kanitkar is the lead architect building real time analytics infrastructure that powers Groupon’s real time relevance and personalization systems.
Before personalization infrastructure, he also lead the design and development of global message bus infrastructure at Groupon. Ameya holds degrees from Carnegie Mellon University and Pune University.

Robert Kaye
Robert Kaye (MusicBrainz), @mayhem

Robert is the founder and lead geek for MusicBrainz ( ), and Executive Director of the MetaBrainz Foundation, the non-profit organization that pays the bills for MusicBrainz. After studying Computer Engineering at Cal Poly, San Luis Obispo he joined Xing Technology where he shaped the MP3 efforts and then spent time for the dot com boom on EMusic’s FreeAmp team. While at EMusic he started working on MusicBrainz, the open music encyclopedia and fell in love with Open Source software.

Robert is also an active hardware hacker, creating art projects for Burning Man and Nowhere. He is the co-founder of Party Robotics ( ) where he co-created Bartendro the open source cocktail robot. He lives in Barcelona and cooperates with the Music Technology... Read More.

Samuel Kerrien (RESEREC)

Sam is a Data Engineer at Jagex, the largest independent games developer and publisher in the UK. Sam has a wealth of experience on leveraging the Hadoop stack to build business Intelligence and analytics data products.

James Kinley (Cloudera), @jrkinley

James Kinley is a Principal Solutions Architect at Cloudera and has been involved with Hadoop since early 2010. James joined Cloudera from the UK defence industry where he specialised in cyber security. James now works with Cloudera’s customers across EMEA to help them succeed in their Hadoop endeavours.

Greg Kleiman (Red Hat)

Greg Kleiman leads the big data strategy for Red Hat. With more than 25 years as a technology innovator, he is responsible for the overall direction of big data across Red Hat’s open source product portfolio with a focus on storage, middleware, and cloud architectures for enterprises.

With a broad technology background in hardware, software, and services, Greg has led key company initiatives across product management, product marketing, business development, and sales organizations. Prior to Red Hat, he held management positions at leading IT companies in product management, marketing, and business strategy, including NetApp and HP. He is a frequent speaker at IT conferences worldwide and covers topics such as storage, big data, and cloud.

Greg holds a masters in business administration from Harvard Business... Read More.

Joshua Koran (Turn)

Joshua has more than 15 years of experience in the online advertising industry. He is one of the most knowledgeable marketing analytics and behavioral targeting experts, having designed applications that generate more than 30 percent of the industry’s annual behavioral targeting revenue.

Prior to joining Turn, Joshua was vice president of product management, research and data monetization at AT&T AdWorks. In this role, he was responsible for the architecture, product vision, and roadmap for the industry’s first three-screen audience platform. Joshua also advised the company on positions to take regarding digital advertising and privacy.

Prior to AT&T AdWorks, Joshua was vice president of targeting and optimization for ValueClick, Inc., where he oversaw product development of the company’s behavioral targeting platform and served as head of... Read More.

Marcel Kornacker
Marcel Kornacker (Cloudera)

Tech lead at Cloudera for new products. Graduated in 2000 with a PhD in databases from UC Berkeley, followed by engineering jobs at a few database-related startup companies. Marcel joined Google in 2003, where he worked on several ads serving and storage infrastructure projects. His last engagement was as the tech lead for the distributed query engine component of Google’s F1 project.

Michal Laclavik
Michal Laclavik (Magnetic)

Michal Laclavik, Sr. Data Scientist at Magnetic, has more than ten years experience on R&D in the field of Semantic Technologies, Information Retrieval and Big Data technologies. Michal is using Hadoop for his research since 2008. Before joining Magnetic, he was doing PhD at Slovak Academy of Science and working as a researcher on several EU funded projects. Michal has multiple publications. He is also giving lectures on Information Retrieval at Slovak University of Technology.

Mounia Lalmas

Mounia Lalmas is a director of research and the head of tech research in personalization at Spotify. Her work focuses on studying user engagement in areas such as native advertising, digital media, social media, search, and music. Mounia also holds an honorary professorship at University College London. Previously, she was a director of research at Yahoo, where she led a team of researchers working on advertising quality for Gemini, Yahoo’s native advertising platform. She also worked with various teams at Yahoo on topics related to user engagement in the context of news, search, and user-generated content. She has given numerous talks and tutorials and is the coauthor of a book written as the outcome of her WWW 2013 tutorial on “measuring user engagement.”

Cindy Lamm
Cindy Lamm (comSysto GmbH)

Cindy Lamm works as a Data Scientist for the comSysto GmbH in Munich, Germany, where she focuses on combining data analysis (mostly done with R) with software development (Python or Java) in an agile environment. She holds an M.Sc. in Statistics from HU Berlin and a Diplôme Statisticien Economiste from ENSAE Paris.

Uri Laserson
Uri Laserson (Cloudera), @laserson

Uri Laserson is a data scientist at Cloudera. Previously, he obtained his PhD from MIT developing applications of high-throughput DNA sequencing to immunology. During that time, he co-founded Good Start Genetics, a next-generation diagnostics company focused on genetic carrier screening. In 2012, he was selected to Forbes’s list of 30 under 30.

Costin Leau
Costin Leau (Elastic), @costinl

Costin Leau is an engineer at Elasticsearch, currently working with NoSQL and Big Data technologies on the Elasticsearch for Apache Hadoop project. An open-source veteran, Costin led various Spring projects (Spring OSGi, GemFire, Redis, Hadoop) and authored an OSGi spec. Speaker at various editions of EclipseCon/OSGi DevCon, JavaOne, Devoxx/Javapolis, JavaZone, SpringOne, TSSJS on Java/Hadoop/Spring related topics.

r0ml Lefkowitz
r0ml Lefkowitz (Retired), @r0ml

Robert (a.k.a. r0ml) Lefkowitz is a frequent speaker on the intersection of software and literacy. Previously, he was a CTO at a fintech startup and held senior technology positions in the telecommunications and financial industries. He is a distinguished engineer of the ACM.

Roger Magoulas
Roger Magoulas (O'Reilly Media), @rogerm

Roger Magoulas is the vice president of O’Reilly Radar. Previously, Roger was the research director at O’Reilly, where he and his team built the company’s analysis infrastructure and provided analytic services and insights on technology-adoption trends to business decision makers at O’Reilly and beyond. He and his team found what excites key innovators and use those insights to gather and analyze faint signals from various sources to make sense of what others may adopt and why.​

Ted Malaska
Ted Malaska (Capital One), @TedMalaska

Ted has worked on close to 60 Clusters over 2-3 dozen clients with over 100’s of use cases. He has 18 years of professional experience working for start-ups, the US government, a number of the worlds largest banks, commercial firms, bio firms, retail firms, hardware appliance firms, and the US’s largest non-profit financial regulator. He has architecture experience across topic such as Hadoop, Web 2.0, Mobile, SOA (ESB, BPM), and Big Data. Ted is a regular committer to Flume, Avro, Pig and YARN.

Neil Martin (

Neil Martin is a Senior Project Manager at, part of the BGL Group in Peterborough, England. He has a 17 year career in project management across the financial services and utilities industries, having spent time at Pearl Assurance and EON. Neil has been part of the BGL Group for 7 years, delivering new insurance schemes for several well know global banks, leading the change programme in Legal Services and most recently delivering the data roadmap in comparethemarket.
He resides in Peterborough and is a keen runner. Neil can be contacted at

Juan Mateos Garcia

Juan is Economics Research Fellow at Nesta, the UK Innovation Foundation.

Juan’s job is to create knowledge to inform better decisions driving innovation and growth in the UK’s creative and digital economy.

Juan has 15 years of experience using a range of methods and data sources (structured and unstructured) to understand the cutting edge of innovation practice, and its implications.

Juan is currently working on a project that looks at the skills implications of the data revolution, in partnership with the Royal Statistical Society and Creative Skillset.

Juan is an economist with an MSc on Science and Technology Policy from the University of Sussex.

Liam Maxwell
Liam Maxwell (HM Government), @liammax

Liam Maxwell is the national technology advisor in the UK government, where his role is to create the conditions to make the UK the best place for business to invest in technology and ensure the civil service can make the best use of emerging technologies to deliver effective digital government. Previously, Liam was chief technology officer for HM government. His teams at the Government Digital Service established the Common Technology Services function, which delivered a range of projects from the Crown Hosting Service to the Public Services Network, the Digital Marketplace, and the spend control and delivery assurance function. The changes to technology that his team introduced across the government helped to save £3.5B in the last four years of the 2010 parliament.

Geoff McGrath
Geoff McGrath (McLaren Applied Technologies)

Geoff McGrath trained as a mechanical engineer, initially working in aeronautical, civil, chemical, petroleum and mechanical engineering. Gaining experience in all aspects of the field, from conceptual engineering through to business development for start-up and blue-chip firms alike, he later switched focus to the world of telecoms, media and entertainment.

In 2009, he became the managing director of McLaren Applied Technologies (MAT) and has brought more than 20 years’ experience to bear on the company’s innovative and dynamic approach to pursuing groundbreaking conceptual design, development and engineering.

He is currently focused on developing and applying MAT’s specialist capabilities to a diverse range of subjects in the world of sports, health and wellbeing and strategic operations.

Juliette Melton
Juliette Melton (New York Times), @j

Juliette Melton is a design researcher and strategist formerly based at IDEO in San Francisco. Her work centers on bringing human perspectives to the innovation process, particularly around technology-based products and services. She frequently nerds out on research methodology, scaling human-centered inquiry, and mindfulness/attention/focus.

bob middleton (Tableau Software)

A 20 year veteran of the IT industry, Bob has worked in hardware, software and all points in between. Bob has worked at providing large scale ERP systems to facilitate the capture and availability of data, and in security trying to improve data and system protection. After seeing so many changes in IT Bob is confident that we ain’t seen nothing yet.

Gabriele Modena (Improve Digital)

Gabriele Modena is a Data Scientist at Improve Digital.
In his current position he uses Hadoop to manage, process and analyze behavioural and machine generated data. Prior to joining Improve Digital he held a number of positions in Academia and Industry where he researched and applied machine learning techniques to areas such as Natural Language Processing, Information Retrieval and Recommendation Systems.
He holds a BSc in Computer Science from the University of Trento in Italy and a Research MSc in Artificial Intelligence – Learning Systems from the University of Amsterdam in The Netherlands.

Rodney Mullen
Rodney Mullen (Almost Skateboards)

Rodney Mullen is widely considered the most influential skateboarder in the history of the skateboarding. Despite Alan Gelfand’s justifiable fame for inventing the ollie air (primarily a vert or pool-oriented trick), Rodney is responsible for the invention and development of the street ollie. The ability to pop the board off of the ground and land back on the board while moving has quite likely been the most significant development in modern skateboarding. This invention alone would rank Mullen the most important skateboarder of all time. The majority of ollie and flip tricks he invented throughout the 1980s, including the flatground ollie, the kickflip, the heelflip, and the 360 flip, are now fundamental aspects of modern vertical and street skateboarding. Rodney’s career highlights include winning nearly... Read More.

Paco Nathan
Paco Nathan (, @pacoid

Paco Nathan is known as a “player/coach” with core expertise in data science, natural language processing, machine learning, and cloud computing. He has 35+ years of experience in the tech industry, at companies ranging from Bell Labs to early-stage startups. His recent roles include director of the Learning Group at O’Reilly and director of community evangelism at Databricks and Apache Spark. Paco is the cochair of Rev conference and an advisor for Amplify Partners, Deep Learning Analytics, Recognai, and Primer. He was named one of the "top 30 people in big data and analytics" in 2015 by Innovation... Read More.

Kim Nilsson

Swedish ex-astronomer turned risk management consultant turned entrepreneur. PhD Astrophysics and MBA from Cranfield School of Management. Currently running Europe’s largest PhD-to-Data Science training programme, S2DS, based in London.

Tim O'Reilly
Tim O'Reilly (O'Reilly Media, Inc.), @timoreilly

Tim has a history of convening conversations that reshape the industry. In 1998, he organized the meeting where the term “open source software” was agreed on, and helped the business world understand its importance. In 2004, with the Web 2.0 Summit, he defined how “Web 2.0” represented not only the resurgence of the web after the dot com bust, but a new model for the computer industry, based on big data, collective intelligence, and the internet as a platform. In 2009, with his “Gov 2.0 Summit,” he framed a conversation about the modernization of government technology that has shaped policy and spawned initiatives at the Federal, State, and local level, and around the world. He has now turned his attention to implications of the on-demand... Read More.

Ben Okri
Ben Okri (Self)

Ben Okri has published 8 novels, including The Famished Road and Starbook, as well as collections of poetry, short stories and essays. His work has been translated into more than 20 languages. He is a Fellow of the Royal Society of Literature and has been awarded the OBE as well as numerous international prizes, including the Commonwealth Writers Prize for Africa, the Aga Khan Prize for Fiction and the Chianti Rufino-Antico Fattore. He is a Vice-President of the English Centre of International PEN and was presented with a Crystal Award by the World Economic Forum. He was born in Nigeria and lives in London.

Mike Olson
Mike Olson (Cloudera), @mikeolson

Mike Olson cofounded Cloudera in 2008 and served as its CEO until 2013, when he took on his current role of chief strategy officer. As CSO, Mike is responsible for Cloudera’s product strategy, open source leadership, engineering alignment, and direct engagement with customers. Previously, Mike was CEO of Sleepycat Software, makers of Berkeley DB, the open source embedded database engine, and he spent two years at Oracle Corporation as vice president for embedded technologies after Oracle’s acquisition of Sleepycat. Prior to joining Sleepycat, Mike held technical and business positions at database vendors Britton Lee, Illustra Information Technologies, and Informix Software. Mike holds a bachelor’s and a master’s degree in computer science from the University of California, Berkeley.

srowen om
srowen om (Cloudera), @sean_r_owen

Sean is Director of Data Science for EMEA at Cloudera, helping customers build large-scale machine learning solutions on Hadoop. Previously, Sean founded Myrrix Ltd, producing a real-time recommender and clustering product evolved from Mahout. Myrrix is now part of Cloudera. Sean was primary author of recommender components in Apache Mahout, and has been an active committer and PMC member for the project. He is co-author of Mahout in Action.

Tomas Petricek
Tomas Petricek (University of Cambridge)

Tomas is a computer scientist, book author and open-source developer. He is the lead developer of several F# data-science libraries (Deedle and F# Data), but he also contributed to the design of the F# language itself as an intern and independent consultant. He is the author of a popular book called “Real-World Functional Programming” and is currently editing a collection of practical F# case studies.

Tomas is a PhD student at the University of Cambridge, working on types for understanding context usage in programming languages. He is a founder of DualNotion ltd. where he provides training and consulting services. He recently spent 3 months in New York, working on F# tools for data science at BlueMountain Capital.

Aurélie Pols
Aurélie Pols (Mind Your Privacy), @aureliepols

Mantra: Data Driven Decision Making & Privacy by Design or in other words Data is the New Oil which needs to be refined & Privacy is the new Green, increasing Customer Trust.
Learn from Digital to be agile: test & fail fast through techniques such as A/B testing, MVT, CRO (Conversion Rate Optimization) & UX (User eXperience). Add the power of Big Data and keep an eye on what’s legally acceptable to avoid creepiness.

Aurélie pioneered Digital analytics in Europe by stating vendor independence & focusing on the interaction of tools & data to support her clients’ business strategies.
This allowed her to co-found OX2 in Belgium in 2003, which was successfully sold to UK based Digitas LBi (Publicis) in 2008.Read More.

Alex Priem (Statistics Netherlands)

Alex Priem is a statistical consultant and data scientist at Statistics Netherlands: the Dutch government agency that is responsible for producing official demographic, economic, social and environmental statistics. His primary focus is data analysis and data visualization, and he is fluent in C, Python and various flavours of SQL. Although his work requires him to crunch and analyze ‘Big Data’, he doesn’t mind programming microcontrollers for fun in his spare time.

Foster Provost
Foster Provost ( NYU | Stern ), @FakeFoster

Foster Provost is coauthor of the O’Reilly best-selling book, Data Science for Business ( He has designed data science solutions for businesses for over two decades, and has co-founded several successful companies focusing on data science for advertising (incl., Dstillery & Integral Ad Science). In his current job as Professor and NEC Faculty Fellow at the NYU Stern School of Business, Foster teaches in the MS in Data Science, MS in Business Analytics, MBA, and PhD programs. His data science research has won many awards and is broadly cited. He served as Program Chair for the ACM SIGKDD Conference and for many years as Editor-in-Chief for the journal Machine Learning.

Marco Puts
Marco Puts (Statistics Netherlands), @MarcoPuts

Marco Puts has a background in computer science and cognitive science. He received is PhD degree in the field of psychophysics. At the moment, he works at the big data team of the methodology department of Statistics Netherlands and is involved in the Big Data Sandbox of the UNECE.

Daniele Quercia

Daniele Quercia is a social media researcher at Yahoo Labs in Barcelona. Before that, he was a Horizon senior researcher at The Computer Laboratory of the University of Cambridge. He is interested in the relationship between online and offline worlds and his work has been focusing in the areas of data mining, computational social science, and urban informatics. His research has been published in leading venues including ICSE, Ubicomp, ICDM, CSCW, RecSys, WSDM, and WWW, received honorable mentions from AAAI ICWSM, and has been featured on La Repubblica, The Independent, New Scientist, Le Monde, and BBC. He spoke at TEDx Barcelona and Falling Walls Berlin, and wrote for BBC. He was Postdoctoral Associate at the Massachusetts... Read More.

kim rees
kim rees (Periscopic), @krees

Kim Rees is a founding partner of Periscopic:, an award-winning information visualization firm. Their work has been featured in the MOMA as well as several online and print publications, including CommArts’ Interactive Annual, The Information Design Sourcebook, Adobe Success Stories, CommArts Insights,,, and numerous websites, blogs, and regional media outlets. Periscopic’s body of work was recently nominated for the Cooper-Hewitt National Design Awards.

Kim is a prominent individual in the information visualization community. She has published papers in Parsons Journal of Information Mapping, was an award winner in the VAST 2010 Challenge, and is a guest blogger for Kim has been featured on CommArts Insights and has presented at several industry events including Strata, the Tableau Software Conference, Read More.

Naomi Robbins

Naomi B. Robbins is the author of Creating More Effective Graphs, published by Chart House (originally by Wiley.) She is a consultant, keynote speaker, and seminar leader who specializes in the graphical display of data. She trains employees of corporations and organizations on the effective presentation of data with customized programs. She also reviews documents and presentations for clients, suggesting improvements or alternative presentations as appropriate. Naomi received her Ph.D. in mathematical statistics from Columbia University, M.A. from Cornell University, and A.B. from Bryn Mawr College. She had a long career at Bell Laboratories before forming NBR, her consulting practice. Naomi was recently voted chair-elect of the Statistical Graphics Section of the American Statistical Association.

Ofer Ron
Ofer Ron (LivePerson), @oferron

Senior data scientist and hacker at LivePerson, and tech lead for the research team. After getting a math Ph.D., I’ve spent the last seven years developing classification and recommendation systems over big data, taking them from the drawing board to production.

Duncan Ross
Duncan Ross (Times Higher Education), @teradata

Duncan has been a data miner since the mid 1990s. He was Director of Advanced Analytics at Teradata until 2010, leaving to become Data Director of Experian UK. He rejoined Teradata to lead their International Data Science team.

At Teradata he has been responsible for developing analytical solutions across a number of industries, including warranty and root cause analysis in manufacturing, and social network analysis in telecommunications. These solutions have been developed directly with customers and have been deployed against some of the largest consumer bases in Europe.

In his spare time Duncan has been a city Councillor, chair of a national charity, founded an award winning farmers’ market, and is one of the founding Directors of the Institute of Data Miners. In 2012 he... Read More.

Frank Saeuberlich
Frank Saeuberlich (Teradata)

Dr. Frank Saeuberlich is Director Advanced Analytics as part of the Teradata International Data Science team. He previously worked at Urban Science International where he was responsible for Customer Analytics. In this role he has worked with client teams to implement analytical solutions and he pioneered new types of analysis to improve the efficiency of automotive clients’ marketing efforts. Prior to that as European Customer Solutions Practice Manager he was responsible for the Urban Science Customer Solutions practice in Europe.
While working on his doctoral thesis, he was a scientific assistant at the Institute of Decision Theory and Management Science, University of Karlsruhe, Germany where he developed a course in web mining and supervised several research papers.

Majken Sander
Majken Sander (Majken Sander), @majsander

Majken Sander is a data nerd, business analyst, and solution architect. Majken has worked with IT, management information, analytics, BI, and DW for 20+ years. Armed with strong analytical expertise, she is keen on “data driven” as a business principle, data science, the IoT, and all other things data. Read more

Melissa  Santos
Melissa Santos (Big Cartel), @ansate

Melissa Santos has over a decade of experience with all parts of the data pipeline from ETLs to modeling. At Etsy, her role includes teaching both engineers and non-technical people how to get the data they need. She has a PhD. in Applied Math and runs the blog .

Roy Sasson
Roy Sasson (Outbrain), @Outbrain

Roy Sasson is Outbrain’s Chief Data Scientist. His research focuses on the construction of predictive models and metrics for Outbrain’s recommendation system. Roy’s academic background includes a PhD in Applied Econometrics at Tel-Aviv University, where he has been teaching Econometrics since 2008. His academic research focused on gaining insights and predictions from data concerning the behavior of individuals, firms and managers. Prior to joining Outbrain, Roy served as the R&D group lead (Asia Branch) at Visual Domains.

Baron Schwartz
Baron Schwartz (VividCortex), @xaprb

Baron is co-founder and CEO of VividCortex, a SaaS server performance management product. He is the lead author of High Performance MySQL and a variety of open-source software.

Jim Scott
Jim Scott (NVIDIA), @kingmesal

Jim Scott is the cofounder of the Chicago Hadoop Users Group. As a cofounder Jim has helped build the Hadoop community in Chicago for the past 4 years. He has implemented Hadoop in three different companies supporting enterprise use cases from managing Points Of Interest for mapping applications, Online Transactional Processing in advertising, as well as full data center monitoring and general data processing. His work with high-throughput computing at Dow Chemical was a precursor to more standardized big data concepts like Hadoop.

Shawn Scully
Shawn Scully (Dato)

Shawn is the Director of Product at GraphLab where he helps make it easy to build cool experiences with data. He is data geeky and loves inspired technologies, businesses, and gadgets. His technical background spans recommendation systems and business analytics, physics simulations, and energy. He holds a PhD in Materials Science from Stanford University and a BA in Physics from Cornell University.

Jonathan Seidman

Jonathan has spent more than 15 years as a software developer, with a focus in the last few years on processing large data sets using tools such as Hadoop. Currently, Jonathan is a Solutions Architect on the Partner Engineering team at Cloudera. Before joining Cloudera he was a Lead Engineer on the Big Data team at Orbitz Worldwide, helping to build out the Hadoop clusters supporting the data storage and analysis needs of one of the most heavily trafficked sites on the internet. Jonathan is also a co-founder of the Chicago Hadoop User Group and the Chicago Big Data meetup and a frequent speaker on Hadoop and big data at industry conferences such as Hadoop World, Strata, and OSCON. Jonathan is co-authoring a book... Read More.

Gwen Shapira
Gwen Shapira (Confluent), @gwenshap

Gwen Shapira is a Solutions Architect at Cloudera and leader of IOUG Big Data SIG. Gwen Shapira studied computer science, statistics and operations research at the University of Tel Aviv, and then went on to spend the next 15 years in different technical positions in the IT industry. She specializes in scalable and resilient solutions and helps her customers build high-performance large-scale data architectures using Hadoop. Gwen Shapira is a frequent presenter at conferences and regularly publishes articles in technical magazines and her blog.

Konstantin Shvachko

Konstantin Shvachko, Chief Architect of WANdisco is a veteran Hadoop developer and well-respected industry author and speaker. A technical expert specializing in efficient data structures and algorithms for large-scale distributed storage systems, Konstantin joined WANdisco through the acquisition of AltoStor, a Hadoop-as-a-Service platform company, and prior to AltoStor, he was founder and Chief Scientist at AltoScale, a Hadoop and HBase-as-a-Platform company acquired by VertiCloud. Before AltoScale, Konstantin played a lead architectural role at eBay, building two generations of the organization’s Hadoop platform. Prior to eBay, as Principal Hadoop SE at Yahoo!, he worked on the Hadoop Distributed File System (HDFS). He has dozens of publications and presentations to his credit include those in the fields of Big Data Storage, Distributed Computing, Algorithms, Computational Complexity,... Read More.

Alex Sicoe
Alex Sicoe (Elsevier), @@AlexSicoe

Alex Sicoe recently joined Elsevier as a software developer within the company’s big data analytics platform team. Previously he worked as an engineer with Big Data Partnership
working with clients on projects involving Apache Spark, Apache Cassandra, Apache Storm, Apache Hadoop. He has extensive experience building data pipelines involving such systems as well as giving training courses on them. He also worked at CERN on building a large scale monitoring system for the ATLAS experiment on top of Apache Cassandra.

Georgos Siganos (Qatar Computing Research Institute), @siganos

Georgos Siganos is a Senior Scientist at Qatar Computing Research Institute working on next generation Graph Mining Architectures and Big Data Systems. He is also the lead of the open-source project. Previous to this, he was a Research Scientist at Telefonica Research focusing on Big Data and Peer to Peer Systems. He has authored more than 30 papers in journals and conferences. He received his Ph.D. from the University of California, Riverside.

Rob Siwicki (

Rob Siwicki is a Data Architect at Rob’s career has allowed him to deliver business improvements through enterprise architecture and the delivery of data intensive systems in multiple industries including investment banking, financial services, high-tech manufacturing, consulting and retail ecommerce. Rob holds both technology and business masters degrees and likes to apply both skillsets when creating solutions. Rob is a keen sportsman and competes in Ironman Triathlon. Rob can be contacted via linked in

Rod Smith
Rod Smith (IBM Emerging Internet Technologies ), @IBM

Rod Smith is an IBM fellow & Vice President of the IBM Emerging Internet Technologies organization, where he leads a team of highly technical innovators in seeking out disruptive technologies that aid businesses in future waves of business opportunities. As an IBM Fellow, Rod is closely involved with future IBM’s strategic planning & initiatives.

Throughout his career, Rod has moved the industry and led IBM to a rapid adoption of technologies, such as Web services, XML, Linux, J2EE & big data & analytics and most recently cloud computing around the evolution of platform as a service.

Marcelo Soria-Rodriguez
Marcelo Soria-Rodriguez (BBVA Data & Analytics), @msoriaro

Marcelo Soria-Rodríguez is VP, Data Services at BBVA Data & Analytics, a fully owned subsidiary of BBVA where all the provisioning of big data & data analysis has been centralized in order to ensure best practices, fast deployment, talent retention and global reach.. He is a Telecommunications Engineer and M.Sc. in Signal Processing. His previous work background goes from being a researcher at Nokia, software developer and project manager at two spanish startups and has been with BBVA in its innovation department for over 6 years now. He has an appetite for fine eating, photography, architecture, reading about how our brains work and spending time in initially useless tasks which end up being a massive waste of time.

Vincent pursued his PhD research in Computer Vision, and is an expert on the development of self-learning and context-sensitive algorithms. This knowledge he gladly shares on his VisionDummy blog. Vincent graduated as a Master of Science in electronic engineering, and obtained bachelor degrees in both applied informatics and electronics-ICT. His past as a Java JEE architect at Toyota Motors Europe allowed him to get acquainted with business intelligence and data mining, and encouraged him to focus on deep learning architectures and convolutional neural nets.

At Argus Labs, as Head of Deep Learning, Vincent is in charge of the application of deep learning methods for the Argus data model and intelligence.

David Talby
David Talby (Pacific AI), @davidtalby

David Talby has extensive experience in building & operating web-scale search and business platforms, as well as building world-class, agile, distributed teams. Previously he was with Microsoft’s Bing group where he was a Principal Group Manager leading all business operations for Bing Shopping in the US and Europe, managing teams in Seattle, India and several European countries. Prior to Microsoft, David worked at Amazon both in Seattle and in the UK where he built and ran distributed teams which helped scale Amazon’s financial systems. David is a veteran of the Israeli Air Force and he holds a PhD in Computer Science along with two masters degrees in Science and Business Administration respectively, all of which were obtained at the Hebrew University of Jerusalem. In addition,... Read More.

Martijn Tennekes
Martijn Tennekes (Statistics Netherlands)

Martijn has a Ph.D. in game theory at Maastricht University where he studied the formation of social and economic networks. Currently he works as a methodologist at Statistics Netherlands with a specialization in data visualisation, data science and R. Martijn has written two R packages: treemap and tabplot. He teaches courses on R and data visualization at Statistics Netherlands.

Ankit Tharwani
Ankit Tharwani (Barclays UK)

Ankit Tharwani is Proposition Manager, Information Business, Personal and Corporate Banking, at Barclays Bank PLC.

Jordan Tigani
Jordan Tigani (Google ), @jrdntgn

Jordan Tigani has more than 15 years of professional software development experience, the last 4 of which have been spent building BigQuery. Prior to joining Google, Jordan worked at a number of star-crossed startups, where he learned to make data-based predictions. He is a co-author of Google BigQuery Analytics. When not analyzing soccer matches, he can often be found playing in one.

Kathleen Ting

Kathleen Ting (@kate_ting) is currently a technical account manager at Cloudera where she helps strategic customers deploy and use the Apache Hadoop ecosystem in production. She’s a frequent conference speaker, has contributed to several projects in the open source community, and is a committer and PMC member on Apache Sqoop. Kathleen is also a co-author of O’Reilly’s Apache Sqoop Cookbook.

Marton has degrees in Computer Science and Physics. Before working at Prezi, he was the co-founder of Scalien, a now defunct NoSQL database startup. He joined Prezi as an engineer and is currently serving as the Director of Data and Analytics. He runs triathlons for fun.

Garry Turkington (Improve Digital)

Garry Turkington joined Improve Digital as VP Data Engineering in 2012 and is now the company CTO. One of his current focuses is in building out the company’s ability to derive more value from its substantial data asset. Prior to Improve Digital he was a Software Development Manager at Amazon where he led teams responsible for systems that process the data in the Amazon retail catalog. Before Amazon he spent over a decade in various government roles with a focus on large-scale distributed systems.
He has Ph.D and BSc degrees in Computer Science from the Queens University of Belfast in N.Ireland and a MEng in Systems Engineering from Stevens Institute of Technology in Hoboken New Jersey, USA.

Bart van Leeuwen

Bart van Leeuwen combines his 15 years firefighting experience with 17 years of ownership.

This combination allows him to highlight a new perspective on operational information delivery. With the ever larger amount of available data and changes in tactical approaches to firefighting new and fresh thinking is needed. As a “outside the box” thinker he helps fire departments to approach their information problems in a different way. In this process technology is not the answer, its an enabler and should be treated as such.

Currently he leads a innovation project where proven information management technology is combined with new paradigms like semantic web technology to deal with information flows in a smarter and agile way.

Daniel Waisberg

Daniel Waisberg is an Analytics Advocate at Google, where he is responsible for fostering Google Analytics by educating and inspiring Online Marketing professionals. Both at Google and his previous positions, Daniel has worked with some of the biggest Internet brands to measure and optimize online behavior. He is also the Founder of

Simon Wardley
Simon Wardley (Leading Edge Forum), @swardley

Simon Wardley is a researcher for the Leading Edge Forum focused on the intersection of IT strategy and new technologies. Simon is a seasoned executive who has spent the last 15 years defining future IT strategies for companies in the fast-moving consumer goods (FMCG), retail, and IT industries—from Canon’s early leadership in the cloud computing space to Ubuntu’s recent dominance as the top cloud operating system. As a geneticist with a love of mathematics and a fascination for economics, Simon has always found himself dealing with complex systems, whether in behavioral patterns, the environmental risks of chemical pollution, developing novel computer systems, or managing companies. He’s a passionate advocate and researcher in the fields of open source, commoditization, innovation, organizational structure, and cybernetics. Simon’s... Read More.

Uwe Weiss
Uwe Weiss (Blue Yonder), @WeissU

Uwe Weiss is Blue Yonder’s visionary leader. His objective is to bring together world-class data science and world-class standard enterprise software. He is a passionate software entrepreneur with a noteable and successful track record.

Edd Wilder-James

Edd Wilder-James is a strategist at Google, where he is helping build a strong and vital open source community around TensorFlow. A technology analyst, writer, and entrepreneur based in California, Edd previously helped transform businesses with data as vice president of strategy for Silicon Valley Data Science. Formerly Edd Dumbill, Edd was the founding program chair for the O’Reilly Strata conferences and chaired the Open Source Convention for six years. He was also the founding editor of the peer-reviewed journal Big Data. A startup veteran, Edd was the founder and creator of the Expectnation conference management system and a cofounder of the Pharmalicensing online intellectual property exchange. An advocate and contributor to open source software, Edd has contributed... Read More.

Martin Willcox’s career in IT spans 18 years, 5 organizations and more roles than he cares to remember. In his current role, Martin leads Teradata’s International Big Data CoE – a team of Data Scientists and Technology Consultants charged with helping customers realise value from their data assets and for articulating Teradata’s Big Data strategy and the nature, value and differentiation of relevant Teradata technology and solution offerings to International customers, prospects, partners, 3rd party influencers and the media.

Martin is a “poacher-turned-gamekeeper”, having been a Teradata customer for several years at Co-operative Retail before joining Teradata as an Enterprise Architecture consultant in Autumn 2004.

Martin holds a BSc (Hons) in Physics and Astronomy from the University of Sheffield and a Postgraduate Certificate in Computing... Read More.

Steven Willmott
Steven Willmott (3scale networks), @njyx

Steven Willmott, is the CEO of 3scale which one of the worlds leading providers of API Infrastructure. 3scale powers over 450 APIs for companies such as Skype, Campbell’s Soup, The Daily Telegraph, London Transport, Bitcasa, Optimizely and many others. The company also supports a wide range of community initiatives to enable API adoption including,, and several others. More information can be found at

Prior to 3scale Steven was the head of Research at an R&D Lab at the Technical University of Catalunya (UPC) covering topics including Artificial Intelligence, Distributed Systems, Autonomy, Semantic Web and Semantics of Communication.

Steven Willmott, 3scale Inc.
@njyx on Twitter

Simon Worgan
Simon Worgan (Jagex Ltd)

Simon Worgan is the Senior Data Scientist at Jagex Games Studio. In this role he has applied his machine learning expertise to a variety of challenges within the games industry, these include in-game sentiment analysis, behavioural player clustering and predictive modelling. With a Computer Science Ph.D. from the University of Southampton he has over 6 years’ experience in the real world application of Data Science.

Ann Wuyts
Ann Wuyts (Sentiance), @vintfalken

Ann has extensive experience as a UX & UI designer for serious games and virtual worlds, and served as Community & Editorial Manager for history brand Heritage Key. She’s currently Head of Creative UX UC at Argus Labs, where she’s involved in marketing, product design and behaviour research.

She loves to talk about UX, privacy, quantified self, the future of technology, and about how our minds’ ‘wiring’ shapes our behaviour and thinking.