New speakers are added continuously. Please check back to see the latest updates to the Strata Conference program.
Alasdair Allan is a scientist, author, hacker, tinkerer, and journalist who has recently been spending a lot of time thinking about the Internet of Things, which he thinks is broken. He is the author of a number of books and sometimes also stands in front of cameras. You can often find him at conferences talking about interesting things or deploying sensors to measure them. A couple of years ago, he rolled out a mesh network of five hundred sensor motes covering the entirety of Moscone West during Google I/O. He’s still recovering. A few years before that, he caused a privacy scandal by uncovering that your iPhone was recording your location all the time, which caused several class-action lawsuits and a US Senate hearing. Some years on, he still isn’t sure what to think about that.
Alasdair sporadically writes blog posts about things that interest him or, more frequently, provides commentary in 140 characters or less. He is a contributing editor for Make magazine and a contributor to O’Reilly Radar. Alasdair is a former academic. As part of his work, he built a distributed peer-to-peer network of telescopes that, acting autonomously, reactively scheduled observations of time-critical events. Notable successes included contributing to the detection of what was—at the time—the most distant object yet discovered, a gamma-ray burster at a redshift of 8.2.
Tim serves as Chief Product Officer for DataSift, focusing on product and market strategy. Tim has founded 3 startups during his career in the Enterprise Software market, most recently a content collaboration service acquired by Salesforce.com in 2007. Tim spent 5 years at Salesforce where he held leadership roles from product strategy to running European Marketing.
Francine Bennett is co-founder and CEO of Mastodon C [link: www.mastodonc.com], which offers managed Hadoop and data science services in the cloud in the greenest possible way. She started carbon footprinting cloud providers in 2012 after realising that it was possible to make huge reductions in CO2 emissions by putting together some straightforward data, and that nobody else was doing it.
She previously worked at extracting insight from Big Data at Google and at Ask.com. She is a former mathematician and a massive data nerd, who feels very strongly about bad data visualisations and Comic Sans.
As the VP Business Development for Teradata in EMEA Mikael Bisgaard-Bohr is responsible for identifying new trends and directions in the market for BI, Analytics and Big Data. He interacts with the largest and most sophisticated users of Teradata’s technology as well as leading minds in the industry to gain a better understanding for how technology is changing how organisations are run, products consumed and how organisations and consumers interact in the future. He frequently shares that insight with Teradata customers and prospects as well as at leading conferences across the EMEA region. Mikael has been working with BI for the last 18 years and prior to his current role he was a thought leader and business consultant focused on the retail industry. Mikael holds an MBA from SDA Bocconi and lives in Copenhagen with his wife and 2 children.
Klaas Bosteels is the Lead Data Scientist at Massive Media, the social media company behind Netlog.com and Twoo.com. Before joining Massive Media, he worked on various big data problems as a member of the music information retrieval team at Last.fm. He has contributed code to several open source projects related to data processing and analysis, the largest one being Apache Hadoop, and is the creator of Dumbo, a Python API for writing and running MapReduce applications. He was also the main organizer of a series of Hadoop User Group UK meetups and is a founding member of BigData.be. Klaas has a Ph.D. in Computer Science from Ghent University.
Brian is a Senior Scientist at Sage Bionetworks, a not-for-profit organization dedicated to exploring open source models in the advancement of biomedical research in Seattle, Washington. Brian received a BS at the University of Minnesota in Statistics and worked as a Biostatistician at the Mayo Clinic in Rochester, MN for 7 years. Brian’s extensive experience in working with clinical and genomic data, combined with his passion for exploring innovative ways to make science more open and transparent guided him to Sage Bionetworks.
Brian’s current projects involve implementing strategies and technologies for making complex genomic data more usable and accessible to the community, and re-envisioning how scientists communicate complex genomic science to one another and to the public at large through clearScience.
Marianne Bouchart is the Web Producer EMEA at Bloomberg News. She also initiates data journalism and interactive visualisation projects within the organisation. As the founder of the Data Journalism Blog, she got involved in different projects such as the Data Journalism Handbook, coordinated by the European Journalism Centre and the Open Knowledge Foundation. She studied journalism at the University of Westminster and City University in London. She started her carreer as a web producer and digital marketing manager for brands ranging from music labels to global consumer brands before launching the Data Journalism Blog in April 2011. The online platform which aims at gathering data journalists, designers, programmers and other data enthusiasts around both aggregated and original content or data projects got her to be shortlisted for the XCity Awards 2012. Marianne is also recognized for the passion and enthusiasm she puts in her presentations and guest speaker appearances at various events and media schools such as City University in London or ESJ in Lille and Paris (France).
Liliana Bounegru is project manager on data journalism at the European Journalism Centre and editor of DataDrivenJournalism.net, a collection of useful resources for those who want to get started with data journalism, including relevant events, tools, tutorials, interviews and case studies. You can read more about the data journalism activities she has organized together with partners, including training and the Data Journalism Handbook, on DataDrivenJournalism.net. Liliana is also a Research MA candidate in Media Studies at the University of Amsterdam. She blogs about her academic work at lilianabounegru.org.
Ryan is a Developer Advocate at Google, focused on cloud data services. He’s been at Google for 5 years and previously helped build out the Google Apps ISV ecosystem. He recently published his first book “Getting Started with OAuth 2.0” with O’Reilly.
Laura joined NESTA in June 2009 as a policy advisor on public and social innovation.
Prior to this, Laura was a project coordinator and networks manager at the RSA, building and supporting the Fellowship as a network for civic innovation.
She has research experience working with the think tank Demos, developed a range of independent creative projects and has managed a busy London restaurant. Laura graduated from Oxford University in 2006 with a BA (Hons) in Classics.
Dr. James Cheshire is a lecturer at the UCL Centre for Advanced Spatial Analysis. James’ research focuses on the analysis and visualisation of large population datasets, and he is becoming increasingly renowned for producing visualisations with the R software platform. His work has been featured in the likes of National Geographic Magazine, The Times Atlas of London and the Evening Standard.
Helen began working in data protection and access to information at the Foreign and Commonwealth Office in 2002. She then ran the Public Access Office at the Metropolitan Police Service before moving to the Information Compliance Team at Congestion Charging.
Helen left London to raise her family in 2007 and has worked as a freelance consultant since then.
Shaun has more than 20 years of experience in the software industry, with a track record of building early stage and midsize software companies into successful market leaders. Shaun was previously VP of Product Strategy at VMware where he focused on defining VMware’s cloud application platform vision. Shaun has also held VP and Director level positions at SpringSource, Red Hat, JBoss, Princeton Softech, HP, Bluestone Software, and Primavera Systems. He holds a B.S. in Electrical Engineering from Drexel University. Follow Shaun on Twitter: @shaunconnolly.
Jacomo Corbo is the Chief Scientist for QuantumBlack, a Data Science agency that helps clients meet the analysis challenges of big data to make better decisions.
Corbo is also the Canada Research Chair in Information and Performance Management at the University of Ottawa and a Wharton Clayright Scholar at the University of Pennsylvania’s Wharton School of Business. His research has been funded by grants from the National Research Council, the Alfred P. Sloan Foundation, the Wharton Mack Center for Technological Innovation, the Wharton Customer Analytics Initiative, as well as by companies such as GE Finance and IBM.
Between January 2006 and June 2008, Corbo also served as Race Strategist and subsequently as Chief Race Strategist for the Renault F1 Team Ltd.
Corbo holds a Ph.D. from the Department of Computer Science at Harvard University, an S.M. in Applied Mathematics from the Harvard Graduate School of Engineering and Applied Sciences, and a B.Eng. in Electrical Engineering from McGill University.
Andy Cotgreave was a Tableau user for four years at the University of Oxford before joining Tableau as a Senior Product Consultant. At Oxford, Andy helped develop the university’s business intelligence program and revolutionized its reporting capabilities. He also blogged for UK consultancy, The Data Studio, posting tips and tricks for the benefit of all Tableau users. At Tableau Andy will continue spreading the good word about Tableau.
Kenneth Cukier is a senior editor overseeing data and digital products of the Economist. Previously, he was the paper’s Tokyo correspondent and before that, its technology correspondent in London. Kenneth is the coauthor of the award-winning book Big Data: A Revolution that Transforms How We Work, Live and Think, a New York Times bestseller that has been translated into over 20 languages, as well as a regular commentator in the media on business and technology and a frequent public speaker on trends in big data.
Alastair Dant, Lead Interactive Technologist – Guardian News & Media
Since joining the Guardian, Alastair has played games with the UK budget, created one of Steve Jobs’ favourite iPad apps, visualised the Wikileaks war logs and played ball with Twitter.
This builds on two decades of tinkering, encompassing early experiments with BASIC, architecting popular kids games and assembling art pieces like the Folk Songs Project.
Having won a few awards for his Guardian work, he’s recently been given the chance to build a small team aimed at pushing the boundaries of interactive news development. This has led to a series of talks and workshops in Europe and the US.
Dr. Jason Davis is an engineering director at Etsy.com. He currently leads several efforts including search, browse, advertising, recommender systems, analytics, and data infrastructure. Prior to Etsy, Dr. Davis founded Adtuitive, a venture-backed startup that provided advertising services to small online retailers. Adtuitive was acquired by Etsy in December, 2009. Dr. Davis holds a PhD from the University of Texas at Austin and a BS in computer science from Cornell University.
Dr. Davis blogs at drjasondavis.com.
Alexandra Deschamps-Sonsino is an interaction designer & entrepreneur. She is the founder of Good Night Lamp, a family of internet-connected lamps.
She also leads Designswarm an “internet of things” design studio & consultancy and works with clients who want to design next generation connected products. She also uses her expertise to help shape early business ideas around smart products. Her work has been exhibited at The Victoria & Albert Museum and the Museum of Modern Art in New York.
From 2007 to 2010, she co-founded and ran Tinker London, a smart product design studio. Focused on creating connected product experiences that linked the digital to the physical, Tinker was the first distributor of the Arduino platform in the UK, ran workshops around the world and offered design and consultancy services.
She is also involved in organising technology & design community events in London such as the Internet of Things meetup and This Happened London .
Alexandra has been focused on the “internet of things” and its implications in the design of everyday products since 2005.
Find her on twitter or see her linkedin profile.
Isabel Drost is member of the Apache Software Foundation. She is founder of the Apache Hadoop Get Together in Berlin, was co-organiser of the first European NoSQL meetup as well as the Berlin Buzzwords conference. She co-founded Apache Mahout and is active Apache Mahout committer. Isabel is actively engaged with communities of various Apache projects, e.g. Apache Lucene and Apache Hadoop. She is regular speaker at renown conferences on topics related to free software development, scalability, Apache Lucene, Apache Hadoop and Apache Mahout. Currently Isabel Drost works for Nokia Gate 5 GmbH as Software Developer.
Ted Dunning has been involved with a number of startups with the latest being MapR Technologies where he is Chief Application Architect working on advanced Hadoop-related technologies. He is also a PMC member for the Apache Zookeeper and Mahout projects. Opinionated about software and data-mining and passionate about open source, he is an active participant of Hadoop and related communities and loves helping projects get going with new technologies.
George is a historian of technology whose subjects have included the development (and redevelopment) of the Aleut kayak (Baidarka , 1986), the evolution of artificial intelligence (Darwin Among the Machines , 1997), and a path not taken into space (Project Orion, 2002). His recent Turing’s Cathedral (2012) illuminates the transition from numbers that mean things to numbers that do things in the aftermath of World War II.
Ben leads Musicmetric’s data science team in an attempt to wrangle some sanity into the Internet’s vast supply of horribly formed music data. He has a PhD from the Intelligent Sound and Music Systems group in the Computing Department at Goldsmith University of London. His work there focused on merging social and acoustic similarity spaces to drive playlist creation and related user-facing systems. He is an expert on metadata, structured data, the semantic web and recommendation systems. In his spare time, he is a co-chair of the annual international Workshop On Music Recommendation And Discovery, has given an Ignite London talk about beer styles, occasionally DJs, is an accredited beer judge and homebrews beer. He thinks bios in the third person are weird but figures that’s how they’re meant to be written.
Julie Freeman is an artist for whom technology plays an integral part of her practice. Her dynamic artworks are often data-driven, that is shaped by external influences such as humans, animals or environmental factors. Freeman’s work explores the relationship between science and the natural world, questioning the use of electronic technologies to ‘translate nature’. Freeman often works collaboratively with scientists, experimenting to transform complex processes and data sets into sound compositions, kinetic objects and animations.
Her work is held in a number of private collections, and since 1998 has been shown across the UK in venues such as the Institute of Contemporary Art, the Victoria & Albert Museum, Kinetica, the Barbican Centre and the Science Museum, and internationally in Brazil, Croatia, Lithuania, Luxembourg, Russia and the USA.
Max Gadney is Founder and Design Director at After the Flood, who specialize in data experience design.
We help clients make better use of data in their companies as well as creating great data products around media, sports and finance.
Media clients include BBC, Channel 4, UEFA and Man City FC.
We also work on internal analytics and BI projects as well as product definition work for start-ups.
Max was previously Head of Design at BBC News and a digital commissioning editor at BBC TV.
Hjalmar is a serial entrepreneur, founder of four startups in the gaming, mobile and web sectors since 1996. His company, DataMarket, provides information companies with tools to effectively publish their data and reach new audiences. Their data portal, DataMarket.com, may be the largest collection of open statistics and numerical data available online. DataMarket is based largely on Hjalmar’s vision of the need for a global exchange for such data.
Ben is a best-selling author, broadcaster, medical doctor and academic who specialises in unpicking dodgy scientific claims from drug companies, newspapers, government reports, PR people and quacks. Unpicking bad science is the best way to explain good science.
Bad Science (4th Estate) has sold over 400,000 copies, is published in 18 countries, and reached #1 in the UK paperback non-fiction charts. His book exposing bad behaviour in the pharmaceutical industry will be published in 2012 by 4th Estate.
Ben has written the weekly Bad Science Column in the Guardian since 2003. It’s archived on this site along with blogposts, columns for the British Medical Journal, and other writing.
There are lots of clips of Ben on telly here, and a talk at TEDGlobal here. The Placebo Effect is a two-part documentary series he made for BBC Radio 4. The Rise of the Lifestyle Nutritionists is another. He’s appeared on the Today programme lots of times, Any Questions, Newsnight, Start The Week, The Now Show, Loose Ends, PM, Quote Unquote, Watchdog, and various other things. You can find plenty of it if you dig around on the site, along with lectures, podcast interviews, maybe start Here.
He has given over 250 talks in the past 5 years, from comedy clubs and music festivals to universities and schools, government departments, and more. You can book him for after dinner speaking by emailing email@example.com.
He’s received lots of awards for writing, and a few honorary doctorates.
Ben is 36 and currently works full time as an academic in epidemiology. He does not see private patients.
John Graham-Cumming is a computer programmer and author. He studied mathematics and computation at Oxford and stayed for a doctorate in computer security. As a programmer he has worked in Silicon Valley and New York, the UK, Germany, and France. His open source POPFile program won a Jolt Productivity Award in 2004. John is the author of a travel book for scientists published in 2009 called The Geek Atlas, and has written articles for The Times, The Guardian, The Sunday Times, The San Francisco Chronicle, New Scientist, and other publications.
He can be found on the web at jgc.org and on Twitter as @jgrahamc.
If you’ve heard of him at all, it’s likely because in 2009 he successfully petitioned the British Government to apologize for the mistreatment of British mathematician Alan Turing.
Since finishing a BA in Information Design Alex has always worked at the intersection of design & development working to create data visualisation and interactive content for finance, advertising and media. At the Guardian he has been working to move interactive content development from Flash to open standards that work across desktop, tablet & mobile. Alex is the co-lead developer of the Miso Project.
Kristian Hammond one of the founders and CTO of Narrative Science, a company focused on the automatic generation of stories and related content based upon data. On leave from the Department EECS at Northwestern University, he is a researcher in the areas of human-machine interaction, context-driven information systems and artificial intelligence. After completing his Ph.D. in computer science at Yale University in 1986, Dr. Hammond founded The University of Chicago’s Artificial Intelligence Laboratory. In 1998, Dr. Hammond moved to Northwestern University to form Northwestern’s Intelligent Information Laboratory (InfoLab) and began focusing on the generation of narratives based on search and data analytics. In 2010 he founded Narrative Science with Larry Birnbaum and Stuart Frankel and went on leave in 2011 to help guide the development of the technology into products that are transforming the way in which data is understood and content is generated.
Since joining Tesco in 2005, Tom has led numerous programmes to improve the efficiency of operations, frequently by finding novel ways to use the company’s wealth of data.
Most recently he has been responsible for development of the supply chain, where becoming a more data-driven organisation is a central part of his strategy, including a £9m investment in 2012 to improve analytics capability.
Previously Tom worked in Strategy Consulting, Venture Capital and FMCG.
Dr. Drew Hemment, an artist, designer and researcher, is founder and CEO of the FutureEverything digital art and innovation company (est. 1995), and Associate Director of the ImaginationLancaster creative lab at Lancaster University. He directs the Data Art and Open Data Cities projects at FutureEverything, responsible for commissioning and delivering a range of data visualisation artworks, and helped to establish DataGM.gov.uk, the Greater Manchester Datastore. He is Co-I on £5M The Creative Exchange Knowledge Exchange Hub (AHRC) and Co-I on £1.9M Catalyst project (EPSRC). Shortlisted for Big Chip 2011, Winner of Lever Prize 2010, Shortlisted for Big Chip International Award for Innovation 2010 and Arts & Business Award 2010, Runner Up Lever Prize 2009, Honorary Mention at Prix Ars Electronica 2008 for the artwork Loca: Set To Discoverable. His work has been covered by New York Times, Guardian, Wall Street Journal, BBC and NBC, and he has served on many international Art Juries including UNESCO DigiArts. In 1999, awarded a PhD at Lancaster University, in 2009 elected a Fellow of the Royal Society of the Arts (UK), and in 2010 an Eyebeam resident (USA).
Felienne is a PhD student and an entrepreneur in the field of spreadsheets. Her PhD thesis centers around techniques to extract information from spreadsheets and present that in a visual way, to support users in improving and understanding them. In 2010 Felienne founded Infotron, a start up that uses the algorithms developed during the PhD project to analyze spreadsheet quality for large companies. In her spare time, Felienne volunteers as a judge for the First Lego League, a world wide technology competition for kids.
Erich Huang is Director, Cancer Research at Sage Bionetworks, an not-for-profit biomedical research organization dedicated to an open access model of scientific research. He received his undergraduate degree in History & Literature from Harvard College, and his MD and PhD degrees from Duke University. Erich has a longstanding interest in how information technology can enable better communication of complex science.
I am a Software Developer for Nokia, Berlin. The team I work for applies data-driven methods to improve Nokia’s Maps Search Engine.
Guy Hugot-Derville is the Chief Scientist and Co-Founder at Mesagraph. He shapes and directs Mesagraph technology strategy and leads the company’s research into new products and technologies with partners like Orange Telecom. As the first to deliver a topical graph engine, Guy is on the forefront of semantic search and discovery engines, popularly known as next-generation “pull” capabilities.
Prior to Mesagraph, he was a software engineer at Port25 Solutions, Inc. He was awarded a Bachelors of Science in Engineering at Polytechnical University in Paris and a Masters of Computer Science at Cornell University.
Kathryn Hurley recently joined Google as a Developer Programs Engineer for Fusion Tables. In this role, she helps spread the word about Fusion Tables by presenting at conferences and developer events. She recently worked on Google’s 2010 U.S. Election Ratings gadget. She received an MS in Web Science from the University of San Francisco and a BS in Genetics from the University of California, Davis. Prior work experience includes research in mobile and peer-to-peer computing.
Duncan Irving is a geophysicist and has a PhD in geophysics and geotechnical engineering. He has worked on supercomputing and “big data” problems around surface, subsurface and sensor data on many consulting and research projects – in the data centre, in extreme field conditions and more general upstream data management. He drives internal product development of Teradata’s capabilities in upstream E&P and combines broad domain expertise with deep technical hardware and software knowledge.
Edmund is a data scientist starting up a quantitative hedge fund. He is armed with a PhD in statistical signal processing and well tempered enthusiasm for functional programming.
Nicolas Kayser-Bril, is CEO and co-founder of Journalism++ , a company that brings web technologies in journalistic workflows. Before founding Journalism++ in late 2011, Nicolas was in charge of datajournalism at OWNI, were he led several experiments in crowdsourced data gathering. A self-taught programmer and journalist, Nicolas holds a degree in Media Economics.
Russ Kennedy brings more than 20 years experience in the storage industry to Cleversafe as the company’s Vice President of Product Strategy, Marketing and Customer Solutions. Having rolled up his sleeves working on automated tape libraries, Russ is still attracted to the technological challenges that have shaped the industry and particularly to the innovative approach that Cleversafe delivers to storage.
Russ joined the company initially in 2007 and left in 2009, staying on in an advisory role. In 2011, Russ rejoined the company seeing a clear opportunity to solve the storage needs surrounding the exponential growth of big data and the unique impact that Cleversafe delivers over traditional systems.
Previously, Russ served as the Vice President of Competitive Intelligence at CA Technologies, and was the Senior Director of Engineering and Product Management at Thin Identity Corporation.
Russ has an MBA from the University of Colorado at Denver and a bachelor’s degree in Computer Science from Colorado State University.
Sébastien Lefebvre founded Mesagraph in 2010. As CEO, he leads the company to deliver products and services that harness the power of social media to help organizations and individuals make faster, better decisions.
Prior to founding Mesagraph, Lefebvre held executive positions with innovative companies, including Mandriva, Linbox and Videotron. He was awarded a Masters of Computer Science and Engineering at the University of Nancy.
Thomas Levine has been hacking with computers since he was young. In the process, he noticed back and wrist pain, prompting him to research physical ergonomics of computer use and, in turn, to study statistics. His favorite color is pink.
Mirko Lorenz is a journalist, information architect and trainer based in Cologne, Germany. He has been actively developing training concepts for data journalism for the last two years. Additionally he is involved in cloud computing development, with a focus on media use cases for big data, visualization and digital publishing. He is a member of the innovation team at Deutsche Welle.
Steve Loughran is a member of technical staff at Hortonworks, where he works on leading-edge issues with the Hadoop ecosystem, including service failure modes and availability.
Prior to joining Hortonworks he worked at HP Laboratories on large-scale distributed systems, including cloud computing infrastructures. He is the author of Ant in Action, and is one of the very few UK-based Hadoop committers.
Mark Madsen designs and build analysis and decision support systems, and data management and access infrastructure. His research focus is on emerging technology and practices in analytics, decision support, information management, user experience for data applications. He speaks at conferences worldwide on anything data.
Mark focuses on two types of work: using data to make decisions and manage organizations, and building data technology infrastructure. A big part of making decisions and using data in a corporate setting is ensuring that the right data capture and data delivery infrastructure is in place to manage the business. As a result, Mark does as much information strategy and IT architecture work as he does performance management and decision support. His spare-time projects involve research on history of communication technology adoption and fieldwork in ecology, specializing in botany.
Tim Mallalieu is the Group Program Manager of Microsoft’s Big Data Solutions. The “Big Data” team is responsible for Microsoft’s innovations around SMP & MPP Data Warehouse and Hadoop.
I head the Innovation & Economic Growth team at Nesta, an independent charity with a mission to make the UK more innovative. I work on innovation in businesses using new technologies.
I joined NESTA in 2009 and have authored reports on a semiconductor design cluster and the need for collaboration in the biomedical industry. I also co-ordinate and host a series of events on emerging technologies called ‘Hot Topics’. Big Data is a theme of my work for 2012.
Dipl. Politologe (equv. MA Pol. Sc)
Entrepreneur (OpenDataCity | Lokaler)
Activist (Digitale Gesellschaft e.V.)
Liam Maxwell is the national technology advisor in the UK government, where his role is to create the conditions to make the UK the best place for business to invest in technology and ensure the civil service can make the best use of emerging technologies to deliver effective digital government. Previously, Liam was chief technology officer for HM government. His teams at the Government Digital Service established the Common Technology Services function, which delivered a range of projects from the Crown Hosting Service to the Public Services Network, the Digital Marketplace, and the spend control and delivery assurance function. The changes to technology that his team introduced across the government helped to save £3.5B in the last four years of the 2010 parliament.
Jason McFall is the CTO at Causata, a startup using Big Data to automate real-time marketing. This combines machine learning with large scale data analysis and structured experimentation, to intelligently market to individuals.
Jason started out as an experimental physicist, working on Particle Physics collider experiments. the connection between the two jobs is uncanny: using big data and low latency technology to analyse data fast, and of course to perform and understand experiments with rigorous statistical confidence
Brendan is a Data Scientist with EMC UK&I, responsible for assisting clients transform their businesses to be data driven organisations. Previously at EMC, he has been a director at EMC Consulting and led the Presales organisation. Brendan has a degree in Electrical and Electronic Engineering and a postgraduate diploma in Applied Psychology. He was one of the first in the world to be certified as an EMC Proven Professional Data Scientist (the only vendor neutral certification in the world on Data Science and Big Data Analytics).
Siddartha has been crunching large data sets at Google since 2005 for a variety of products and as a physics grad student before that. He has worked with or on almost every data processing framework at Google and is still looking for ways to make his job easier.
Aron Pilhofer is editor of Interactive News at The New York Times. His team blends journalism and technology to enhance The Times’s reporting online through social media, community and dynamic, data-driven Web applications. He is co-founder of two news startups: DocumentCloud.org and Hacks and Hackers. Aron joined The Times in 2005 as a projects editor on the paper’s newly expanded computer-assisted reporting team, where he specialized in stories related to money, politics and influence for the politics desk and Washington bureau. Prior to joining The Times, Pilhofer was database editor at the Center for Public Integrity in Washington, on the national training staff of Investigative Reporters and Editors and worked for a number of years as a statehouse and projects reporter for Gannett newspapers in New Jersey and Delaware.
Jake Porway is a machine learning and technology enthusiast who loves nothing more than seeing good values in data. He is the founder and executive director of DataKind, an organization that brings together leading data scientists with high impact social organizations to better collect, analyze, and visualize data in the service of humanity. Jake was most recently the data scientist in the New York Times R&D lab and remains an active member of the data science community, bringing his technical experience from his past work with groups like NASA, DARPA, Google, and Bell Labs to bear on the social sector. Jake’s work has been featured in leading academic journals and conferences (PAMI, ICCV), the Guardian, the Stanford Social Innovation Review, and he has been honored as a 2011 PopTech Social Innovation Fellow and a 2012 National Geographic Emerging Explorer. He holds a B.S. in Computer Science from Columbia University and his M.S. and Ph.D. in Statistics from UCLA.
Kim Rees is a founding partner of Periscopic: http://www.periscopic.com, an award-winning information visualization firm. Their work has been featured in the MoMA as well as several online and print publications, including CommArts’ Interactive Annual, The Information Design Sourcebook, Print magazine, and numerous websites, blogs, and regional media outlets. Periscopic’s body of work was nominated for the Cooper-Hewitt National Design Awards.
Kim is a prominent individual in the information visualization community. She has published papers in Parsons Journal of Information Mapping, was an award winner in the VAST 2010 Challenge, and is an advisor to the Congressional Budget Office. Kim has presented at several industry events including Strata, Wolfram Data Summit, Eyeo, VisWeek, and various data visualization groups among others. Recently she has also been an adviser on an upcoming documentary film and was the Technical Editor for Visualize This by Nathan Yau. Kim received her BA in Computer Science from New York University.
Naomi B. Robbins is the author of Creating More Effective Graphs. She is a consultant, keynote speaker, and seminar leader who specializes in the graphical display of data. She trains employees of corporations and organizations on the effective presentation of data with customized programs. She also reviews documents and presentations for clients, suggesting improvements or alternative presentations as appropriate. Naomi received her Ph.D. in mathematical statistics from Columbia University, M.A. from Cornell University, and A.B. from Bryn Mawr College. She had a long career at Bell Laboratories before forming NBR, her consulting practice.
Ian is an engineer at Neo Technology, currently working on research and development for future versions of the Neo4j graph database. Prior to joining the engineering team, Ian served as Neo’s Director of Customer Success, managing training, professional services and support, and working with customers to design and develop graph database solutions. He is a co-author of ‘REST in Practice’ (O’Reilly) and a contributor to ‘REST: From Research to Practice’ (Springer) and ‘Service Design Patterns’ (Addison-Wesley). He presents at conferences worldwide on REST and the graph capabilities of Neo4j, and blogs at http://iansrobinson.com
Simon Rogers is editor of the Guardian’s Datablog and Datastore, an online data resource which publishes hundreds of raw datasets and encourages its users to visualise and analyse them. He is the author of Facts are sacred: the power of data available now on Kindle. Simon is also a news editor on the Guardian, working with the graphics team to visualise and interpret huge datasets. He was closely involved in the Guardian’s exercise to crowdsource 450,000 MP expenses records and the organisation’s coverage of the Afghanistan Wikileaks war logs. Previously he was the launch editor of the Guardian’s online news service and has edited the paper’s science section. He has edited two Guardian books: How Slow Can You Waterski and The Hutton Inquiry and its impact. Simon has just been awarded the Oxford University Internet Institute’s award of ‘Best Internet Journalist’ and was recently honoured at the Knight Batten awards for journalistic innovation. The Datablog and Datastore have won awards in 2011 for innovation from the UK’s Online Media Awards and the Newspaper Awards. In 2010, Simon received a special commendation from the Royal Statistical Society in its awards for journalistic excellence.
Irene Ros is an engineer and artist from Cambridge, MA. Her work focuses on designing and building engaging visualizations and user interfaces using web technologies. Her artwork focuses on the intersection of humanity and technology. Recently her work has been written about in the New York Times and the New York Times Open Blog, as well as Fast Company.
Irene is the creator of NYTWrites, a visualization tool that explores The New York Times authorship. She is also the co-creators of Many Bills, a visualization of congressional legislation aimed at lowering the barrier between the U.S legislative process and citizens. Many Bills combines Visualization techniques with Natural Language Processing and Information Retrieval to offer varying amounts of detail to users. Irene is also one of the developers on the Many Eyes project, a collaborative visualization tool that allows users to gather data, visualize it and discuss their visualizations on and off the site. She was also a part of the effort to launch The New York Times Visualization Lab, a derivative of Many Eyes. Irene also explored MBTA usage patterns in her visualization of A Day of MBTA.
Irene Ros is currently a Senior Programmer at Bocoup and has been collaborating with The Guardian on the development of the Miso Project.
Duncan has been a data miner since the mid 1990s. He was Director of Advanced Analytics at Teradata until 2010, leaving to become Data Director of Experian UK. He recently rejoined Teradata to lead their European Data Science team.
At Teradata he has been responsible for developing analytical solutions across a number of industries, including warranty and root cause analysis in manufacturing, and social network analysis in telecommunications. These solutions have been developed directly with customers and have been deployed against some of the largest consumer bases in Europe.
In his spare time Duncan has been a city Councillor, chair of a national charity, founded an award winning farmers’ market, and is one of the founding Directors of the Society of Data Miners.
Mr Satterly has served in a variety of roles including developer, engineer, architect and CTO over his 23 year career for Fortune 500 companies. He has been a long time big data user, even before it was the cool thing to do. More recently he was able to revolutionize the way a leading online travel agency delivers their core web applications that resulted in improved user experience. At Expedia, he created a highly scalable and flexible Big Data environment using best in breed tools and as a result was able to retire 35 other systems. With Splunk, he saved $14M in the first 12 months and avoided many more $MM in outages. He is a recognized expert in the field of big data and has presented at many conference globally on the topic. Eddie has a BS Comp Sci from Indiana University.
Dr. Edouard Servan-Schreiber is Director for Solution Architecture at 10gen, advising customers on how to make MongoDB make their business simpler, faster, and better.
Previously, Edouard was director for cross-channel analytics at Teradata, leading projects in advanced analytics and predictive modeling with customers in all heavily data-driven industries such as telco, retail, finance, high tech manufacturing.
Edouard’s specialty is to help customers extract business value from their data through the effective use of technology and analytics.
Edouard began practicing artificial intelligence and statistical learning models at Carnegie Mellon University for his bachelor’s degree, before going to UC Berkeley for his PhD in Computer Science.
John Sheridan is Head of Legislation at The National Archives. His academic background is in Computer Science and Information Technology. John is passionate about open data and the possibilities afforded by using Linked Data standards, particularly to public sector information. He is a former co-chair of the W3 e-Government Interest Group.
Arfon is the technical lead for Galaxy Zoo, a citizen science project that has over 200,000 people helping to analyse large-scale astronomy datasets.
William Spooner is a seasoned operational bioinformatician with a track record of delivering tools for high-throughput genomics research. Having worked previously on Ensembl, BioMart, Gramene and WormBase projects at the European Bioinformatics Institute, the Wellcome Trust Sanger Institute and Cold Spring Harbor, his current focus is on making life easier for users of open source/data in commercial settings. His strategic thinking is driven by the huge opportunities for data analysis in the life sciences provided by the near simultaneous arrival of NGS and cloud computing.
Founder and Principal Analyst of Kinran Limited, an independent consultancy focussed on social app analytics. Previously manager of pan-European technology practice at management consutlants A.T. Kearney, Principal Analyst for internet commerce enablers at Europe’s leading tech analyst firm, Ovum, and Associate Dean at Darwin College Cambridge. Stanford Ph.D, Vassar A.B. Lapsed Montrealer. Currently helps to organise Social Developers London (formerly London Facebook Developer Garage.
Gavin is the founder and CEO of AMEE, which provides trusted, comprehensive and timely answers to millions of environmental questions.
Gavin is a serial entrepreneur with over a decade of experience in building data businesses, 20 years in internet and software development, and a background in astrophysics.
His experience spans business, technology, science and media. Leading projects with organisations as diverse as the UK Government, SAS, Google, BBC, Rolls Royce, LogicaCMG, Shell and Christian Aid, he has broad and deep knowledge of how data can change the world.
AMEE aggregates and automates access to the world’s environmental and energy information. Its unique partnerships with world-leading standards providers including the WRI, DECC and CDP, make AMEE the most comprehensive and advanced web-based solution for greenhouse gas emissions, energy efficiency, and other environmental data.
Moritz Stefaner works as a “truth and beauty operator” on the crossroads of data visualization, information aesthetics and user interface design. With a background in Cognitive Science (B.Sc. with distinction, University of Osnabrueck) and Interface Design (M.A., University of Applied Sciences Potsdam), his work beautifully balances analytical and aesthetic aspects in mapping abstract and complex phenomena. He is especially interested in the visualization of large–scale human activity. In 2010, he was nominated for the Design Award of the Federal Republic of Germany; his work has been exhibited at Venice Biennale of Architecture, SIGGRAPH, ars electronica and Max Planck Science Gallery. He is part of the advisory boards for the Places and Spaces Exhibit and the Digital Communities category at Prix Ars Electronica, and serves as a reviewing expert for the Future and Emerging Technologies programme of the European Commission. He has co-authored books for publishers like O’Reilly and Springer and has spoken and lectured on numerous occasions on the topic of information visualization. Find his personal portfolio at http://moritz.stefaner.eu, he also blogs at http://well-formed-data.net and http://infosthetics.com/
Jeni Tennison is the technical director of the Open Data Institute. As a developer, Jeni specializes in open data publishing and consumption, including XML, JSON, and linked data APIs. She trained as a knowledge engineer and holds a PhD in collaborative ontology development. Jeni was the technical architect and lead developer for legislation.gov.uk and worked on the linked data aspects of data.gov.uk. Jeni is author of several technical books and was appointed to the W3C’s Technical Architecture group in 2011. She is also a member of the Open Data user group and the UK Government Linked Data group.
Kaitlin is the director of the Mozilla Science Lab, a new open science initative at Mozilla to help researchers use the power of the web to change science’s future. She’s previously worked at Digital Science, a technology company out of Macmillan Publishers, as well as Creative Commons, where she managed their science program. She also advises the UK government on digital technology and data-intensive science and business, and is on the board of DataKind UK. You can follow her at @kaythaney.
CEO and Co-founder of Atbrox – http://atbrox.com/about/, delivering big data, hadoop and search services for enterprise customers (Atbrox is an AWS and Cloudera Partner).
- Tech.Lead/Manager Google (launched Google News for iPhone)
- PhD Computer Science – http://amundtveit.info/publications/
JD is a User Experience Lead at Salesforce where he specializes in search, online communities and collaboration. Prior work experience includes Director of UX at MarkLogic Inc, and NewCity, Inc. where he was the UX lead for the Virginia Bioinformatics Institute’s most successful project – resulting in a $27 million grant. (largest in the history of Virginia Tech) Awards include multiple IMA, CASE and ADDY’s. A graduate of Carnegie Mellon’s MHCI program, JD is frankly super stoked about how Big Data and design will affect our future.
Simon Wardley is a researcher for the Leading Edge Forum focused on the intersection of IT strategy and new technologies. Simon is a seasoned executive who has spent the last 15 years defining future IT strategies for companies in the FMCG, retail, and IT industries—from Canon’s early leadership in the cloud-computing space in 2005 to Ubuntu’s recent dominance as the #1 cloud operating system. As a geneticist with a love of mathematics and a fascination for economics, Simon has always found himself dealing with complex systems, whether in behavioral patterns, the environmental risks of chemical pollution, developing novel computer systems, or managing companies. He is a passionate advocate and researcher in the fields of open source, commoditization, innovation, organizational structure, and cybernetics.
Simon’s most recent published research, “Clash of the Titans: Can China Dethrone Silicon Valley?,” assesses the high-tech challenge from China and what this means to the future of global technology industry competition. His previous research covers topics including the nature of technological and business change over the next 20 years, value chain mapping, strategies for an increasingly open economy, Web 2.0, and a life-cycle approach to cloud computing. Simon is a regular presenter at conferences worldwide and has been voted one of the UK’s Top 50 Most Influential People in IT in Computer Weekly’s 2011 and 2012 polls.
Noel has over fifteen years experience in software architecture and development, and over a decade in machine learning and data mining. Examples of the projects he’s been involved with include one of the first commercial products to apply machine learning to the Internet (eventually acquired by Omniture), a BAFTA award winning website, and a custom CMS used daily by thousands of students.
Noel is an active writer, presenter, and open source contributor. Noel has a PhD in machine learning from the University of Birmingham.
Tom White is one of the foremost experts on Hadoop. He has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. His book Hadoop: The Definitive Guide (O’Reilly) is recognized as the leading reference on the subject. In 2011, Whirr, the project he founded to run Hadoop and other distributed systems in the cloud, became a top-level Apache project.
Tom is a software engineer at Cloudera, where he has worked since its foundation, on the core distributions from Cloudera and Apache. Previously he was an independent Hadoop consultant, working with companies to set up, use, and extend Hadoop. He has written numerous articles for O’Reilly, java.net and IBM’s developerWorks, and has spoken at several conferences, most recently at ApacheCon and OSCON in 2011. Tom has a Bachelor’s degree in Mathematics from the University of Cambridge and a Master’s in Philosophy of Science from the University of Leeds, UK.
Edd Wilder-James is a technology analyst, writer, and entrepreneur based in California. He’s helping transform businesses with data as VP of strategy for Silicon Valley Data Science. Formerly Edd Dumbill, Edd was the founding program chair for the O’Reilly Strata conferences and chaired the Open Source Convention for six years. He was also the founding editor of the peer-reviewed journal Big Data. A startup veteran, Edd was the founder and creator of the Expectnation conference-management system and a cofounder of the Pharmalicensing.com online intellectual-property exchange. An advocate and contributor to open source software, Edd has contributed to various projects such as Debian and GNOME and created the DOAP vocabulary for describing software projects. Edd has written four books including O’Reilly’s Learning Rails.
Chief Executive and Co-Founder of QuantumBlack, a Data Science agency that blends strategy, analytics and creative to help firms derive insight from data to make faster, smarter decisions.
Client projects include Boeing to improve yield in strategic R&D investment, McKinsey & Co in developing cutting-edge business analytics, Microsoft in creating new visual language for biological research and several Formula 1 racing teams building race strategy engines.
Prior to QuantumBlack led several data driven start-ups including SmithBayes, a spin-out from McLaren the Formula 1 racing team. Started career in real-time trading systems at Reuters and ABN AMRO.
Matt Wood is the Product Manager for Big Data and HPC at Amazon, where he discusses the technical and business aspects of data driven computing throughout Europe. With a background in the life sciences, Matt is interested in helping teams of all sizes bring their ideas to life through technology. Before joining Amazon he built web-scale search engines at Cornell University, sequenced DNA in Hinxton and developed analytics software in Cambridge. He is a frequent speaker at international conferences, a blogger, published author and an advocate of reproducible research.
Zena Wood is employed as a lecturer in Computer Science at the University of Exeter, she works in the field of Applied Ontology and spatiotemporal reasoning. Entitled “Detecting and Identifying Collective Phenomena within Movement Data,” her PhD focused on identifying what is meant by the term collective, the different types of collective that exist and developing a method that allowed the identification of such phenomena within large spatiotemporal datasets.
Zena has continued to work within these fields and is currently developing similar techniques that can be used to study human and animal behaviour. In addition to her roles as lecturer, Zena also coordinates the eskills’s endorsed undergraduate and postgraduate IT Management for Business degrees and Computer Science outreach at Exeter.
For information on exhibition and sponsorship opportunities, contact Susan Stewart at firstname.lastname@example.org or +1 (707) 827-7148
For information on trade opportunities contact Kathy Yu at mediapartners
For media-related inquiries, contact Maureen Jennings at email@example.com
View a complete list of Strata contacts.