- GraphLab: Large-Scale Machine Learning on Graphs Carlos Guestrin, Joseph Gonzalez
- Getting a Handle on Hadoop and its Potential to Catalyze a New Information Architecture Model Milan Vaclavik
Strata + Hadoop World is where big data's most influential decision makers, architects, developers, and analysts gather to shape the future of their businesses and technologies. Since joining forces last year, Strata + Hadoop World is also one of the largest gatherings of the Apache Hadoop community in the world, with emphasis on hands-on and business sessions on the Hadoop ecosystem. If you want to tap into the opportunities brought by big data, data science, and pervasive computing, you'll want to be there.
The future belongs to those who understand how to collect and use their data successfully. And that future happens at Strata.
“In a crowded market place of "Big Data" conferences, Strata has firmly established itself as the place where you go to meet people who think and do data science.”
“One of the most valuable events to advance my career.”
“Everyone in big data is there”
“A must for anyone who wants a peek into the future of Big Data."
“At first I was dizzied by the scope of the discussion. The technology gets very deep. The analytical techniques go quite far. You can be anyone—from a person like me who is just learning, to someone who has a PhD in machine learning. The difference in expertise is great, and people are coming from all kinds of different spaces with different research agendas. I found that both dizzying and fascinating. Strata helped me to develop a vocabulary for what I was getting into, and helped me to see where the horizons were and how to map out the space in front of me.”
“There were all the techie bits to delight the geek heart but apart from that there were plenty of presentations on topics and products that were just amazing and eye opening.”
“As an event it is almost perfect for me; a group of like-minded interested people come together and enjoy deep and meaningful conversations about the industry.”
“There was a lot to like...two days of inspirational, eye-opening, even slightly terrifying keynotes...Bring on Strata 2013!”
“There was a palpable sense of excitement in the air. Obviously most of the attendees were already ‘data’ aficionados, but it's clear that ‘data’ in various forms is on the radar for governments, large corporations, and the developer communities.”
“The O'Reilly Strata conferences are always great fun to attend, and this latest installment in New York City is no exception. This one is super-busy though; the conference has been sold out for weeks—and not just marketing-sold-out, it's fire-department-sold out. It's non-stop conversations and presentations, and it's tough to move through the hallways in between...a great conference.”
“There is no doubt that the conference is a great magnet for top big data talent and that it has served well the needs of its audience, hungry for inspiring content on the latest and greatest big data and data science developments.”
“...an amazing event with the latest and greatest of the big data community dropping knowledge and taking names”
Strata attendees have job titles including:
is a founding program chair of Strata, and chaired the O'Reilly Open Source Convention for six years. A technologist and writer, Edd is currently VP Strategy at Silicon Valley Data Science. Edd was the editor of XML.com for many years, and program chair for the XML Europe and XTech conferences.
is the founder of Bitcurrent, a research firm focused on emerging technologies. He's founded a variety of startups, and technology accelerators, including Year One Labs, CloudOps, Rednod, Coradiant (acquired by BMC in 2011) and Networkshop. He's a frequent speaker and writer on subjects such as entrepreneurship, cloud computing, Big Data, Internet performance and web technology, and has helped launch a number of major conferences on these topics.
Alistair has co-authored three books on Internet technology, including Web Operations (2010, O'Reilly), Complete Web Monitoring (2009, O'Reilly), and Managing Bandwidth: Deploying QOS in Enterprise Networks (1999, Prentice-Hall). A product manager by trade, he's also an advisor to a number of venture firms and the founder of the Bitnorth conference.
Renee DiResta is a Principal at O'Reilly AlphaTech Ventures (OATV), where she evaluates seed-stage investments. Prior to joining OATV in June of 2011, Renee spent seven as a trader at Jane Street Capital, a quantitative proprietary trading firm in New York City. She is interested in meeting interesting startups, data science, and improving liquidity and transparency in private markets.
Laurel Ruma is the Director of Talent for O'Reilly Media. Most recently she co-chaired Where 2.0, OSCON Java, and Gov 2.0 Expo. She joined the company in 2005 after working for five years at various IT analyst firms in the Boston area. Laurel is also the co-editor of Open Government, published by O'Reilly.
Anna Smith is a resident data scientist at bitly in New York while being in absentia from the University of Oregon physics doctorate program. Recently, she has published in both Forbes and Publications of the Astronomical Society of Australia. Her interests include manipulating data and catching up on the latest celebrity gossip.
Max Shron is a New York-based data strategist. He provides expertise and mentorship ranging from specification design and platform architecture to strategy execution, to organizations across a wide gamut of sizes and industry verticals. This work encompasses a complete data pipeline including definition, collection, analysis, visualization, and insight. Max previously was lead data scientist at New York-based OkCupid, and participated as the big-data side of its successful OkTrends blog. His work has appeared worldwide, in outlets including the New York Times, Chicago Tribune, Huffington Post and WNYC. Max holds a degree in Mathematics from the University of Chicago.
Noah Iliinsky strongly believes in the power of intentionally crafted communication. He has spent the last several years researching, writing, and speaking about best practices for designing visualizations, informed by his graduate work in user experience and interaction design. He is a frequent speaker in both industry and academic contexts. He has a master’s in Technical Communication from the University of Washington, and a bachelor's in Physics from Reed College. Noah works as a Visualization Expert at IBM's Center for Advanced Visualization.
Julie Steele is the Content Editor for Strata at O'Reilly Media. She is co-author of Beautiful Visualization and Designing Data Visualizations. She finds beauty in exploring complex systems, and thinks in metaphors. She is particularly drawn to the visual medium as a way to understand and transmit information.
Mona Vernon leads the Emerging Technologies group at Thomson Reuters. ]Mona is leading the charge on how to create an effective culture of Innovation at Thomson Reuters with a focus of external innovation and reaping the benefits of open innovation. She created and is running several successful initiatives that fall under the broad concept of “intrapreneurship” or applying “lean start-up” principles for the enterprise. She has also been instrumental in creating external innovation partnerships between Thomson Reuters on emerging and disruptive technologies. Mona holds a B.S. and M.S. in Mechanical Engineering from Tufts University and a S.M. from M.I.T. where her research focused on the role of customer experience in digital business strategy. Prior to joining Thomson Reuters, Mona worked in Technology start-ups in product development and management roles.
Mark Madsen designs and builds analysis and decision support systems, and building data management and access infrastructure. Research focus these days is on analysis techniques, emerging technology and practices in analytics, BI, information management, user experience for data access & delivery applications. I speak at a lot of conferences on anything data, with a bunch of history of science and technology mixed in.
Eric Baldeschwieler served as VP Hadoop Software Engineering for Yahoo!, where he led the evolution of Apache Hadoop from a 20 node prototype to a 42,000 node service that is behind every click at Yahoo!. Eric also served as a technology leader for Inktomi’s web service engine, which Yahoo! acquired in 2003. Prior to Inktomi, Eric developed software for video games, video post production systems and 3D modeling systems. Eric has a Master’s degree in Computer Science from the University of California, Berkeley and a Bachelor’s degree in Mathematics and Computer Science from Carnegie Mellon University.
Tom White has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. He is the author of "Hadoop: The Definitive Guide" for O'Reilly. Previously he worked as an independent consultant specializing in Hadoop, and before that was co-founder and Lead Developer at Kizoom, a UK mobile application startup. Tom has a Bachelor's degree in Mathematics from the University of Cambridge, and a Master's degree in History and Philosophy of Science from the Universities of Leeds, UK, and Florence, Italy.
Eric Sammer is currently a Principal Solution Architect at Cloudera where he helps customers plan, deploy, develop for, and use Hadoop and the related projects at scale. His background is in the development and operations of distributed, highly concurrent, data ingest and processing systems. He's been involved in the open source community and has contributed to a large number of projects over the last decade.
Martin Hall is co-founder, Chairman & Executive Vice President of Corporate Development at Karmasphere. He brings a strong entrepreneurial track record and a history of pioneering new Internet technologies and markets. Prior to founding Karmasphere, Martin was a founder of Aventail, a leading computer security company acquired by SonicWall. Prior to that, he was the founding CEO of Stardust, an Internet technology services company sold to Penton Media. Martin has chaired and participated in a number of industry groups including WinSock, Quality of Service, Internet Multicast and Wireless Multimedia Forums. He holds a Masters of Computer Science from Staffordshire University in Stafford, England.
Andrew Musselman is Chief Data Scientist in the global big data practice at Accenture. His background is in math, front- and back-end web, recommenders, and other large-scale modeling and prediction systems. In addition to building systems for clients, Andrew builds internal tools for performing data science and engineering quicker and more rigorously, and does recruitment and training in a growing team. He is a big fan of Hadoop, Pig, and Mahout, and actively promotes new tools within the firm and with clients.
Justin Borgman is Co-Founder and CEO of Hadapt. Prior to Hadapt, Justin led product development for COVECTRA, an anti-counterfeit technology firm. Before that, Justin founded an online social media company and spent the first six years of his career as a software developer at MIT Lincoln Laboratory and Raytheon.
Ted Dunning has been involved with a number of startups with the latest being MapR Technologies where he is Chief Application Architect working on advanced Hadoop-related technologies. He is also a PMC member for the Apache Zookeeper and Mahout projects. Opinionated about software and data-mining and passionate about open source, he is an active participant of Hadoop and related communities and loves helping projects get going with new technologies.
Reynold Xin is an Apache Spark committer and the lead developer for Shark and GraphX, two computation frameworks built on top of Spark. He is also a co-founder of Databricks. Before Databricks, he was pursuing a PhD focusing on large scale data systems in the UC Berkeley AMPLab.
Lynn Cherny has a Ph.D. from Stanford and an M.Phil. from Cambridge University. She has 19 years of experience managing and doing interface design and data analysis at companies including TiVo, Excite, Adobe, The MathWorks, and AT&T Labs. Her consulting work now focuses on data analysis and visualization.
Ben Lorica is the Senior Analyst in the Market Research Group at O'Reilly Media, Inc.. He has applied Business Intelligence, Data Mining and Statistical Analysis in a variety of settings including Direct Marketing, Consumer and Market Research, Targeted Advertising, Text Mining, and Financial Engineering. His background includes stints with an investment management company, internet startups, and financial services. At O'Reilly, Ben works on custom research and consulting projects, open source data warehousing and analytics.
For exhibition and sponsorship opportunities, contact Susan Stewart at email@example.com
For information on trade opportunities with O'Reilly conferences email mediapartners
For media-related inquiries, contact Maureen Jennings at firstname.lastname@example.org
View a complete list of Strata + Hadoop World 2013 contacts