New speakers are added continuously. Please check back to see the latest updates to the Strata Conference + Hadoop World program.
Jim Adler is the Chief Privacy Officer and General Manager of Data Systems at Intelius. As part of the executive leadership team, Jim is the chief consumer advocate for the company as well as responsible for the data systems that power Intelius’ products. Jim is a national leader on privacy, cryptography, and security issues having testified before the Federal Trade Commission, U.S. Congress, National Institute of Standards and Technology and leading academic institutions. Jim also serves on the advisory boards of the Future of Privacy Forum and Helios Voting. Jim blogs and tweets about privacy and data at jimadler.me and @jim_adler.
Prior to Intelius, Jim served as president and chief technology officer at Identity.net, an Internet company giving consumers control of... Read More.
Joseph Adler has many years of experience in data mining and data analysis at companies including DoubleClick, American Express, and VeriSign. He graduated from MIT with an B.Sc. and M.Eng in Computer Science and Electrical Engineering. He is the inventor of several patents for computer security and cryptography, and the author of “Baseball Hacks” and “R in a Nutshell”. Currently, he is a senior data scientist at LinkedIn.
Kiyan joined WibiData in 2011. He holds a BS in Computer Science from Penn State and a MS in Computer Science from Cornell. His graduate research focused on applying large scale data mining and machine learning to the areas of optimization and multi-agent systems. Kiyan enjoys writing, baking, comic books and video gaming when he gets the chance.
Jonathan Alexander is VP Engineering at Vocalocity, a leader in hosted VoIP for small businesses. Previously he built and managed software teams at vmSight which was acquired by Liquidware Labs, and Radnet which was acquired by Epiphany. He studied computer science at UCLA, and began his career writing software for author Michael Crichton. He is the author of Codermetrics: Analytics for Improving Software Teams, 2011 from O’Reilly.
Alasdair Allan is a scientist, author, hacker, tinkerer, and journalist who has recently been spending a lot of time thinking about the Internet of Things, which he thinks is broken. He is the author of a number of books and sometimes also stands in front of cameras. You can often find him at conferences talking about interesting things or deploying sensors to measure them. A couple of years ago, he rolled out a mesh network of five hundred sensor motes covering the entirety of Moscone West during Google I/O. He’s still recovering. A few years before that, he caused a privacy scandal by uncovering that your iPhone was recording your location all the time, which caused several class-action lawsuits... Read More.
Jesper develops experimental online services designed to introduce emotional contexts into online relationships, creating more authentic experiences. He is the co-founder of Bloom Studios, developing novel data interface applications for web and tablet platforms. He is also an accomplished data scientist, working on problems including home valuations for Trulia, credit card fraud for Visa, and social network analysis for Visible Path. Jesper speaks frequently at international technology and design conferences and has appeared in print and broadcast media for projects like Avoidr, Freerisk, and his Foursquare privacy hack. He holds a B.Sc. in Physics from Haverford College and an M.B.A. in Econometrics from University of Chicago.
I am a Curriculum Developer and Instructor with Cloudera. I run our local developers group. I like to find creative ways of using technology to solve practical problems.
Dr. Amr Awadallah
Amr is Co-Founder and CTO of Cloudera. Prior to Cloudera Amr was an Entrepreneur in Residence at Accel Partners. Before that he served as Vice President of Engineering at Yahoo!, and led a team that used Apache Hadoop extensively for data analysis and business intelligence across the Yahoo! online services. Amr joined Yahoo! after they acquired his first startup, VivaSmart, in mid-2000. Amr holds a Bachelor’s and Master’s degrees in Electrical Engineering from Cairo University, Egypt, and a Doctorate in Electrical Engineering from Stanford University.
Roger Barga is general manager and director of development at Amazon Web Services, where he is responsible for Kinesis data streaming services. Before joining Amazon, Roger was in the Cloud Machine Learning group at Microsoft, where he was responsible for product management of the Azure Machine Learning service. Roger is also an affiliate professor at the University of Washington, where he is a lecturer in the Data Science and Machine Learning programs. Roger holds a PhD in computer science, has been granted over 30 patents, has published over 100 peer-reviewed technical papers and book chapters, and has authored a book on predictive analytics.
Dr. David W. Bauer Jr. has over 20 years of experience in the field of distributed computing, specializing in HPC, large-scale data analysis and experiment design. Dr. Bauer is a pioneer in the field of HPC, BigData and Cloud Computing; he developed a cloud computing platform for the Defense Information Systems Agency (DISA) in 2006, and currently is the Chief Architect and Technical Director for the DCGS-Army Standard Cloud. Dr. Bauer is only one of a handful of people in the world who has built applications harnessing the computing power of hundreds of thousands of processing elements, and regularly works with peta-scale data sets.
Stephanie Beben is an analytics engineer and developer at Booz Allen Hamilton with two years experience designing and implementing solutions to big data problems using cloud technologies for U.S. government clients.
Prior to joining Booz Allen Hamilton, Stephanie received a M.S. in Mathematics from Texas A&M University.
Collin Bennett is a principal at Open Data
Group. In three and a half years with the company, Collin has worked on the open source Augustus scoring engine and a cloud-based environment for rapid analytic prototyping called RAP. Additionally, he has released open source projects for the Open Cloud Consortium. One of these, MalGen, has been used to benchmark several parallel computation frameworks. Previously, he led software development for the Product Development Team at Acquity Group, an IT consulting firm head-quartered in Chicago. He also worked at startups Orbitz (when it was still was one) and Business Logic Corporation. He has co-authored papers on Weyl tensors, large data clouds, and high performance wide area cloud testbeds. He holds degrees in English,... Read More.
As the Director of Data Visualization at GfK Custom Research (a global market research firm), Bitsy designs data visualization applications to tell compelling stories about research data. In addition to developing new methodologies and templates for current GfK design processes, she also educates colleagues on current and emergent visualization tools, techniques and best practices.
Prior to joining GfK she worked as a freelance consultant, designing and hand coding interactive data displays for technology companies as well as traditional market research firms.
Bitsy has six years of experience in the market research industry, and holds a B.F.A. in Industrial Design from the University of Wisconsin Stout.
Val joined NetApp in 1998 and leads the Strategic Planning Team within the office of the CTO. Working with customers, analysts, and alliance partners, Val focuses on next-generation research projects and is responsible for NetApp’s product vision.
As an early Big Data adopter, Val chairs the Enterprise Data Architecture Track of the Hadoop Summit. Val also introduced the first Cloud Standard to the industry as chairman of SNIA’s Cloud Storage Initiative, whose mission is to foster the growth and success of the cloud storage market. Previously, Val served as the vice-chair of SNIA’s Solid-State Storage Initiative.
Val has over 25 years of IT industry experience spanning NCR (AT&T) and Cognos (IBM), with 12 years in storage at EMC and... Read More.
Software Engineer at Cloudera, currently focused on the Apache HBase project.
Kevin Beyer is the Principal Architect at Platfora with 20 years of experience in building database systems. As a Research Staff Member at IBM, he created Jaql, a scripting language for large-scale, semi-structured data processing on Hadoop. Prior to the Jaql project, he added XML indexing support to IBM DB2. His Ph.D. dissertation at the University of Wisconsin focused on analytical query processing.
Before the Julia effort began, Jeff Bezanson worked as a software engineer at Interactive Supercomputing, which developed the Star-P parallel extension to MATLAB. At the company, Jeff was a principal developer of M#, an implementation of the MATLAB language running on .NET. He is now a third-year graduate student at MIT. Jeff received an A.B. in Computer Science from Harvard University in 2004, and has experience with applications of technical computing in medical imaging.
As Director of Program Management of the Database Systems Group at Microsoft, Shawn Bice has responsibility for Azure SQL Database, Big Data & SQL Server RDBMS, including guiding the vision, strategy, engineering and market delivery.
As part of his charter, Shawn is focused on delivering Microsoft’s core data platform solutions and services that allow IT Professionals, DBAs, and Developers to successfully Develop, Deploy, and Manage data applications across the SQL Azure platform and SQL Server workload.
Since joining Microsoft in 1997, Shawn has served in engineering leadership positions across the Server and Tools Business (STB) including Windows Server, System Center, SQL Server, Azure SQL Database and spent a tour in field sales, affording him the opportunity to... Read More.
Bill Schmarzo, author of the upcoming Big Data: Understanding How Data Powers Big Business, to be published by Wiley, is responsible for setting the strategy and defining the service line offerings and capabilities for the EMC Consulting Enterprise Information Management and Analytics service line. He’s written several white papers and is a frequent speaker on the use of big data and advanced analytics to power an organization’s key business initiatives.
Bill has more than two decades of experience in data warehousing, BI, and analytics applications. Bill authored the Business Benefits Analysis methodology that links an organization’s strategic business initiatives with their supporting data and analytic requirements, and co-authored with Ralph Kimball a series of articles on analytic applications. Bill has served on The Data... Read More.
David Blair is a Principal Software Engineer at Akamai where he works on the Akamai Data Platform. He has been working with scalable data processing applications for over 10 years both at Akamai and in his previous roles of Director of Product Architecture at MetraTech, Inc. and Director of Engineering at Torrent Systems. He has a PhD in Mathematics from Brandeis University and B.S. in Mathematics from University of California, Berkeley.
Ron Bodkin is the founder and CEO of Think Big Analytics, the first and leading provider of independent consulting and integration services specifically focused on big data solutions. Previously, Ron was vice president of engineering at Quantcast, where he led the data science and engineer teams that pioneered the use of Hadoop and NoSQL for batch and real-time decision making. Prior to that, Ron founded New Aspects, which provided enterprise consulting for aspect-oriented programming. Ron was also cofounder and CTO of B2B applications provider C-Bridge, where he headed a team of 900 people and led the company to a successful IPO. Ron graduated with honors from McGill University with a BS in math and computer science and holds a master’s degree in... Read More.
David is passionate about helping people make better decisions using data. He is SVP of insight for the EMI Music Group where he setup and now oversees the team that delivers regular, large scale research to EMI in 25 countries. The insight from more than 750,000 interviews in the last couple of years has reached the heart of decision-making at EMI and helps—alongside skills and judgement—with everything from signing new artists to product development and marketing plans for EMI’s biggest artists like The Beatles and Pink Floyd. He also runs insight at zeebox which is using rich consumer surveys and detailed user information to drive product development and product / relationship personalisation.
He has worked in London for New Philanthropy Capital... Read More.
I’ve been designing and building innovative information visualizations for 20 years, ranging from one of the first interactive 3D financial visualizations on the web in 1995 to visualizations embedded in financial data systems used every day by thousands of market professionals.
Alice Brennan is an award winning radio and television journalist with ten years experience at Australia’s public broadcaster, the ABC. She has hosted, produced and researched her own radio shows and news items on issues from indigenous housing to politics to drug taking. Alice has trained journalists from all over Asia and she’s freelanced for NPR, Deutsche Welle and the BBC. A graduate MA in political journalism at Columbia University, Alice has been working with The New York World, a digital government accountability website based out of the school. She’s enamored with the journalistic power of data and is scrambling madly to learn how to code!
Kurt Brown leads the Data Platform team at Netflix. Kurt’s group architects and manages the technical infrastructure underpinning the company’s analytics, which includes various big data technologies like Hadoop, Spark, and Presto, Netflix open sourced applications and services such as Genie and Lipstick, and traditional BI tools including Tableau and Redshift.
Ryan is a Distinguished Engineer with Cerner Corporation, one of the leading healthcare IT companies worldwide. He has built infrastructure for healthcare systems over the past decade, and currently is leading the design of Cerner’s big data infrastructure. Ryan also dabbles in writing, such as contributing a couple chapters to the book 97 Things Every Programmer Should Know.
Avi has led product, engineering, and data science teams at Etsy, Twitter and Dabble DB (which he co-founded and Twitter acquired). He’s known for his open source work on projects such as Seaside, Scalding, and Algebird. Avi currently works at Stripe.
Jim Caputo is the technical lead for Google’s BigQuery team, and heads the engineering efforts to externalize Google’s large scale data processing for developers and enterprise customers. Prior to his tenure at Google, Jim worked on product teams at Expedia and Microsoft.
Micheline Casey is the Chief Data Officer at the Federal Reserve Board. She is the first person to hold this newly created title at the Federal Reserve Board. In this role, she oversees the Office of the Chief Data Officer and chairs the Board Data Council, the Board’s data governance oversight group. She is responsible for the creation and execution of data strategy and data management in support of Board mission and operations, and in alignment with its strategic framework.
Prior to joining the Board of Governors of the Federal Reserve System she was principal at CDO, LLC, a boutique consultancy supporting the development of large-scale enterprise information management, data governance, and data security strategic plans and implementation efforts.
Prior to CDO,... Read More.
Scott Chastain, Engineering Manager, Information Management and Delivery, SAS
Scott empowers the SAS Americas sales and technical groups with the architecture, strategy and implementation of SAS’ business analytics infrastructure. He has direct responsibilities for information management, visualization and business intelligence.
Nagaraju Chayapathi is a Data Integration Architect at Paypal. He has 18+ years’ data management experience, creating and implementing data architectures, leading Integration Competency Centers (ICCs), and developing strategies for data integration, data governance, data profiling, data masking, metadata management, and other data related initiatives. At Paypal he architected a unified, scalable data acquisition and delivery platform feeding an enterprise data warehouse and a wide array of data marts in both real-time and batch. He also spearheaded a transition from several legacy technologies to Informatica for a single data integration and ETL platform. Prior to working at Paypal, Nagaraju held various BI and data warehouse manager and architect positions at industry leading companies including CSC, Cisco Systems, Perot Systems, Wells Fargo, and AT&T.... Read More.
Lynn Cherny has a Ph.D. from Stanford and an M.Phil. from Cambridge University. She has 19 years of experience managing and doing interface design and data analysis at companies including TiVo, Excite, Adobe, The MathWorks, and AT&T Labs. Her consulting work now focuses on data analysis and visualization.
Stephen Coller is a Senior Program officer in the Next Generation Models team. In this role Stephen sponsors current and emerging technologies and platforms that help children to attain college readiness and enable system transformation in the process.
Stephen joined the Foundation in July 2010 from Microsoft, where he worked as a Senior Director of Technology Policy and Strategy. In this role Stephen helped develop Microsoft’s vision for the connection between technology and core societal issues and convert that vision into a set of policy recommendations for senior government leaders and regulatory bodies around the world. In the U.S., Stephen served on Secretary Duncan’s National Education Technology Plan working group. Earlier in his career at Microsoft, he worked on virtual server, cloud platform, and application... Read More.
Rob Coneybeer is a veteran venture capitalist who approaches start-up investing from a product perspective. At Shasta Ventures, the Sand Hill Road firm he co-founded in 2004, Rob focuses on hardware and mobile startups. Rob is particularly interested in startups that have discovered creative new approaches to connecting the “real world” to the Internet, whether via smart phones, low-cost sensors or other innovative, new devices.
Rob started his career working in the Astro Space division of Martin Marietta, where he helped build the first EchoStar spacecraft. Some of Rob’s notable investments include smart-thermostat company Nest; Relay Rides, the leading peer-to-peer carsharing marketplace; and Mocana, which provides security for smart devices. Rob earned a master of science in mechanical engineering from the Georgia Institute of Technology... Read More.
Deborah M. Cooper possesses over 15 years of industry experience in successfully applying market strategy and customer analytics within financial services, insurance, and life sciences. At Fidelity Investments, she leveraged a customer-focused data architecture strategy to support product growth, advanced analytics to increase retirement season sales by 40%, and KPI development to align the highly-matrixed Retail organization to acquisition goals. With Liberty Mutual Group, she developed a competitive intelligence function and market-sizing strategy for a national and middle market client base.
Most recently, Ms. Cooper served as Vice President of Marketing Applications & Analytics for Putnam Investments. Prior to her work with financial services, she authored and presented research findings in molecular genetics and medical demand modeling. Ms. Cooper holds a Masters degree in... Read More.
Terence Craig is CEO and CTO of PatternBuilders, a big data analytics companies that produces advanced applications for financial services, retail and other data intensive industries.
Terence has an extensive background in building, implementing, and selling analytically-driven enterprise applications across such diverse domains as enterprise resource planning (ERP), retail sales channel optimization, professional services automation (PSA), and semi-conductor process control and analytics in both public and private companies. He has been part of the ERP/SCM industry as it has evolved, from the VAX and HP 3000 to its current heyday of client-server, GUIs, and relational databases and is looking forward to exploring what the next generation of solutions, powered by the Internet of Things and big... Read More.
Alistair Croll is an entrepreneur with a background in web performance, analytics, cloud computing, and business strategy. In 2001, he cofounded Coradiant (acquired by BMC in 2011) and has since helped launch Rednod, CloudOps, Bitcurrent, Year One Labs, and several other early-stage companies. He works with startups on business acceleration and advises a number of larger companies on innovation and technology. A sought-after public speaker on data-driven innovation and the impact of technology on society, Alistair has founded and run a variety of conferences, including Cloud Connect, Bitnorth, and the International Startup Festival, and is the chair of O’Reilly’s Strata + Hadoop World conference. He has written several books on technology and business, including the best-selling Lean Analytics. Alistair tries to mitigate his chronic ADD... Read More.
Doug Cutting is the chief architect at Cloudera and the founder of numerous successful open source projects, including Lucene, Nutch, Avro, and Hadoop. Doug joined Cloudera from Yahoo, where he was a key member of the team that built and deployed a production Hadoop storage-and-analysis cluster for mission-critical business analytics. Doug holds a bachelor’s degree from Stanford University and sits on the board of the Apache Software Foundation.
Raanan is responsible for Big Data Product Marketing at Splunk. He is focused on helping customers harness their big machine data to gain operational intelligence and driving an effective market-driven product strategy. Raanan has spent the past 10 years focused on data, information and distributed systems. Prior to Splunk, Raanan worked at Cloudera where he was a Hadoop sales engineer and before that, spent 9 years as an engineer and sales engineer at Oracle. Raanan is a certified Hadoop developer and administrator and holds a BS in Computer Science and MS in Management.
Jianyong Dai is a Apache Pig PMC member/committer and worked on Pig for almost 3 years at Yahoo and later at Hortonworks. I received my PhD in computer science specialize in computer security, data mining and distributed computing from University of Central Florida. I am interested in data science, large scale processing, Hadoop, Pig, HCatalog, Hive, and more.
Richard co-founded Pentaho in 2004 and is responsible for strategic initiatives, customer and partner relationships and leading product strategies including Big Data, customer adoption and Cloud Analytics. Richard has held key executive management positions in the business intelligence software market for over 20 years, starting his career at IBM. Richard was a Co-Founder at AppSource Corporation (acquired by Arbor Software which later merged into Hyperion Solutions) and Keyola (acquired by Lawson Software) and is an avid water skier.
Dan is the Managing Partner for Corporate Strategy, Innovation and Technology at Accenture and an Executive on Grounds at the McIntire School of Commerce of the University of Virginia.
Most recently, he has focused on the impact of cloud technologies on the firm’s businesses. Previously, he was the Managing Partner for the Communications Industry in Accenture’s Communications & High Tech Global Operating Group. In this role, he helped drive Accenture’s business in the wireline, wireless, and cable segments.
Mr. Elron is a member of the IEEE and ACM. He also serves as the adviser for the information technology industries for the World Economic Forum and as the Associate Director of the Center for the Management of Information Technology at the University of Virginia.... Read More.
Danielle is currently a Data Scientist at Nokia, and a doctoral student in Quantitative Psychology at the University of North Carolina at Chapel Hill. Danielle enjoys working to advance and apply statistical methods to address complex business questions, especially using random effects models and mixture models. Danielle is specifically interested in studying issues that arise when modeling developmental phenomenon over time and in the integration of different modeling frameworks. At Nokia, she teams with other data scientists in Nokia’s Insights group to deliver descriptive and predictive insights into Nokia’s new centralized data asset and into business analytic projects and to deliver visualizations of the data and their predictions.
Marc Demarest is CEO and a principal in Noumenal, Inc., an international management consulting firm based in the Pacific Northwest and the UK that provides a range of management and technical consulting services to high-tech, biotech, nanotech, and greentech firms. Widely known as an early proponent of data marting and tiered enterprise data warehousing models, Demarest is currently writing a book on nontechnical aspects of enterprise data warehousing.
Brian is a developer that runs marketing at GitHub, which often involves analyzing a lot of interesting data.
Michael is a serial entrepreneur who has built data platforms in life sciences, online retail, and digital media over the past decade. Michael began his career as a software engineer for Human Genome Project, and later founded CustomInk.com, an early pioneer in customized apparel. More recently, Michael conceived and led Dataspora, where he remains Chairman, helping deliver data science to telcos, insurance firms, and retail banks. Michael holds an A.B. from Harvard College and a Ph.D. in Bioinformatics from Boston University. He enjoys speaking and writing about big data and analytics.
Founder, CTO and SVP, Software Engineering
Kenneth Duda is a pioneer in high-performance networking software and lead architect of Arista Networks EOS, a stateful modular operating system for all Arista Networks products. He is also the co-author of network virtualization specifications including VXLAN with VMware and NVGRE with Microsoft. From 2005 to 2008, Ken was also the Acting President of Arista Networks.
Prior to joining Arista Networks, Ken was the CTO at There.com, where he played a lead role in designing a real-time 3-D distributed system that scaled to thousands of simultaneous users. Ken was also the first employee of Granite Systems and led the software development effort for the Catalyst 4000 product line after the acquisition by... Read More.
Kathleen P. Duff is an attorney licensed to practice in New York State. She holds an A.B. in History from Dartmouth College (‘02), cum laude, as well as a J.D. from Vanderbilt University Law School (’06). During law school, Ms. Duff was awarded the Vanderbilt Scholastic Excellence Award three separate times for earning the highest marks in each of Corporations & Business Entities, Professional Responsibility, and Legal Writing I. Ms. Duff began her career as an associate in the New York office of Skadden, Arps, Slate, Meagher & Flom LLP, where her practice focused on antitrust and commercial litigations. She also served as an associate at Richards Kibbe & Orbe LLP, where she worked on complex civil litigations. Ms. Duff joins the panel... Read More.
Ted Dunning has been involved with a number of startups—the latest is MapR Technologies, where he is chief application architect working on advanced Hadoop-related technologies. Ted is also a PMC member for the Apache Zookeeper and Mahout projects and contributed to the Mahout clustering, classification, and matrix decomposition algorithms. He was the chief architect behind the MusicMatch (now Yahoo Music) and Veoh recommendation systems and built fraud-detection systems for ID Analytics. Opinionated about software and data-mining and passionate about open source, he is an active participant of Hadoop and related communities and loves helping projects get going with new technologies.
An Apache Cassandra committer, Gary Dusbabek is a life-long programmer specializing in distributed systems. His past experience includes working with large-scale text and image indexes in the newspaper industry and high-volume advertisement booking software. He currently works on the Cloud Monitoring team at Rackspace.
Gabriel is a computer science student and works as an Software Architect in Mercadolibre (NASDAQ MELI) since 2007.
He is tasked with researching technologies and developing projects.
In the last year he has specialized in big data analysis, implementing Mercadolibre’s hadoop cluster.
Justin Erickson is a senior director of product management leading Cloudera’s platform team, which is responsible for the components in Cloudera Distribution including Hadoop (CDH) above storage. Prior to joining Cloudera, he led the high-availability and disaster-recovery areas of Microsoft SQL Server.
Tim Estes is the chairman, CEO, and founder of Digital Reasoning. Tim’s academic work at the University of Virginia focused in the areas of philosophy of language, mathematical logic, semiotics, epistemology, and phenomenology. It was that eclectic academic background, coupled with the belief that in the future all software would learn from data as a core capability, that gave rise to Digital Reasoning. Tim and his team work closely with leaders in government and industry to solve extraordinarily valuable and morally compelling problems in National Security, Finance, Legal, and Health Care by automating the understanding of unstructured data.
Robert Faludi is the Chief Innovator at Digi International. His job is forging strong connections with the maker community, uncovering new innovation methodologies, supporting outstanding new work, and creating prototypes to spur new product development. Faludi also works as a professor in the MFA program at the School of Visual Arts in Manhattan and in the Interactive Telecommunications graduate program at NYU. He specializes in behavioral interactions through physical computing and networked objects. Rob is the author of Building Wireless Sensor Networks, with ZigBee, XBee, Arduino and Processing published by O’Reilly Media, 2011. His work has appeared in The New York Times, Wired Magazine, Good Morning America, BBC World, the Chicago Museum of Science & Industry and MoMA among others.... Read More.
Lee Feinberg is the founder of DecisionViz, a company designed to helped organizations escape the legacy of reporting data by transforming complex data into simple pictures for making decisions. Industry leaders frequently ask Lee to address domestic and international audiences, most recently including O’Reilly Media Strata, Tableau Software European and North American Customer Conferences, CBS Interactive, and WARC. Lee is also the author of the BeyeNETWORK expert channel, HD Visualizations, Driving BI and Analytics. Lee most recently worked at Nokia where he introduced Tableau Software into the company and led the effort to launch Tableau across the enterprise. He served on Tableau’s customer advisory board and is now a Tableau Partner, as well as the founder of the NJ and NY Tableau User... Read More.
Michael P. Flowers is Analytics Director for the Mayor’s Office of Policy and Strategic Planning and Director of the Financial Crime Task Force of the City of New York. Prior to joining the Bloomberg Administration, Mr. Flowers was Counsel to the U.S. Senate Permanent Subcommittee on Investigations for the 110th and 111th Congress, where he led bipartisan investigations into off-shore tax haven abuses; failures in the mortgage-backed securitization market by U.S. investment and commercial banks and government agencies; and deceptive financial transactions by the North Korean government. From March 2005 to December 2006, Mr. Flowers was Deputy Director of DOJ’s Regime Crimes Liaison’s Office in Baghdad, Iraq, supporting the investigations and trials of Saddam Hussein and other high-ranking members of his regime. From 2003 to... Read More.
Kevin Foster is a Big Data Solution Architect at IBM. His focus is in helping IBM customers and partners to survive and prosper from the large amounts of “big data” generated by mobile phones, smart grids, city traffic, hospitals and other increasingly computerized aspects of our modern lives. Kevin has a Bachelors degree in Mathematics from California State University Stanislaus and a Masters degree in Computer Science from Stanford University.
Currently the Director of Infrastructure Engineering and Lead Architect at Rent the Runway. Previously a Vice President at Goldman Sachs. Apache ZooKeeper committer and PMC member.
Steve Francia leads the developer experience of MongoDB at 10gen. This includes drivers, web, technical writing, integration and evangelism. Steve brings to this role his experience as VP of engineering at OpenSky where he build the worlds first e-commerce site powered by MongoDB and one of the first PHP sites backed by MongoDB. Steve has been an engineer, entrepreneur and executive since 1995 when he was responsible for one of the first ecommerce sites while working for American Telecom.
Steve loves building things, teaching and speaking. He has been involved in open source for over 15 years and prefers to code in Vim. He is the author and contributing author of two O’Reilly books on databases (and development). He has spoken at dozens of... Read More.
Hjalmar is a serial entrepreneur, founder of four startups in the gaming, mobile and web sectors since 1996. His company, DataMarket, provides information companies with tools to effectively publish their data and reach new audiences. Their data portal, DataMarket.com, may be the largest collection of open statistics and numerical data available online. DataMarket is based largely on Hjalmar’s vision of the need for a global exchange for such data.
Michael, co-founder of farsite, directs farsite’s overall strategy and manages business development.
Michael has a bachelor’s degree in classics from the University of Michigan and a master’s degree in history from the University of Minnesota. Michael also has an MBA from Fisher College of Business at The Ohio State University. He has worked as a White House Intern in the Office of Public Liaison Business Affairs Desk, and also for SmartBrief, a Washington-DC based start-up that publishes daily B2B news briefs.
Anthony Goldbloom is co-founder and CEO of Kaggle. In 2011 & 2012, Forbes Magazine named Anthony as one of the 30 under 30 in technology, in 2013 the MIT Tech Review named him one of top 35 innovators under the age of 35 and the University of Melbourne awarded him an Alumni of Distinction Award. He holds a first call honors degree in Econometrics from the University of Melbourne. Anthony has published in the The Economist and the Harvard Business Review.
As Chicago’s first Chief Data Officer, Brett Goldstein’s role is to lead the city’s approach to and use of data to help improve the way the government works for its residents. Thus, the responsibilities of the Chief Data Officer are threefold: promoting transparency through open data, constructing a comprehensive strategy for citywide data usage and storage, and facilitating data-driven decision-making through data analytics, including groundbreaking predictive analytics.
Before coming to City Hall as Chief Data Officer, Brett Goldstein founded and directed the Chicago Police Department’s Predictive Analytics Group, which aims to predict when and where crime will happen. Goldstein is a former Commander in the Chicago Police Department. Previously, Goldstein was an early employee with OpenTable. He earned his Bachelor’s degree from Connecticut College, his... Read More.
Daniel L. Goroff is vice president and program director at the Alfred P. Sloan Foundation, a grant-making philanthropy that supports breakthroughs in science, technology, and economics. He is professor emeritus of mathematics and economics at Claremont’s Harvey Mudd College, where he previously served as vice president for academic affairs and dean of the faculty.
Goroff earned his B.A.-M.A. degree in mathematics Summa Cum Laude at Harvard as a Borden Scholar, an M.Phil. in economics at Cambridge University as a Churchill Scholar, a masters in mathematical finance at Boston University, and a Ph.D. in mathematics at Princeton University as a Danforth Fellow. He also completed the Executive Program for Nonprofit Leaders at Stanford’s Business School.
Daniel Goroff’s first faculty appointment was at Harvard University in 1983.... Read More.
Sean Gourley, Quid co-founder and CTO, did research into the
mathematics of war for his PhD thesis at Balliol College, Oxford. His findings appeared as the featured article in “Nature” (December 2009) and were the subject of a popular TED talk (2009). His work on statistical analysis, probability, and algorithm development applied to complex systems and large datasets inspired the creation of Quid. Sean is a Rhodes Scholar PhD in Physics (Complexity) from the University of Oxford; his is undergraduate degree in Physics is from the University of Canterbury, Christchurch, New Zealand.
Ilya Grigorik is a web peformance engineer and advocate at Google, an open-source evangelist, an analytics geek, and a proverbial early adopter of all things digital. Prior to focusing on web performance Ilya was the founder and CTO of PostRank, a social analytics company which became the core of social analytics within Google Analytics.
Mr. Groom is responsible for the day-to-day operations of Kognitio including sales, presales, Kognitio Cloud operations and the Kognitio Analytics Center of Excellence. Paul has thirty years experience working with and implementing very large databases (VLDBs); with in-depth knowledge of database and data warehouse technologies and the surrounding software ecosystem. This practical, customer facing knowledge, is applied to the strategic direction of Kognitio, keeping a lead in the ever changing world of IT and the exciting new wave of Big Data.
Paul began his career on a database research project at the UK national mapping agency, Ordnance Survey, followed by five years as a Lead Systems Engineer providing pre- and post-sales technical support in the UK and Europe for the then-leading vendors of database machines:... Read More.
Robert Grossman (@bobgrossman) is the Founder and a Partner of Open Data Group, which specializes in building predictive models over big data. He is a Core Faculty and Senior Fellow at the Institute for Genomics and Systems Biology (IGSB) and the Computation Institute (CI) at the University of Chicago. He has led the development of new open source software tools for analyzing big data, cloud computing, data mining, distributed computing and high performance networking. Prior to starting Open Data Group, he founded Magnify, Inc. in 1996, which provides data mining solutions to the insurance industry. Grossman was Magnify’s CEO until 2001 and its Chairman until it was sold to ChoicePoint in 2005. He blogs about big data, data science, and... Read More.
Kristian Hammond is CTO of Narrative Science, a Chicago start up focused on the generation of narratives from data. He is also a professor of Computer Science and Journalism at Northwestern University and a researcher in the areas of human-machine interaction, context-driven information systems and artificial intelligence. In 1998, Kris founded Northwestern University’s Intelligent Information Laboratory (InfoLab) where his team is creating technology that bridges the gap between people and the information they need. From 2000 to 2001, Dr. Hammond also enjoyed a run as the weekly technology correspondent for WTTW’s Chicago Tomorrow. Kris received his PhD from Yale University.
Roy Hyunjin Han is a Computational Engineer at CrossCompute, working on mathematical models to help cities run effectively. Previously, he worked with the Modi Research Group at the Earth Institute of Columbia University to develop an electricity infrastructure planning system currently used by countries in Africa and a remote sensing system that finds buildings in satellite images using convolutional neural networks. He holds a M.S. in machine learning and a B.S. in mathematics and neuroscience.
Joseph M. Hellerstein is the Jim Gray Chair of Computer Science at UC Berkeley and cofounder and CSO at Trifacta. Joe’s work focuses on data-centric systems and the way they drive computing. He is an ACM fellow, an Alfred P. Sloan fellow, and the recipient of three ACM-SIGMOD Test of Time awards for his research. He has been listed by Fortune among the 50 smartest people in technology, and MIT Technology Review included his work on their TR10 list of the 10 technologies most likely to change our world.
Bjoern is the founder of the Startup Genome and Startup Compass. In early 2011 he & his team set out to decrease the massive failure rate of startups and SMEs to accelerate the pace of innovation by transforming the business world from gut driven to data driven. Before Bjoern founded and led 4 for and non profit ventures in Germany, Bangladesh & the US. He also worked as an executive in Russia for one year for a mid sized marketing company. His personal passion or “leitmotif” is: unleashing human potential. His higher purpose is to strive to maximize the capitalization of every single individual. In his free time he speaks at conferences to pass on his lessons learned.
Rich Hickey, the author of the Clojure programming language and designer of the Datomic database system, is a software developer with over 20 years of experience in various domains. Rich has worked on scheduling systems, broadcast automation, audio analysis and fingerprinting, database design, yield management, exit poll systems, and machine listening, in a variety of languages.
Steven Hillion has been leading large engineering and analytics projects for fifteen years. Before joining Alpine Data Labs, he founded the analytics group at Greenplum, leading a team of data scientists and also designing and developing new open-source and enterprise analytics software. Before that, he was Vice President of Engineering at M-Factor, Inc. (acquired by DemandTec) where he built analytical applications that became a global standard for demand modeling. Earlier, at Kana Communications, Steven led the engineering group during the two largest releases of its flagship product. At Scopus Technology (later Siebel Systems) he co-founded development groups for finance, telecom and other verticals. He received his Ph.D. in mathematics from the University of California, Berkeley, and was a King Charles I Scholar at Oxford University.... Read More.
Sheridan has more than 15 years experience in Business Intelligence, Analytics, Web, Product Management, Business Strategy and Organization Effectiveness. He is currently VP of Data Products at Auction.com, the leading online real estate auctioneers in the United States.
He has also held positions at two leading online ganmes companies, Kabam and Playfirst, where he has built out Analytics and Big Data groups from inception. Earlier on in his career, Sheridan worked at Procter and Gamble in the areas of Decision Support and Executive Information Systems. He also worked as a Managing Consultant at Towers Perrin where he led many business strategy and organizational effectiveness engagements for large companies.
Sheridan holds a Bachelor of Arts degree in mathematics from Cambridge University and an MBA from... Read More.
Erich Hochmuth is lead of the Big Data and Analytics IT team that supports R&D at Monsanto. For the past three years Erich has been working on a cross functional team to build a heterogeneous computing stack to meet the compute and analytic needs of Monsanto’s scientific community.
Born in England, Nigel Holmes studied illustration at the Royal College of Art in London and then freelanced for magazines and newspapers for 12 years in London before coming to New York in 1978 to work for Time Magazine. He became graphics director and stayed there for 16 years.
At Time, his pictorial explanations of complex subjects gained him many imitators and a few academic enemies who thought he was trivializing information. But he remains committed to the power of pictures and humor to help readers understand otherwise abstract numbers and difficult scientific concepts.
Since 1994 he has run his own business, Explanation Graphics, explaining all sorts of things for a variety of clients. These have included American Express, The Smithsonian Institution and United Healthcare.... Read More.
Jonathan is a Software Engineer with Cloudera, currently focused on the Apache HBase project. He is an Apache HBase committer and PMC member, as well as a committer on the Apache Sqoop project, and a committer and founder of the Apache Flume (incubating) project. Jonathan has an M.S. in Computer Science from University of Washington and also has an M.S. and a B.S. in Electrical and Computer Engineering from Carnegie Mellon University.
Dr. Stan Humphries is the Chief Economist of Zillow Inc. (NASDAQ: Z), the leading real estate information marketplace. Stan joined the company as one of its earliest employees in 2005 and created the Zestimate and its first algorithm. Since that time, Stan has built out the industry-leading economics and analytic team at Zillow. Prior to joining Zillow, Stan spent five years at Expedia where he ran the advanced analytics team. Before Expedia, Stan served as a researcher and faculty member at the University of Virginia, and was previously a Presidential Management Fellow where he served at NASA, the Office of Science and Technology Policy in the Executive Office of the President, and the Technology Administration within the Department of Commerce. Stan has also... Read More.
Noah Iliinsky is a senior UX architect with Amazon Web Services. Noah strongly believes in the power of intentionally crafted communication and has spent the last decade researching, writing, and speaking about best practices for designing visualizations, informed by his graduate work in user experience and interaction design. Most recently, he worked on Amazon QuickSight. Noah is the coauthor of Designing Data Visualizations and technical editor of, and a contributor to, Beautiful Visualization, both published by O’Reilly Media. He is also a frequent speaker for both industry and academia. Noah holds a master’s degree in technical communication from the University of Washington and a bachelor’s degree in physics from Reed College.
Nilesh Jain is Sr. Research Scientist with the Cluster Computing Architecture team in Intel Labs. His current research focus is on emerging frameworks for large-scale machine learning and big data analytics. His other research interests include systems architectures and technologies that improve scaling, performance, and power consumption of distributed parallel computing. Before joining Intel Labs in 2007, Nilesh spent 11 years working on various telecom and I/O technologies within Intel product groups and at a premier telecom research organization (C-DOT) in India. Nilesh was an open source contributor to Linux Standard Base (LSB).
Brian Jepson is an editor for O’Reilly Media; he covers a number of areas, including Arduino, wireless sensor networks, mobile devices, as well as some Microsoft and Apple topics.
He likes to hack on gadgets such as Arduino and the Netduino in his spare time, and he is also the co-founder and co-host of Providence Geeks, a monthly gathering in Providence, RI.
Annika is a seasoned leader of analytics initiatives, and came from Pivotal where she built the “Data Science Dream Team” – an industry-leading group of Data Scientists – representing a rich combination of vertical domain and horizontal analytical expertise – who are facilitating Data Science-driven transformations for Pivotal customers.
Before Pivotal, Annika worked for Yahoo! for six years in data leadership roles: She led Audience and International data solutions for Yahoo!’s central data organization, Strategic Data Solutions, and led Insights Services – comprised of a team of 40 researchers covering Web analytics, satisfaction/brand health metrics, and audience/ad measurement. Annika is a recognized evangelist for “applied data” and well known for her acute focus on action-enablement.
David Jonker leads Big Data product marketing at SAP, and is helping to drive Big Data go-to-market initiatives across SAP’s data management and analytics platforms.
Previously, David Jonker lead product marketing teams for Sybase’s data management & analytics product lines offering solutions for Big Data, data warehousing, cloud, mobility, and embedded systems. David’s career also includes more than 14 years in software engineering and product management roles.
David has a degree in Systems Design engineering from the University of Waterloo.
PV Kannan co-founded 247 in April 2000. Under PV’s leadership, 247 has become the leader in intuitive customer service software and services. The company was founded based on PV’s vision to improve the consumer’s experience by leveraging big data, intuitive design and a learning platform to simplify customer service and make it easier for consumers to get things done.
PV has been featured in various publications as a thought leader in the field of global customer service, including Tom Friedman’s recent book “That Used to Be Us,” and in his earlier work “The World is Flat.” He is also a regular speaker at industry events and has been a panelist in Forbes Conference 2006 and Academy of 2006 Annual... Read More.
Gary Kearns is Group Executive, Information Services for MasterCard Advisors. He has global responsibility for leading efforts to develop differentiated data assets, drive growth within core target areas and identify new revenue opportunities in emerging segments at MasterCard. MasterCard data assets encompass 1.7 billion cards representing more than 30 million merchants and 65 billion transactions annually. Prior to joining MasterCard, Gary spent five years as president, Decision Analytics North America at Experian. In that capacity, he was responsible for leading B2B software and predictive analytics businesses serving financial institutions, government agencies, asset managers, retailers, utilities and telecom companies. He also was managing director and global leader for Risk Solutions at Standard & Poor’s. Prior to that, Gary spent twenty years in the banking industry. Gary... Read More.
Russ Kennedy brings more than 20 years experience in the storage industry to Cleversafe as the company’s Vice President of Product Strategy, Marketing and Customer Solutions. Having rolled up his sleeves working on automated tape libraries, Russ is still attracted to the technological challenges that have shaped the industry and particularly to the innovative approach that Cleversafe delivers to storage.
Russ joined the company initially in 2007 and left in 2009, staying on in an advisory role. In 2011, Russ rejoined the company seeing a clear opportunity to solve the storage needs surrounding the exponential growth of big data and the unique impact that Cleversafe delivers over traditional systems.
Previously, Russ served as the Vice President of Competitive Intelligence at CA Technologies,... Read More.
Paul Kent, Vice President of Big Data Initiatives, SAS
Paul spends his time discussing, evangelizing and developing software at the confluence of big data and high-performance computing. Kent was previously Vice President of the Platform R&D division at SAS and led groups responsible for the SAS foundation and midtier technologies, teams that develop, maintain and test Base SAS®, as well as related data access, storage, management, presentation, connectivity and middleware software products.
Greg is the Business Development Manager for Big Data at Intel Corporation. He has over 10 years of in-depth experience in business intelligence and data warehousing at Oracle and Business Objects (now SAP). A former U.S. Army officer, Greg holds a master’s degree in Information Systems Management from Carnegie Mellon University.
Siraj founded The Climate Corporation (formerly WeatherBill) in 2006, having previously worked at Google in multiple technical lead roles, from the company’s distributed computing infrastructure to the high-profile Google Book Search project and other offline content search initiatives. Siraj obtained an M.S. degree in Computer Science from Stanford University, and a B.A. (Hons.) in Computer Science from the University of Cambridge, England. While at Stanford, he was also a lead software architect for the popular Folding@Home distributed computing project.
Mr. Irfan Khan joined Sybase in 1992. As Senior Vice President and Chief Technology Officer, Mr. Khan oversees all technology offices in each of Sybase’s business units, ensuring market needs and customer aspirations are reflected within the company’s innovation and product development. Mr. Khan is also responsible for setting the architecture and technology direction for the worldwide technical sales organization.
As part of CTO responsibilities, Mr. Khan oversees the technology evangelism efforts for Sybase, responsible for seeding new innovation and driving new technologies within Sybase’s most strategic accounts. Mr. Khan also is in charge of the Sybase Developer Network.
In 2010, Mr. Khan received the InfoWorld CTO Top 25 Award and was named to the International Advisory Board of Cloud Expo.
Amandeep Khurana is a solutions architect at Cloudera, where he’s involved in the entire lifecycle of Hadoop adoption for customers from use-case discovery to taking systems to production. Amandeep is also a coauthor of HBase In Action, a book geared toward building applications using HBase. Prior to Cloudera, Amandeep was at Amazon Web Services, where he was a part of the Elastic MapReduce team, and built the first version of EMR’s HBase offering.
Aaron is the Founder and CTO of WibiData, Inc., a software company that engineers solutions for the large-scale user-centric data challenges that face today’s enterprises. He is a committer on the Apache Hadoop project and has been working with Hadoop since 2007. Aaron previously worked at Cloudera, a company which provides an enterprise platform, support and services built around Hadoop. Aaron founded the open source Apache Sqoop data import tool and Apache MRUnit Hadoop testing library projects. Aaron holds a B.S. in Computer Science from Cornell University and a M.S. in Computer Science from the University of Washington.
Robert Kirkpatrick is Director of the Global Pulse initiative of the United Nations Secretary-General, which aims to harness big data for a real-time understanding of human well-being.
Robert’s activities in government, academia, UN agencies, NGOs and the private sector have focused on developing innovative solutions at the intersection of technology, policy and social change. Robert has more than 15 years of experience in the design and use of advanced technology tools in business, public health, disaster relief, security coordination, citizen journalism, telemedicine, crisis monitoring, conflict mediation, and peace building activities. His work has focused on strengthening public policy, enhancing crisis resilience and catalyzing organizational change. Robert advocates open data, open standards, open source software, and participatory development.
He co-founded and led software development for two... Read More.
Edmund Kohlwey is a developer and data scientist at Booz Allen Hamilton. For the last three years, he has helped government clients adopt and develop their big data capabilities across many different problem domains.
Marcel Kornacker is a tech lead at Cloudera and the architect of Apache Impala (incubating). Marcel has held engineering jobs at a few database-related startup companies and at Google, where he worked on several ad-serving and storage infrastructure projects. His last engagement was as the tech lead for the distributed query engine component of Google’s F1 project. Marcel holds a PhD in databases from UC Berkeley.
Yekesa Kosuru is a Distinguished Architect at Nokia. He leads the Data & Analytics engineering team. The team is responsible for building and managing the NoSQL platform and the Big Data Analytics Platform. The NoSQL platform executes hundreds of millions of key-value queries a day and the Big Data Analytics platform hosts multiple petabyte and executes over 10,000 jobs a day, with more than 600 users.
Prior to Nokia, Yekesa worked at Oracle, where he gained deep expertise in relational databases and OLAP technology.
Mr. Krishnan is a recognized expert worldwide in the strategy, architecture and implementation of high performance data warehousing and big data solutions. He is a visionary data warehouse thought leader, ranked as one of the top data warehouse consultants in the world, and an independent analyst, writing and speaking at industry leading conferences, user groups and trade publications. He has authored eBooks, over 100 plus articles, viewpoints and case studies in Business Intelligence, Data Warehousing and Data Warehouse Appliances and Architectures.
A recognized authority on Unstructured data integration, text mining and text analytics. Along with Bill Inmon, he is promoting the next generation of data warehousing and business intelligence around BIG Data,Hadoop, NoSQL, Unstructured Data Integration and Social Intelligence.
Krish presents and speaks at... Read More.
Joe Lamantia is currently the User Experience Lead and Architect for Oracle’s Discovery products. He has spent more than 15 years at the forefront of design and technology, working in Europe, the U.S. and Asia as an entrepreneur, management and strategy consultant, and design leader for organizations ranging from Fortune 100 companies to boutique startups. His passion is understanding and bettering how people engage and interact with natural and human-created experiences, systems and services.
Joe builds products and services, design and development teams, and research programs that address emerging and rapidly evolving interaction spaces through deep insight and innovation; speaks frequently for the international design and technology community on leading edge practices; publishes extensively in professional design and technology journals; and creates and shares original... Read More.
Robert (a/k/a r0ml) Lefkowitz is a frequent speaker on the intersection of software and literacy. He has previously been a CTO at a math education startup, and held senior technology positions in the telecommunications and financial industries. He is a Distinguished Engineer of the ACM.
Jonathan has been working in software development for more than 10 years. In the last 3 years he has developed and contributed to several new and existing open source projects, including Jedis, which is being used actively by VMWare, SpringSource, etc.. Today, Jonathan is the Head of Research and Development and is tasked with researching new technologies and opening up MercadoLibre: open source, and open to the community.
Jerome Levadoux is Senior Vice President, Solutions and Alliances for Autonomy, an HP Company. Prior to joining Autonomy, Jerome held positions in the areas of Corporate Strategy and Product Management at SAP, and most recently served as SVP, IT Management line of Business at SAP Labs.
Jerome holds a BS in Engineering from Ecole Centrale Paris, an MS in Operations Research from UC Berkeley, and an MBA from Harvard Business School.
Todd holds a Sc.B in Computer Science from Brown University, where he completed an honors thesis developing a new collaborative filtering algorithm for the Netflix Prize Competition. Todd interned at Google developing machine learning methods to detect credit card fraud on AdWords and Google Checkout. Currently Todd works at Cloudera, Inc on bringing Map/Reduce technology to enterprises.
Mary Ludloff is Vice President of Marketing at PatternBuilders, a big data analytics solution provider. Through her work at Brio Technology (now Hyperion) and NONSTOP Solutions (now Manhattan Associates), Mary has a deep understanding of supply chain management issues as well as the use of business intelligence tools in data warehousing and analytic application efforts. Mary is co-author of Informix Guide to SQL: Tutorial and Privacy and Big Data, a frequent white paper contributor, an editor of Object Request Broker (ORB) programmer reference manuals and tutorials, a speaker, and a blogger on diverse topics such as big data and analytics from a technology and business perspective, data security, data privacy, and marketing in the digital age.
Kevin visualizes data and makes statistical interfaces on the web.
Before Clojure, he wrote machine learning and analytics tools in R and Scala.
In 2010 he wrote a thesis on protein structure, for which Reed College inexplicably awarded him a physics degree.
Kevin lives in Portland, Oregon, and spends as much time rock climbing as he does in the REPL.
Mark Madsen is a research analyst at Third Nature, where he advises companies on data strategy and technology planning. Mark has designed analysis, data collection, and data management infrastructure for companies worldwide. He focuses on two types of work: the business applications of data and guiding the construction of data infrastructure. As a result, Mark does as much information strategy and IT architecture work as he does performance management and analytics.
Michael is a Developer Programs Engineer supporting developers who work with Google’s Cloud and Data platforms. With many years of experience working for research and non-profit organizations, he is interested in making data analysis on large scale computing infrastructure more accessible and affordable. Michael has written for tech blog ProgrammableWeb.com, has spent time in rural Uganda researching mobile phone use, and has a Masters degree in Information Management and Systems from UC Berkeley’s School of Information.
Steve Mardenfeld is a Data Engineer for the Big Data Team at Etsy, where he uses Hadoop and other tools to process and analyze large data sets. Previously to Etsy, he worked as a Senior Data Analyst for a criminal justice agency, where he focused on utilizing data to help inform criminal justice policy issues.
James Markarian serves as executive vice president and chief technology officer at Informatica with responsibility for the strategic direction of Informatica products, including the delivery of the Informatica Platform. He also runs the corporate development group, including acquisitions. Previously, Markarian held the role of general manager of the Data Integration Business, where he led the delivery of both the flagship PowerCenter product and the Informatica Platform. Prior to joining Informatica in 1998, Markarian spent 10 years at Oracle Corporation, where he was architect for the Oracle Tools Division, responsible for cross-product architectural planning and integration decisions. He also served on Oracle’s Architecture Review Board. Markarian earned a bachelor’s degree in Computer Science and Economics and a master’s degree in Economics from Boston University.
Taylor Martin received her Ph. D. in educational psychology from Stanford. She also holds an M.S. in cognitive psychology, and a B.A. in linguistics. She has worked in research and development in curriculum and design of instructional systems on projects such as the Adventures of Jasper Woodbury and the Algebra Project. She has also worked as an elementary school teacher,particularly in the area of mathematics. She was in the Department of Curriculum and Instruction and is affiliate faculty in Developmental Psychology and the Learning Technology Center at the University of Texas, Austin. She collaborates extensively with partners in the College of Engineering, the Physics Department, and the Texas Advanced Computing Center at the University of Texas, Austin, in Computer Science and Learning Science at the... Read More.
George Mathew oversees product development, strategy and marketing for Alteryx. He brings over 16 years of executive leadership experience in enterprise technology, software-as-a-service, and e-commerce. Previously, George was the group vice-president & general manager for Business Intelligence (BI) at SAP BusinessObjects, and before that managed SAP’s portfolio strategy in the Office of the CEO. Prior to SAP, George was an early member of salesforce.com. He was also a founding member of Fort Point Partners, a management consultancy that delivered large-scale e-commerce systems.
Mike has the responsibility of outbound marketing for the Greenplum product portfolio, a comprehensive platform that is driving the future of Big Data Analytics. Previous to Greenplum, Mike was the Senior Director of Product Management for ParaScale, a parallel distributed file system company now owned by Hitachi Data Systems. Prior toParaScale, Mike held product management roles at EMC Rainfinity and McDATA.
Q Ethan McCallum is a consultant, writer, and technology enthusiast, though perhaps not in that order. Most recently he was seen working on the upcoming book Bad Data Handbook (O’Reilly).
Ryan’s role with farsite is to take the totality of data science and statistics and guide our analysts to create insight for your high-consequence decisions. Part of this job requires in-depth expertise, both to know how to produce results as well as to know what is possible. The other part is listening to you to understand how our analysis will have the most impact.
Originally in high-performance computation for nuclear engineering application, he earned his undergraduate degree and PhD from the University of Michigan. Due to the stochastic nature of many nuclear processes, Ryan naturally began working in statistics, at first in Monte Carlo simulations. While at Los Alamos National Laboratory and as a faculty member at Texas A&M University, Ryan became a recognized expert... Read More.
Richard McDougall is the Application Infrastructure CTO and Principal Engineer in the Office of the CTO at VMware. He is responsible for driving advanced development and strategy for VMware’s application platform architecture – including the performance and integration of applications, runtimes, middleware, and application encapsulation technologies.
Richard’s is known as an expert in the areas of performance measurement and optimization, and in application deployment architectures.
Before the CTO office, as the Chief Performance architect Richard drove the performance strategy and initiatives to enable virtualization of high-end mission critical applications on VMware products.
Prior to joining VMware, Richard was a Distinguished Engineer at Sun Microsystems. During his 14 years at Sun, he was responsible for driving high performance and scalability initiatives for Solaris... Read More.
Susan E. McGregor is an Assistant Professor with the Tow Center for Digital Journalism at Columbia Journalism School, where she specializes in data journalism and information visualization. She joined Columbia in 2011 following four years as the Senior Programmer on the Online News Graphics team at the Wall Street Journal, where she was a recipient of the 2010 Gerald Loeb Award in the “Online Enterprise” category as part of the team that developed the Journal’s “What They Know” series. She is also the co-author of a chapter on data journalism in the upcoming collection “Computation for Humanity: Information Technology to Advance Society,” due out in 2012. She holds a B.A. from Harvard in Interactive Information Design and an M.A. from NYU in Educational Technology.... Read More.
Building analytics libraries and research tools for quantitative finance and other fields. Actively involved in data analysis and statistics applications in the scientific Python community. Author of pandas library, contributor to statsmodels. Upcoming author of “Python for Data Analysis” from O’Reilly Media. CEO of Lambda Foundry, Inc.
Will is a software engineer at Cloudera.
Roberto Medri leads the Business Analytics team at Etsy, the world’s leading marketplace for unique goods.
At Etsy, Roberto works closely with product, marketing, and engineering to scout, build, instrument and improve Etsy’s product portfolio, spanning from revenue insights and data patterns to financial modeling to A/B experiments.
Roberto joined Etsy in 2010 as an intern while at the Wharton School, from which he graduated in 2011 as a Palmer Scholar. Before that, he worked as a consultant for Bain & Company in Italy, Switzerland, and Ukraine.
Roberto was born and raised in Lake Como, Italy.
Who am I?
In short, Nilofer Merchant is the (female) James Bond for Innovation.
As someone who has grown businesses — from Fortune 500s and silicon valley web startups — for 20 years, Nilofer is like a secret agent in knowing how to piece together the parts that matter (frameworks, strategies, and cultural values) — to get what everyone needs… results.
Rob Metcalf serves as President and COO of Digital Reasoning and is responsible for the business and sales strategy of the company. Rob Metcalf brings over a decade of experience in technology and information services businesses around the world. Prior to joining Digital Reasoning, Rob helped build software and information services businesses for LexisNexis in Mexico City, Toronto and Raleigh. As Vice President of Operations for Global Practice Management, Rob oversaw numerous acquisitions and key investments at the forefront of the company’s solution strategy. Prior to LexisNexis, Rob worked in Boston and San Francisco with The Parthenon Group, a strategy consultancy. Rob earned his MBA from Harvard Business School and his AB, Summa Cum Laude, from Princeton University.
Prior to joining the Laura and John Arnold Foundation, Anne served as New Jersey’s Attorney General where she headed the 9,000-person Department of Law and Public Safety. As New Jersey’s chief law enforcement officer, she oversaw 21 prosecutors and approximately 30,000 law enforcement officers. Anne implemented a statewide program to improve public safety through prevention of crime, law enforcement reform, and re-entry initiatives. Anne also served as a member of the United States’ Attorney General’s Executive Working Group on Criminal Justice and as a co-chair of the National Association of Attorneys General Criminal Law Committee. Prior to becoming Attorney General, Anne served as First Assistant Attorney General and, before that, was Counsel to a United States Senator. Anne also previously served as a federal prosecutor... Read More.
Dr. Donald Miner serves as a Solutions Architect at EMC Greenplum, advising and helping customers implement and use Greenplum’s big data systems. Prior to working with Greenplum, Dr. Miner architected several large-scale and mission-critical Hadoop deployments in the U.S. Intelligence Community. He is the author of the upcoming book “MapReduce Design Patterns”, which will be published by O’Reilly in the Fall of 2012. He is also involved in teaching, having previously instructed industry classes on Hadoop and a variety of artificial intelligence courses at the University of Maryland, BC. Dr. Miner received his PhD from the University of Maryland, BC in Computer Science, where he focused on Machine Learning and Multi-Agent Systems in his dissertation.
Romy is the Director of Analytics at Visual.ly where she runs the analytics and operations.
Justin is a member of the Entities team at Facebook where he helps curate and build from their rich structured object and social graphs, with a focus on location. Before joining Facebook, Justin ran the Data team at foursquare. In addition to building their core data-driven products Explore and Radar, he built a team from the ground up that consisted of Engineers and Data Scientists to solve large scale data problems as foursquare’s dataset grew from half a million check-ins to over 1.5 billion. Before that, Justin worked at a hedge fund as a quantitative analyst, building custom portfolios for their asset management division and doing modeling and analysis for their risk team, specializing in high-frequency, derivatives, and commodities trading. Prior to that, he worked... Read More.
Prasad is a Software Engineer at Cloudera. He is also a committer on Apache Flume.
Sharmila has spent 18+ years building game-changing software companies in a variety of markets. She has been EVP & CMO at numerous software companies, including Netscape, Kiva Software, AOL, Opsware, and Aster Data. She drove the creation of several multi-billion dollar market categories, including application servers, data center automation and big data analytics. She is on the board of Hadapt and Lattice Engines, advisor to numerous companies, large and small, and an active investor in early stage companies.
Arun C Murthy is Founder and Architect at Hortonworks, a software company focussed on focused on accelerating the development and adoption of Apache Hadoop. He also leads next generation MapReduce project in Apache Hadoop and has contributed full-time to Hadoop for 6 years now. In the community, Arun is the VP, Apache Hadoop, at the ASF i.e. the Chair of the Hadoop PMC. Prior to co-founding Hortonworks, Arun was responsible for all Hadoop MapReduce code and configuration deployed across the 42,000+ servers at Yahoo!. In essence, he was responsible for running Apache Hadoop’s MapReduce as a service for Yahoo. He jointly holds the current world sorting record using Apache Hadoop.
Aaron T. Myers is a Software Engineer at Cloudera and an Apache Hadoop Committer. Aaron’s work is primarily focused on HDFS. Prior to joining Cloudera, Aaron was a Software Engineer and VP of Engineering at Amie Street, where he worked on all components of the software stack, including operations, infrastructure, and customer-facing feature development. Aaron holds both an Sc.B. and Sc.M. in Computer Science from Brown University.
Thejas Nair is a software engineer working on Apache pig, hcatalog and hive projects at Hortoworks. He is a committer and PMC member of Apache Pig project. Previously, he worked at Yahoo for 9 years, developing solutions for large scale distributed data processing.
Moisés J. Nascimento is the Director of Data Architecture at Paypal where he has worked since 2008. Moisés developed a comprehensive data integration platform leveraging Cloud computing principles to build a real time data engine for Paypal. Prior to Paypal, Moisés architected data products and systems for several high tech companies including BMC Software and Salesforce.com.
Jack Norris is the senior vice president of data and applications at MapR Technologies. Jack has a wide range of demonstrated successes, from defining new markets for small companies to increasing sales of new products for large public companies, in his 20 years spent in enterprise software marketing. Jack’s broad experience includes launching and establishing analytics, virtualization, and storage companies and leading marketing and business development for an early-stage cloud storage software provider. Jack has also held senior executive roles with EMC, Rainfinity, Brio Technology, SQRIBE, and Bain & Company. Jack earned an MBA from UCLA’s Anderson School of Management and a BA in economics with honors and distinction from Stanford University.
Cathy O’Neil a data scientist for the startup media company Intent Media. Cathy began her career as a postdoc in MIT’s Math department. She has been a professor at Barnard College, where she published a number of research papers in arithmetic algebraic geometry, worked as a quant for the hedge fund D.E. Shaw in the middle of the credit crisis and for RiskMetrics, a risk software company that assesses risk for the holdings of hedge funds and banks. Cathy holds a PhD in math from Harvard.
Amy is a business and technical executive with extensive experience driving growth and change. Amy leads Nokia’s Analytics team and is committed to enabling Nokia to be a data-driven business, ultimately delivering the best experience for consumers. Her charter includes the development of Nokia’s analytics platform, developing and managing Nokia’s data asset, and leading a team of data scientists to deliver descriptive and predictive insights. Prior to joining Nokia, Amy was vice president of services marketing for Sun Microsystems.
Mike Olson cofounded Cloudera in 2008 and served as its CEO until 2013, when he took on his current role of chief strategy officer. As CSO, Mike is responsible for Cloudera’s product strategy, open source leadership, engineering alignment, and direct engagement with customers. Previously, Mike was CEO of Sleepycat Software, makers of Berkeley DB, the open source embedded database engine, and he spent two years at Oracle Corporation as vice president for embedded technologies after Oracle’s acquisition of Sleepycat. Prior to joining Sleepycat, Mike held technical and business positions at database vendors Britton Lee, Illustra Information Technologies, and Informix Software. Mike holds a bachelor’s and a master’s degree in computer science from the University of California, Berkeley.
Oscar Padilla leads the strategic services for Entravision’s Data and Analytics Services division. He is accountable for driving the formulation of new product and service offerings, as well as helping clients achieve higher efficiency from their data and analytics marketing programs. Most recently, Oscar served as senior director of Interactive Strategy at Vertis Communications, a $1.4 billion communications and marketing company. As a direct report to Vertis’ CMO, Oscar oversaw the strategic direction of the division, leading client digital strategies in the areas of social media, mobile, online advertising and web development, and managed partners across all media channels.
In addition, Oscar has worked with Hitachi Consulting, Blackboard Inc., RepEquity (previously Mindshare Interactive) and Bridgeline Digital. At these companies, Oscar led engagement for numerous... Read More.
Josh Patterson is a Principal Solution Architect at Cloudera. Prior to joining Cloudera, he was responsible for bringing Hadoop into the smartgrid during his involvement in the openPDC project. His focus in the smartgrid realm with Hadoop and HBase was using machine learning to discover and index anomalies in time series data. Josh is a graduate of the University of Tennessee at Chattanooga with a Bachelors in Business Management and a Masters of Computer Science with a thesis titled “TinyTermite: A Secure Routing Algorithm” where he worked in mesh networks and social insect swarm algorithms. Josh has over 15 years in software development and continues to contribute to projects such as Apache Mahout, openPDC, and JMotif in the open source community.
Roy Pea is David Jacks Professor of Education and the Learning Sciences at Stanford University, Director of Stanford’s H-STAR Institute (Human Sciences and Technologies Advanced Research), and Founder & Director of Stanford’s PhD program in Learning Sciences and Technology Design. Roy publishes widely on K-12 learning and education, especially in science, math and technology, fostered by advanced technologies including scientific visualization, on-line communities, digital video collaboratories and mobile computers. He is co-author of the 2010 National Education Technology Plan for the US Department of Education, co-editor of Video Research in the Learning Sciences (2007), and co-author of the National Academy of Sciences book: How People Learn (2000). He is a Fellow of the National Academy of Education, Association for Psychological Science, the American Educational... Read More.
Jon Peltier is owner of Peltier Technical Services, Inc. He has been a full time Excel developer since 2004, when he founded Peltier Tech, and part-time since about 1995. Peltier Tech provides data and graphic solutions for users of Microsoft Excel, combining Excel’s worksheet functionality and charting capabilities with its powerful VBA programming environment to produce solutions in engineering, finance, and marketing. Peltier Tech has developed a line of Excel add-ins which extend Excel’s graphical capabilities. The Peltier Tech website is an encyclopedia of Excel charting and programming techniques.
Jon is a metallurgist by training, with a Doctor of Science degree from MIT. Prior to creating Peltier Tech, Jon spent over 20 years in research and development, and more recently in manufacturing and... Read More.
Mike Percy is a Software Engineer at Cloudera. Previously, he worked on Yahoo!’s C.O.R.E team.
Prior to joining Dstillery (former Media6Degrees), Claudia Perlich spent five years working at the Data Analytics Research group at the IBM T.J. Watson Research Center, concentrating on research in data analytics and machine learning for complex real-world domains and applications. She has been published in over 30 scientific publications and holds multiple patents in the area of machine learning. Claudia has won many data mining competitions, including the prestigious 2007 KDD CUP on movie ratings, the 2008 KDD CUP on breast-cancer detection, and the 2009 KDD CUP on churn and propensity predictions for telecommunication customers. Claudia received her Ph.D. in Information Systems from Stern School of Business, New York University in 2005, and holds a Master of Computer... Read More.
Tom Phillips is CEO of Media6Degrees (m6d), a marketing technology company that captures brand signals from major marketers to deliver high-performing ad campaigns.
Phillips was founding publisher of Spy magazine in 1986, the original media chief for Starwave Corporation in 1993 (creators of ESPN.com, NFL.com, ABCNews.com and NBA.com), and CEO of Deja.com in 1998. After a stint as a venture partner at Insight Venture Partners, he joined Google and managed media platforms and the DoubleClick integration before establishing the Search & Analytics team to pioneer new uses of Google data on behalf of major advertising clients.
Phillips holds a BA in applied mathematics from Harvard and a MBA from Stanford.
Arvind is the PMC Chair for Apache Sqoop and a committer and PPMC member of Apache Flume. A seasoned enterprise software developer, Arvind has worked at Netscape, Sun Microsystems, Informatica and currently at Cloudera.
Sanjay is an Apache Hadoop committer and member of the Apache Hadoop PMC.
Prior to co-founding Hortonworks, Sanjay was the chief architect of core-Hadoop at Yahoo focusing mostly on HDFS, MapReduce schedulers, compatibility, etc.
He has also held senior engineering positions at Sun Microsystems and INRIA, where he developed software for distributed systems and grid/utility computing infrastructures.
Sanjay has a PhD in Computer Science from the University of Waterloo in Canada.
Michael J. Radwin leads Intuit’s Data Services & Solutions team. His team develops new and innovative ways to put the collective data of 50 million customers to work for them, providing unique insights and experiences they can’t get anywhere else. With Radwin’s leadership, Intuit has adopted text analytics, recommendation services, and data-driven algorithms that create experiences that delight customers and impact the bottom line across product lines such as personal finance, tax, and accounting.
Prior to Intuit, Radwin was VP Engineering of Anchor Intelligence, which provided real-time scoring based on predictive models for advertisers and ad networks. Earlier, Radwin was Director of Engineering at Yahoo! where he developed and supported web and data platform technologies Apache, PHP, MySQL, and proprietary TCP/IP middleware systems.... Read More.
Aparna is Director of Engineering at Cloudera, where she leads the development of Cloudera Manager – the industry’s first end-to-end management application for Apache Hadoop. Prior to Cloudera, she worked on search monetization products at Yahoo! and on contextual advertising systems at Microsoft. Aparna holds a Master’s degree in Computer Science from Arizona State University.
Dr. Samantha Ravich consults with both private industry and Federal and State governments on international security, financial, and political risk. She is also a Senior Advisor to The Chertoff Group and Co-Chairs the National Commission for Review of Research and Development Programs in the IC. From 2009-2011, Ravich was Senior Vice President at IPS, a software and analysis firm.
Ravich was Deputy National Security Advisor to Vice President Cheney and served in the White House for 5½ years during the 2001-2008 time period. Ravich was the Vice President’s representative on Asian and Middle East Affairs as well as on Counter-Terrorism and Counter-Proliferation. During her tenure in OVP, she also worked on Free Market and Democracy Promotion, spearheading a number of government wide initiatives.... Read More.
Kim Rees is a founding partner of Periscopic, an award-winning information visualization firm. Their work has been featured in the MoMA as well as several publications, including Print Magazine, CommArts’ Interactive Annual, The Information Design Sourcebook, Adobe Success Stories, Infosthetics.com, FlowingData.com, and others. Periscopic’s body of work was recently nominated for the Cooper-Hewitt National Design Awards.
Kim is a prominent individual in the data visualization community. She has published papers in Parsons Journal of Information Mapping, was an award winner in the VAST 2010 Challenge, and is a guest blogger for Infosthetics.com. Kim has been featured on CommArts Insights and has presented at several industry events including the Wolfram Data Summit, Strata, VisWeek, Tableau Software Conference, WebVisions, and Portland Data Visualization.
... Read More.
Joseph Rickert is a Data Scientist and Community Manager at Revolution Analytics with a passion for analyzing data and teaching people about R. He is a regular contributor to the Revolutions blog and an organizer of the Bay Area R Users Group. Joseph is frequently invited to speak on the topic of R and big data. He has presented tutorials including the O’Reilly Strata conference, Strata/ Hadoop World, DataWeek , PACE.
Joseph has worked for a number of Silicon Valley start-ups and has experience building statistical models in industries as diverse as local area networks and healthcare. Joseph holds graduate degrees in both the Humanities and Statistics. He taught statistics briefly at SJSU.
As president of Luminar, Franklin Rios brings more than 20 years of management, business development, sales and marketing experience in the interactive, e-commerce, telecomm and media industries.
Prior to Luminar, Franklin was president of the Enterprise Solutions Group at Infogroup, and vice president of Interactive Sales and Marketing at Vertis Communications, Inc., a $1.4 billion communications and marketing company.
Franklin ran his own full-service Hispanic advertising agency, Rios Group, from 2003 to 2006. He was also director of sales at CSG Systems from 2001 to 2003 and a sales manager at Columbine JDS (Harris Broadcasting Systems) from 1994 to 2001.
Naomi B. Robbins is the author of Creating More Effective Graphs, published by John Wiley (2005). She is a consultant, keynote speaker, and seminar leader who specializes in the graphical display of data. She trains employees of corporations and organizations on the effective presentation of data with customized programs. She also reviews documents and presentations for clients, suggesting improvements or alternative presentations as appropriate. Naomi received her Ph.D. in mathematical statistics from Columbia University, M.A. from Cornell University, and A.B. from Bryn Mawr College. She had a long career at Bell Laboratories before forming NBR, her consulting practice.
Jason is a Sr. Architect at Think Big Analytics. He has many years of experience writing Java application software, most recently for Hadoop-based applications.
Eric Sammer is the CTO and co-founder of Rocana. Prior to Rocana, he was an engineering manager at Cloudera. His background is in the development and operations of distributed, highly concurrent, data ingest and processing systems. He’s been involved in the open source community and has contributed to a large number of projects over the last decade. Eric is the author of Hadoop Operations (O’Reilly).
Peter Schlampp is passionate about designing products that change the way users live, work, and interact with their world. He experienced first-hand the utility and complexity of big data while building products to secure enterprise networks. Peter has led Product and Marketing teams at Solera Networks, IronPort Systems, and Cisco Systems.
Dr. Charles Schmitt is the Director of Data Sciences at the Renaissance Computing Institute (RENCI), a research computing center at the University of North Carolina at Chapel Hill. As director, Dr. Schmitt is responsible for exploring and advancing the application of novel data technologies for national research agendas. This includes work in areas such as high throughput genomic sequencing, management of distributed research data, medical decision support, and data security.
Prior to joining RENCI, Dr. Schmitt worked as a Computer Scientist in industry in areas including data-mining, bioinformatics, and software engineering. His Ph.D. is in Computer Science where he focused on developing neural network models of the human visual system.
John Schroeder founded MapR to address core limitations that were limiting the adoption of Hadoop and expand Hadoop capabilities to serve new business critical and real-time applications.
John has led companies creating innovative and disruptive business intelligence, database management, storage and virtualization technologies at early stage ventures through success as large public companies. John was previously CEO of Calista Technologies (Microsoft), CEO of Rainfinity (EMC) and Senior Vice President of products and marketing at Brio Technologies (BRYO).
Noah Schwartz is technology professional with a passion for sports. Noah is a member of a group at Bloomberg that recognized an business opportunity for a sports analytics and has been instrumental in creating the platform to deliver the applications to the end users.
Michael has been working w Hadoop since 2009 and is the founder of the Chicago area Hadoop User Group (CHUG)
He is currently a Principal Architect with Think Big Analytics helping customers solve their Big Data problems.
Chris Selland recently joined HP Vertica as VP of Marketing and brings 20+ years of success driving demand and revenue via innovative online, search & inbound marketing programs as well as leading key strategic alliance & corporate development initiatives for entrepreneurial, high-growth companies including SoundBite Communications (NASDAQ:SDBT), Constant Contact (NASDAQ:CTCT) and Lumigent Technologies (acquired by BeyondTrust).
Chris is an established thought leader, speaker and author on customer strategy-related topics including CRM, Customer Metrics & Loyalty, Customer Support and Social Media Marketing, and is an Expert Advisor at Focus.com as well as a founding member of the Enterprise Irregulars. Earlier in his career he was VP of CRM and Internet Research at the Yankee Group, and later founded Reservoir Partners,... Read More.
SkipFlag is turning conversations into knowledge (https://skipflag.com).
Vin Sharma is the director of machine learning solutions in the Data Center group at Intel, where he focuses on autonomous driving and automated trading. Vin has helped build data center infrastructure software platforms—most recently the Trusted Analytics Platform—and has helped drive enterprise adoption of open source software like Linux, KVM, OpenStack, Hadoop and analytics for over 20 years. Before joining Intel, Vin held various engineering and management roles at HP for 15 years, building enterprise software products based on Linux, Java, XML, and other open source software.
Blake Shaw is currently a Data Scientist at foursquare, a location-based service that helps people keep up with friends and discover new places. At this NYC startup, Shaw applies machine-learning algorithms to large spatiotemporal datasets in order to better understand patterns of human mobility. Shaw holds a Ph.D. (also a B.S. and M.S.) degree in Computer Science from Columbia University. His research papers have been accepted at a variety of conferences including NIPS, ICML and AISTAT. Shaw’s “Structure Preserving Embedding” won the Best Paper Award at the 26th International Conference for Machine Learning in 2009. Shaw was also the lead developer of CabSense, a mobile app for predicting the best street corners in New York City for catching taxicabs.
Chang She is CTO and cofounder of Lambda Foundry. From 2011 to 2012, he served as Assistant Vice President at Barclays Capital researching quantitative FX strategies and building research infrastructure. From 2006-2011, he worked at AQR Capital Management in global equities research and algorithm execution. He graduated from MIT with an M.Eng in Computer Science and S.B. degrees in Computer Science and Political Science.
Opower – Analytics – Senior Quantitative Analyst
Freddie Mac – Mission Division – Quantitative Researcher
Johns Hopkins – M.A. Applied Economics
Virginia Tech – B.A. Economics
Tomer Shiran heads the product management team at MapR and is responsible for product strategy, roadmap and requirements. Prior to MapR, Tomer held numerous product management and engineering roles at Microsoft, most recently as the product manager for Microsoft Internet Security & Acceleration Server (now Microsoft Forefront). He is the founder of two websites that have served tens of millions of users, and received coverage in prestigious publications such as The New York Times, USA Today and The Times of London. Tomer is also the author of a 900-page programming book. He holds an MS in Computer Engineering from Carnegie Mellon University and a BS in Computer Science from Technion – Israel Institute of Technology.
Hari is a Software Engineer at Cloudera, where he is working on building Apache Flume. Previously, Hari was a software engineer on Yahoo! Mail’s metadata indexing and query team. He holds a Masters from Cornell University in Computer Science.
Patrick Shumate is currently busy developing and deploying the next generation of content delivery networks for Comcast Cable. Prior to Comcast he provided security consulting to US Government Agencies and was the Senior Architect for RSA Consumer Solutions devision, bring you such hits as the Go ID – federated two factor authentication and RSA eFraudNetwork.
Darío has been working in software development for more than 10 years.
Since 2004 he has specialized in large website, operations and
performance. Today, Darío is the Chief Architect of MercadoLibre (NASDAQ MELI) where he leads the architecture team.
Rick Smolan, a former Time, Life, and National Geographic photographer, is best known as the co-creator of the “Day in the Life” book series.
Today Smolan is the CEO of Against All Odds Productions which orchestrates global photography projects that combine creative storytelling with state-of-the-art technology. Pulling together teams of leading journalists and photographers (plus hundreds of thousands of members of the general public) to focus on emerging topics such as the global water crisis, the effect of the Internet on civilization or how the human race is learning to heal itself, Smolan and his team produce bestselling-illustrated books, TV specials, apps, and exhibits.
Many of Smolan’s books have appeared on the New York Times best-seller lists and have been featured on the covers... Read More.
Juhan Sonin, Creative Director of Involution Studios, is an emeritus of some of the finest software organizations in the world: Apple, the National Center for Supercomputing Applications (NCSA) and the Massachusetts Institute of Technology (MIT). He has been a creative director for almost two decades with his work being featured in the New York Times, Newsweek, BBC International, Billboard Magazine and National Public Radio (NPR). His designs have enjoyed installations throughout Europe and the United States.
Juhan is a recognized expert in design for health, process management, and security, providing consultation to the U.S. Department of Health and Human Services and the Department of Defense.
Juhan was a featured speaker on the topic of software design at the TTI Vanguard... Read More.
Marshall Sponder is an independent Web analytics and
SEO/SEM specialist working in the field of market research,
social media, networking, and PR. He provides digital data
convergence generating ROI and develops data metrics, KPIs,
and dashboards that drive businesses by setting and evaluating
For nearly a decade now, Marshall has been influencing
the development of the industry, lately focusing on social
media metrics. He also possesses considerable in-house corporate
experience as a group leader at IBM and Monster,
combined with contract work at Porter Novelli PR, and is
currently a Senior Analytics Manager at WCG (Wiesscom
Group); Marshall also has deep connections to the NYC startup
... Read More.
Laks has over 20 years of experience in various capacities in marketing, credit risk management, and customer portfolio management, with a focus on analytics and decision automation technologies. He has worked with a number of clients in the retail banking, mortgage, and retail industries addressing and solving various business problems, such as customer acquisition, originations, account management, attrition, collection, and fraud management. Before coming to Opera Solutions, Laks was with Fair Isaac as Senior Director of Innovation Management. There, he managed all aspects of new product development by identifying, evaluating, and developing new business opportunities in the Financial Services and Retail sectors. Laks also served as the Director of Product Management and the Director of Go-To-Market Management at Fair Isaac. In these roles, he led... Read More.
Raymie Stata is CEO of VertiCloud, a startup building a hosted Big Data platform for data-driven applications. Stata comes to VertiCloud from Yahoo!, Inc., where he most recently held the position of Chief Technical Officer. At Yahoo, he played an instrumental role in the development of Yahoo!’s infrastructures for algorithmic search, display advertising, and cloud computing. He also helped set Yahoo’s Open Source strategy and initiate its participation in the Hadoop project. Prior to joining Yahoo!, Stata founded Stata Laboratories, maker of the Bloomba search-based e-mail client, which Yahoo! bought in 2004. He has also worked for Digital Equipment’s Systems Research Center, where he contributed to the AltaVista search engine.
Julie Steele thinks in metaphors and finds beauty in the clear communication of ideas. She is particularly drawn to visual media as a way to understand and transmit information. Julie is coauthor of Beautiful Visualization (O’Reilly, 2010) and Designing Data Visualizations (O’Reilly, 2012).
Tom Strachan is the Regional Vice President at Gooddata where he is responsible for customer acquisition and retention for the Eastern Division. Prior to GoodData, Tom held a leadership position at WebTrend’s, a leader in the web analytics space where he helped customers measure and understand their return on the digital initiatives. In addition, Tom has been an entrepreneur in which he ran a Yoga eCommerce store called, Yogaduzit.com and has previously held a leadership position at Akamai Technologies where he participated in the growth of the organization from 20M to 400M in revenue. . He loves watching his customers go from being data bankrupt to monetizing that data in a very rapid time frame on the GoodData platform.
Mike Stringer is co-founder and managing partner of Datascope Analytics. A physicist gone wild, Mike is passionate about realizing the potential for big data analytics to make a positive impact on business and society. Despite wearing a suit occasionally, he still finds the time to argue with others about Bayesian statistics, explore the amazing food in Chicago, and has recently become fascinated with typography. Mike received a BS in Engineering Physics from the University of Colorado and a PhD in physics from Northwestern University.
Michael Sullivan is Deputy Editor at The New York World, Columbia Journalism School’s accountability journalism website covering New York State and City government, where he is helping to bring more data-driven journalism to the site. Before joining The New York World in 2012, Michael worked for The Wall Street Journal Online for eight years as an Interactive Designer and Developer. With a background in graphic design and a love for computer programming, Michael found that data visualization/journalism was the perfect niche to quench his creative thirsts. He knows all too well the overwhelming amount and ever-changing set of tools out there for anyone wanting to build information graphics—whether novice or expert—and continues to both learn from and teach others on how best to approach data... Read More.
Ryan is a data engineer at Think Big Analytics. He leads technical consulting projects for big data implementations at Fortune 500 clients. He has in depth experience working with Solr/Lucene and the Hadoop stack.
Liyin Tang is a software engineer at Facebook and a HBase Committer at the Apache Software Foundation. At Facebook, he works on building the data storage system based on HBase for various applications. Liyin holds a bachelor degree in Software Engineering from Shanghai Jiao Tong University, China and a master degree in Computer Science from University of Southern California, US.
Anoush currently works to implement the partnerships and communications strategy for Global Pulse, an initiative of the United Nations focused on applying innovations in big data and real-time analytics to global development.
Charles Thayer is a Software Engineer at Facebook, where he works on the Monitoring Systems. Before Facebook, he worked at Yahoo on search technology including the Web Crawler and Hosted Vertical Crawler. His focus has been scaling both storage and compute resources across thousands of nodes and tens of thousands of disks. He’s been involved with many startups in NYC including Metrobeat/Citysearch and CityRealty. He graduated with a BS EE from Columbia University’s School of Engineering and Applied Science, before becoming CEO of his first startup, Mediabridge Infosystems.
Jim has extensive experience in leading the development, management, and performance for enterprise data architectures, including clustered, large SMP, and distributed systems for the retail, web, and telecom industries. He is responsible for the architecture, vision, direction, and technical evangelization of InfiniDB. Jim holds a BBA from Texas A&M and a Masters in Management Information Systems from the University of Texas at Dallas.
Eric Tschetter is the lead architect of Druid, Metamarkets’ distributed, in-memory database. He held senior engineering positions at Ning and LinkedIn before joining Metamarkets. At LinkedIn, Eric productized LinkedIn’s PYMK with Hadoop. He holds bachelors degrees in Computer Science and Japanese from the University of Texas at Austin, and a M.S. from the University of Tokyo in Computer Science.
Vineet Tyagi has over 17 years of experience on product development, managing innovation and creating enterprise grade solutions for Healthcare, Transportation, Insurance and Financial sectors. His expertise includes Big Data, Hadoop, Cloud Computing and Agile/Lean software development. Vineet has spearheaded the design & architecture of several large & cloud scale solutions for data lifecycle management at petabyte scale. Vineet also leads many Open source contributions that have received global recognitions and is a noted speaker on Big Data Technologies.
Rohit is an experienced technology and marketing executive with over 15 years of experience in enterprise datacenter technologies including hands-on software development, product management and marketing experience in virtualization, cloud computing, security and Java EE middleware. Before joining IBM Corporation through its Platform Computing acquisition, he was the Director of Sun Microsystems’ Cloud Services Business Unit and head of Oracle University marketing. He has been a speaker at numerous JavaONE and other technical conferences and published papers in IEEE and other journals. He is also the author of two US Patents for Java and internet technologies.
Dirk Van den Poel is full professor of marketing modeling at the Faculty of Economics and Business Administration of Ghent University in Belgium. He heads a competence center on analytical customer relationship management (aCRM), direct marketing, customer intelligence. He received his degree of management/business engineer as well as his PhD from K.U.Leuven (Belgium). His main fields of interest are studying consumer behavior from a quantitative perspective (CRM), data mining ￼(genetic algorithms, neural networks, random forests), and operations research. He co-authored more than 50 international peer-reviewed articles in leading academic journals including Journal of Applied Econometrics, International Journal of Information Technology and Decision Making.
Vinod Venkataraman is a Software Engineer at Facebook, where he focuses on developing the in-house monitoring systems. Vinod holds a Master’s degree from the University of Texas at Austin, and a Bachelor’s degree from the National Institute of Technology, Trichy, India, both in Computer Science.
Unravel radically simplifies the management and operations of Hadoop, Spark, NoSQL and MPP applications. Unravel is used by several leading organizations to optimize application performance and cluster utilization.
Bala Venkatrao is currently part of the Founding Team and VP, Products at Unravel Data, a start-up that is re-defining Application Performance Management for Big Data Applications. Bala’s prior experience includes working in product mgmt/strategy roles at Elastic and Cloudera. Bala was an early employee at Cloudera where he was involved in a wide array of activities including Product Management, Marketing, Partnership Development and Customer Advocacy.
Dauwe completed his masters degree in Business Engineering at Ghent University with a master thesis on distribution optimization in the home vending industry. After some time as an analyst at Accenture, Dauwe returned to Ghent to start his Phd at the faculty of economics and business administration. His main focus is on integrating customer variables in distribution optimization problems. He is also a teaching assistant in the Master of Marketing Analysis program. He has worked on different supply chain projects in different industries and has spent over 3 years working on a project in the home vending industry. His main fields of interest are Optimization, Supply Chain Management, Big Data and Analytics.
JD is a User Experience Lead at Salesforce where he specializes in search, online communities and collaboration. Prior work experience includes Director of UX at MarkLogic Inc, and NewCity, Inc. where he was the UX lead for the Virginia Bioinformatics Institute’s most successful project – resulting in a $27 million grant. (largest in the history of Virginia Tech) Awards include multiple IMA, CASE and ADDY’s. A graduate of Carnegie Mellon’s MHCI program, JD is frankly super stoked about how Big Data and design will affect our future.
Jim is a recovering developer, professional marketer and amateur photographer with nearly twenty years experience building products and developing emerging technologies. During his career, he has brought multiple products to market in a variety of fields, including data loss prevention, master data management and now big data. At Hortonworks, Jim is focused on accelerating the development and adoption of Apache Hadoop.
Dean Wampler is Principal Consultant at Think Big Analytics, specialists in “Big Data”, Machine Learning, and the Hadoop ecosystem. He speaks frequently at conferences on various big data and other programming topics.
Dean is the author of Functional Programming for Java Developers (O’Reilly, 2011), the co-author of Programming Scala (O’Reilly, 2009) and the co-author of the forthcoming Programming Hive, also from O’Reilly.
Sewook Wee is an R&D manager at Accenture Technology Labs. His research has been grounded on distributed system with current emphasis on Big Data platform technologies. Recently, he led Hadoop deployment comparison study where he compared bare metal Hadoop cluster with Hadoop services (Amazon EMR) at the total cost of ownership level with three real world workloads. Previously, he has led various R&D projects including hybrid NoSQL approaches that layers graph data management capability on column-oriented datastores; MapReduce-based data transformation framework; next generation software architecture that maximizes the benefits of cloud; MonteCloudo, elastic Monte Carlo simulation architecture using cloud; and web server farm architecture on AWS EC2 environment. Along with leading R&D projects, he publishes academic papers, business white papers, files patents, presents... Read More.
Frank Weigel is Vice President of Products at Couchbase, responsible for product strategy, product management, and community engagement. Frank is passionate about working with customers and users to understand emerging requirements and successfully driving disruptive technology solutions to market. Previous to Couchbase, Frank was with IBM as technical product manager for cross-platform virtualization. Prior to that Frank served in a variety of roles at Transitive, including R&D, product management, and sales engineering. Frank received his undergraduate and graduate degrees in computer engineering from Manchester University, and holds several U.S. patents.
Ben Werther is the Founder and Executive Chairman of Platfora. Ben launched Platfora, and was the founding CEO for four years, with the goal of transforming how ‘citizen data scientists’ in every company make sense and drive action through direct and effortless use of big data. Before founding Platfora, Ben was vice president of products for DataStax, where he shaped the company’s enterprise and Hadoop strategy, and was also head of products at Greenplum through its acquisition by EMC. Ben has a B.S. in Computer Science from Monash University (Australia) and an M.S. in Computer Science from Stanford University.
Tom Wheeler’s career spans more than fifteen years in the communications, biotech, financial, healthcare, aerospace and defense industries. Before joining Cloudera, he developed engineering software at Boeing, helped to design a high-volume data processing system for WebMD and served as senior programmer/analyst for a brokerage firm. Mr. Wheeler is a frequent presenter at both user groups and software conferences.
Hadley Wickham is Chief Scientist at RStudio and an Adjunct Professor at Rice University. He is an active member of the R community, has written and contributed to over 40 R packages, and won the John Chambers Award for Statistical Computing for his work developing tools for data reshaping and visualisation. His research focusses on how to make data analysis better, faster and easier, with a particular emphasis on the use of visualisation to better understand data and models.
Edd Wilder-James is a technology analyst, writer, and entrepreneur based in California. He’s helping transform businesses with data as VP of strategy for Silicon Valley Data Science. Formerly Edd Dumbill, Edd was the founding program chair for the O’Reilly Strata conferences and chaired the Open Source Convention for six years. He was also the founding editor of the peer-reviewed journal Big Data. A startup veteran, Edd was the founder and creator of the Expectnation conference-management system and a cofounder of the Pharmalicensing.com online intellectual-property exchange. An advocate and contributor to open source software, Edd has contributed to various projects such as Debian and GNOME and created the DOAP vocabulary for describing software projects. Edd has written four... Read More.
hMatt is a Principal Program Manager on the Big Data team at Microsoft focused on building tools to make developers’ lives easier. Matt has been at Microsoft for 6 years working on developer tooling for distributed systems and cloud applications as well as technical evangelism. Prior to joining Microsoft, Matt was a consultant in the central United States building enterprise integration solutions.
As the Technology Evangelist for Amazon Web Services, Matt Wood discusses the technical and business aspects of cloud computing throughout Europe. With a background in the life sciences, Matt is interested in helping teams of all sizes bring their ideas to life through technology. Before joining Amazon he built web-scale search engines at Cornell University, sequenced DNA in Hinxton and developed analytics software in Cambridge. He is a frequent speaker at international conferences, a blogger, published author and an advocate of developer productivity.
Adrian Woodhead works for Expedia helping tackle their “big data” challenges. This involves matching data sets of hundreds of millions of items across different product areas in order to allow for analysis and inspection. Prior to this he was the lead of the data team at Last.fm, worked in Amsterdam for a fledgling Digital Rights Management startup as well as other companies and startups in the mobile, content management and banking spaces.
For information on exhibition and sponsorship opportunities, contact Susan Stewart at firstname.lastname@example.org.
For information on trade opportunities contact Kathy Yu at mediapartners
For media-related inquiries, contact Maureen Jennings at email@example.com
View a complete list of Strata contacts.