New speakers are added continuously. Please check back to see the latest updates to the program.
Ziya Ma is the general manager of the global Big Data Technologies organization in Intel’s Software and Services Group (SSG) in the System Technologies and Optimization (STO) Division. Her organization focuses on optimizing... Read More.
Deepak Agrawal is vice president of data sciences at the 24[ 7] Innovation Labs, a part of 24[ 7] Customer Solutions. At 24[ 7] Innovation Labs, he heads the Data Science Group, which does research... Read More.
SeongHwa Ahn currently works on various things related to Apache Hadoop at SK Telecom with his primary focus on solving telco and manufacturing industry problem. He has spent 18 years working in IT services and... Read More.
Tyler Akidau is a senior staff software engineer at Google Seattle, where he leads technical infrastructure internal data processing teams for MillWheel and Flume. Tyler is a founding member of the Apache Beam PMC... Read More.
With over 15 years in advanced analytical applications and architecture, John Akred is dedicated to helping organizations become more data driven. As CTO of Silicon Valley Data Science, John combines deep expertise in analytics... Read More.
Utkarsh B is principal architect at Flipkart responsible for building the marketplace technology platform with specific focus on building Cataloging as a Service. Utkarsh has extensive experience (12+ years) in large scale product/systems development, including... Read More.
Dr Vivian Balakrishnan, 54, studied Medicine at the National University of Singapore after being awarded the President’s Scholarship in 1980. He was elected President of the NUS Students’ Union from 1981 to 1983 and... Read More.
Regunath Balasubramanian works at Flipkart as Principal Architect for Commerce and Supply Chain platforms. He also leads Flipkart’s open source initiatives and is a committer on a number of projects. Prior to Flipkart, he architected... Read More.
Amit is Managing Director at Accenture Digital, with responsibility for Analytics Delivery across Asia Pacific, including ASEAN, Australia/New Zealand, Japan and Greater China. Amit’s core focus is on cognitive computing, big data and analytics,... Read More.
Thomas Beaujard is an Analytics Executive at Accenture Digital based in Perth, Western Australia. He runs the Accenture Digital practice in Perth, and delivers analytics solutions using big data technologies for Mining and Oil &... Read More.
As the Business Enablement Lead for the Data Insights group at Accenture Technology Labs, Hallie is focused on making analytics and data science more accessible to internal and client leadership. She also runs the Accenture-University... Read More.
Zhaojuan Bianny Bian is an engineering manager in Intel’s Software and Service Group, where she focuses on big data cluster modeling to provide services in cluster deployment, hardware projection, and software optimization. Bianny has more... Read More.
Albert Bifet is senior researcher at Huawei. A big data scientist with 10+ years of international experience in research, Albert has led new open source software projects for business analytics, data mining, and machine learning... Read More.
Joerg Blumtritt is the founder and CEO of Datarella, a computational social science startup delivering mobile analytics, self-tracking solutions, and data science consulting. After graduating from university with a thesis on machine learning, Joerg... Read More.
Farrah Bostic created the Difference Engine based on her belief that deep understanding of customer needs is essential to growing businesses through great products and services. Farrah has honed her customer-centric insights as an advisor... Read More.
Pauline Brown is director of marketing at Dataiku, which has developed the most productive predictive services development platform for data professionals, Data Science Studio (DSS). Pauline is a firm believer that the data and... Read More.
Dave Chan, CBIP, is a business analytics practitioner with over a decade of experience implementing big data projects for retail banking, healthcare and media organizations. In his current role, Dave leads a team of... Read More.
Evan Chan is a distinguished software engineer at Tuplejump. Evan loves to design, build, and improve bleeding-edge distributed data and backend systems using the latest open source technologies. He has led the design and implementation... Read More.
Oliver Chen is a chapter lead at DataKind Singapore. This is a volunteer position, and in his day job he’s a partner at Global Valuation Ltd. He was previously a Deputy Director at the Risk... Read More.
Selene Chew is a User Experience Design Leader at Adatao, the Silicon-Valley-based leader in enterprise Big Apps. She has taught “Storytelling through Design Media” at the Ohio State University Department of Design. She won the... Read More.
Matthew Conlen is a software engineer and information designer in New York. He is a partner at the New York Data Company, and works as the senior developer for Rhizome and computational journalist at FiveThirtyEight.... Read More.
Doug Cutting is the chief architect at Cloudera and the founder of numerous successful open source projects, including Lucene, Nutch, Avro, and Hadoop. Doug joined Cloudera from Yahoo, where he was a key member of... Read More.
Shirshanka Das is a principal staff software engineer and the architect for LinkedIn’s analytics platforms and applications team. He was among the original authors of a variety of open and closed source projects built at... Read More.
Yves-Alexandre de Montjoye is a lecturer at Imperial College London, a research scientist at the MIT Media Lab, and a postdoctoral researcher at Harvard IQSS. His research aims to understand how the unicity... Read More.
Danielle Dean is a principal data scientist lead at Microsoft in the Algorithms and Data Science Group within the Artificial Intelligence and Research Division, where she leads a team of data scientists and engineers building... Read More.
Masaru Dobashi is a system infrastructure engineer and leads the OSS professional service team at NTT DATA Corporation. Masaru developed an enterprise Hadoop cluster consisting of over 1,000 nodes in 2009, which... Read More.
Mark Donsky leads data management and governance solutions at Cloudera. Previously, Mark held product management roles at companies such as Wily Technology, where he managed the flagship application performance management solution, and Silver Spring Networks,... Read More.
Ted Dunning has been involved with a number of startups with the latest being MapR Technologies where he is Chief Application Architect working on advanced Hadoop-related technologies. He is also a PMC member for... Read More.
Dr. Scott Edington provides executive leadership and oversight for OA’s Consulting practice and OA Labs. Edington’s career spans over two decades of creating next generation technology capabilities in the Payments, Defense, and Intelligence sectors.
Prior... Read More.
Jana’s a math and computer nerd who took the business path for a career. Today, she’s CEO of Nara Logics, a neuroscience-inspired artificial intelligence company, providing a platform for recommendations and decision support. Her... Read More.
Bin Fan is a software engineer at Alluxio and a PMC member of the Alluxio project. Previously, Bin worked at Google building next-generation storage infrastructure, where he won Google’s Technical Infrastructure award. He holds... Read More.
Ju Fan received his PhD in computer science from Tsinghua University, China in 2012. He is currently a research fellow in the School of Computing, National University of Singapore. His research interest includes big data... Read More.
Sameer Farooqui is a client services engineer at Databricks, where he works with customers on Apache Spark deployments. Sameer works with the Hadoop ecosystem, Cassandra, Couchbase, and general NoSQL domain. Prior to Databricks, he worked... Read More.
Eric Frenkiel co-founded our company and has served as CEO since inception. Before MemSQL, Eric worked at Facebook on partnership development. He has worked in various engineering and sales engineering capacities at both consumer... Read More.
Yew Yap is software engineer at PayPal. He has worked on recommendation systems for marketing and personalization in PayPal. Prior that, he has worked on different type of systems such as Internet Banking and Operational... Read More.
Sonal is the founder and CEO at Nube Technologies (www.nubetech.co), a startup focussed on big data preparation and analytics. Nube Technologies builds business applications for better decision making through better data. Nube’s fuzzy matching... Read More.
Mark Grover is a product manager at Lyft. Mark is a committer on Apache Bigtop, a committer and PPMC member on Apache Spot (incubating), and a committer and PMC member on Apache Sentry.... Read More.
Dr. Stephen Hardy is technology directory at National ICT Australia. He has extensive experience in applying data analytics to problems in industry and government. He was previously head of Canon’s Image and Video Research... Read More.
Guy Harrison is an executive director of research and development at Dell Software. Guy is the author of Oracle Performance Survival Guide, MySQL Stored Procedure Programming, and Oracle SQL High Performance Tuning as well... Read More.
Chris Harrold is the Global CTO for Big Data Solutions for EMC Corporation. He has been with EMC since the Isilon acquisition in 2011 and is based in Denver, Colorado far from... Read More.
Tara Hirebet is the Regional New Business Director for R/GA overlooking New Business Consulting, Intelligence and Innovation, working across the R/GA Singapore, Shanghai and Sydney offices covering the Asia Pacific Region. Before that she was... Read More.
Felipe Hoffa is a developer advocate for big data at Google, where he inspires developers around the world to leverage the Google Cloud Platform tools to analyze and understand their data in ways they could... Read More.
Dr. Thomas Holleczek is a data scientist at DataSpark in Singtel. His research interests include big data analytics, machine learning, ubiquitous computing, wearable sensors, and traffic models. Prior to joining Singtel, Thomas worked at
Juliet Hougland is a data scientist at Cloudera and contributor/committer/maintainer for the Sparkling Pandas project. Her commercial applications of data science include developing predictive maintenance models for oil and gas pipelines at Deep Signal and... Read More.
Jonathan Hsieh is a software engineer at Cloudera. He is an Apache HBase committer, and Apache Flume founder.
Shengsheng (Shane) Huang is a software architect at Intel and an Apache Spark committer and PMC member, leading the development of large-scale analytical applications and infrastructure on Spark in Intel. Her area of focus... Read More.
As Fusionex’s Vice President in Business Consulting, Isaac Jacob has more than a decade’s experience in enterprise software project implementations.
He has overseen and been involved in Big Data, Business Intelligence and Analytics projects in... Read More.
In his role as a Consultant Product Manager, Nikhil focuses on enabling Hadoop analytics on EMC’s Cloud Storage and Converged Infrastructure platforms.
Previously, as a Software Engineer, Nikhil developed massive distributed systems and helped wrangle... Read More.
Amit Kapoor is interested in learning and teaching the craft of telling visual stories with data. At narrativeVIZ Consulting, Amit uses storytelling and data visualization as tools for improving communication, persuasion, and leadership through workshops... Read More.
Ji Sung Kim is a IT manager in SK Telecom which is the largest wireless telecommunications service provider of south korea. He has been working in the area of building big data applications. He participated... Read More.
Dr. Markus Kirchberg serves as the Head of Technology Innovation and has responsibility for driving and delivering technology innovation across the Asia Pacific region. Kirchberg has over 20 years experience in research and technology-driven innovation.... Read More.
Hong-Eng Koh started his career with the Singapore Police Force (SPF) after graduating under an SPF scholarship. Through the years, he has held various appointments including senior investigation officer, head of crime prevention... Read More.
Naren Koneru is an engineering manager at Cloudera and leads the navigator development team. Prior to Cloudera, Naren was at Miti, building enterprise-wide metadata and governance solutions. Before joining Miti, Naren spent over seven years... Read More.
Marcel Kornacker is a tech lead at Cloudera and the architect of Apache Impala (incubating). Marcel has held engineering jobs at a few database-related startup companies and at Google, where he worked on several ad-serving... Read More.
Yuichi Kuroda has been an R&D manager at Mitsubishi UFJ Information Technology (MUIT) since 2015, working on technology innovation for Mitsubishi UFJ Financial Group (MUFG) with big data technology and other... Read More.
Priya Lakshminraryanan is the Director of Product Management at the Emerging Technologies Division at EMC. Priya focuses on Big Data strategies for the Cloud storage and converged infrastructure platforms at ETD. She joined... Read More.
Philip Langdale is the engineering lead for cloud at Cloudera. He joined the company as one of the first engineers building Cloudera Manager and served as an engineering lead for that project until moving to... Read More.
Uri Laserson is a data scientist at Cloudera. Previously, he obtained his PhD from MIT where he developed applications of high-throughput DNA sequencing to immunology. During that time, he co-founded Good Start Genetics,... Read More.
Kevin is VP of Data and Growth at GrabTaxi, where among other things he leads IT, recruiting and the financial services business unit, but is by far the most passionate about leading data science, working... Read More.
Kalev H. Leetaru founded and leads the GDELT Project, which monitors the world’s broadcast, print, and web news media in over 100 languages in real time and identifies the people, locations, organizations, counts, themes,... Read More.
Sanqi Li is the CTO of products and solutions at Huawei. Previously, Sanqi served as CTO at Tekelec, CTO of Carrier Network Business Group’s IT product line and Core Network product line,... Read More.
Todd Lipcon is an engineer at Cloudera, where he primarily contributes to open source distributed systems in the Apache Hadoop ecosystem. Previously, he focused on Apache HBase, HDFS, and MapReduce, where he designed and... Read More.
Feng-Yuan is director, government analytics at the Infocomm Development Authority (IDA) of Singapore. He heads a multidisciplinary team of data scientists including data analysts, social scientists, computer scientists, and data visualizers that help government... Read More.
Jun Liu is a senior performance engineer in Intel’s Software and Service group, where he works in the area of big data performance modeling and simulation, especially SQL-on-Hadoop systems. Before Intel, Jun was a... Read More.
Roger Magoulas is the research director at O’Reilly Media and chair of the Strata + Hadoop World conferences. Roger and his team build the analysis infrastructure and provide analytic services and insights on technology-adoption trends... Read More.
Ted has worked on close to 60 Clusters over 2-3 dozen clients with over 100’s of use cases. He has 18 years of professional experience working for start-ups, the US government, a number of the... Read More.
Rishi Malhotra is co-founder and CEO of Saavn, India’s leading music streaming service. As CEO, Rishi has led the company through significant and rapid user growth, while helping to secure partnerships with companies... Read More.
Silviu Maniu is a researcher at Noah’s Ark Lab, Huawei Technologies. He holds a PhD degree in Computer Science from Telecom ParisTech. His main research interests are social and uncertain data management databases, and stream... Read More.
eff Markham is the Technical Director, APAC for Hortonworks, the only company building Open Enterprise Hadoop. Previously, he was with VMware, Red Hat, and IBM helping companies build distributed applications with distributed data.... Read More.
Paul’s remit is to drive the business strategy and sales execution for the SAP Platform Solutions which includes all Database & Analytics solutions along with the successful adoption of all solutions underpinned by
Jennifer Marsman is the principal software engineer for Microsoft’s AI for Earth Group, where she uses data science, machine learning, and artificial intelligence to aid with clean water, agriculture, biodiversity, and climate change. She has... Read More.
Sujit Mathew is a Software Development Manager at the Consumer Group in PayPal. Where his team builds data driven predictive models that drive consumer engagement. Sujit has also worked as a Technology lead and Developer... Read More.
Patrick McFadin is one of the leading experts in Apache Cassandra and data-modeling techniques. As a consultant and the chief evangelist for Apache Cassandra at DataStax, Patrick has helped build some of the largest and... Read More.
Wes McKinney is a software engineer at Cloudera and lead developer of Ibis. He is the creator of Python’s pandas library and is the author of Python for Data Analysis. Previously, Wes was the founder... Read More.
Ken Medlock is an enterprise architect at the ANZ Banking Group and is leading ANZ Technology Simplification initiatives through data-driven complexity insights, development of solution rationalisation, and decommissioning strategies and frameworks. Ken has... Read More.
Prakhar Mehrotra is currently leading a team of data scientists as part of the strategic finance group within Uber. Prior to joining Uber, he was a data scientist within Sales & Monetization finance at Twitter.... Read More.
Neil Mendelson is Vice President of Big Data and Advanced Analytics, Product Management within Oracle Server Technologies.
Neil has recently returned to Oracle having originally been responsible for Data Warehousing within Product Management. In the... Read More.
Seventeen years in the analytics arena has seen Rakesh Menon work on various projects across multiple industries, from manufacturing, consumer electronics, healthcare, and banking to telco and more. Starting out in an academic environment, he... Read More.
Andreas Mueller received his PhD in machine learning from the University of Bonn. After working as a machine learning researcher on computer vision applications at Amazon for a year, he recently joined the Center for... Read More.
Raghu is in the upper echelon of Cisco Engineering having been awarded with the ellusive Distinguished engineer Status.
He is the Chief Architect of Big Data and Analytics Solutions working
within Cisco’s Data... Read More.
Arshak Navruzyan is a machine-learning-focused product manager and the founder of Startup.ML, a machine-learning fellowship program that has graduated over 30 data scientists now employed by companies including Uber, Facebook, and Baidu. Arshak has delivered... Read More.
Minh Chau Nguyen is a researcher in the Big Data Software Platform Research department at the Electronic and Telecommunications Research Institute (ETRI), one of the largest government-funded research institutes in Korea. His research interests... Read More.
Mike Olson cofounded Cloudera in 2008 and served as its CEO until 2013, when he took on his current role of chief strategy officer. As CSO, Mike is responsible for Cloudera’s product strategy,... Read More.
Josh Patterson currently runs a consultancy in the Big Data Machine Learning space and is an advisor to Skymind (deep learning startup). Previously Josh worked as a Principal Solutions Architect at Cloudera and an engineer... Read More.
Deepak in his role as Chief Technology Officer, has been advising customers across the region on their
analytical strategies. Now in his 11th year at SAS, Deepak works with customers across industries in
Jai Ranganathan is the director of product strategy at Cloudera, where he is responsible for planning the future roadmap of Cloudera products. Before Cloudera, he spent a decade at VMware, where among other things he... Read More.
Nirmal Ranganathan is a Principal Engineer working on the Data Stores Platform at Rackspace. He constantly works with various teams within Rackspace and customers alike, directing them on how best to take advantage of Big... Read More.
Tom manages Woodside’s data science program as VP Science. This program aims to deliver company-wide benefits through the application of predictive analytics, machine learning and cognitive computing techniques. Prior to this he has worked in... Read More.
Julie Rodriguez is vice president of product management and user experience at Eagle Investment Systems. An experience designer focusing on user research, analysis, and design for complex systems, Julie has patented her work in data... Read More.
Sandy Ryza is a data scientist at Cloudera focusing on Apache Spark and its ecosystem. He is an author of Advanced Analytics with Spark, as well as a frequent Spark contributor and member of the... Read More.
Kostas Sakellis is the lead and engineering manager of the Apache Spark team at Cloudera. Kostas holds a bachelor’s degree in computer science from the University of Waterloo, Canada.
Majken Sander is a data nerd, business analyst, and solution architect. Majken has worked with IT, management information, analytics, BI, and DW for 20+ years. Armed with strong analytical expertise, she is keen on “data... Read More.
Recently named number five of the top nine women writers in big data and business intelligence by Business Intelligence Solutions Review, Joanna Schloss is a subject matter expert in the areas of big data analytics,... Read More.
Jim Scott is the director of enterprise strategy and architecture at MapR Technologies. He is passionate about building combined big data and blockchain solutions. Over his career, Jim has held positions running operations, engineering, architecture,... Read More.
Paul Scott-Murphy has built messaging, analytics, integration, and big-data systems for over 20 years, currently with WANdisco where he is VP of Field Technical Services APJ. Previously CTO for Australia and New Zealand... Read More.
Jonathan Seidman is a software engineer on the partner engineering team at Cloudera. Previously, he was a lead engineer on the big data team at Orbitz Worldwide, helping to build out the Hadoop clusters supporting... Read More.
Tushar Shanbhag is the Vice President of Products at Adatao, the leader in Enterprise Big Apps. Previously, he led the flagship Director and Navigator products at Cloudera. He spent many years at VMWare. Prior to... Read More.
Gwen Shapira is a system architect at Confluent, where she helps customers achieve success with their Apache Kafka implementations. She has 15 years of experience working with code and customers to build scalable data architectures,... Read More.
Mingfei Shi is a senior software engineer on Intel’s big data technology team. He is one of the top contributors to the Tachyon project, and also a contributor to the Spark project.
Dr. Amy Shi-Nash is chief data science officer at Singtel. Amy has 15 years of industry experience in data mining, consumer analytics, loyalty, marketing, and management consulting globally. As the founding member of DataSpark at... Read More.
Around 15 Yrs of Experience in IT. Worked with IBM for 4 yrs providing customer support in North India Region on IBM AIX and Storage Arrays. Worked in HP Storage Labs Bangalore... Read More.
Rod Smith is an IBM fellow & Vice President of the IBM Emerging Internet Technologies organization, where he leads a team of highly technical innovators in seeking out disruptive technologies that aid businesses... Read More.
Yoshitaka Suzuki is a researcher in information science and technology at IHI Corporation. Yoshitaka has developed anomaly detection algorithms for several kinds of products, such as industrial machines and engines, but is now responsible... Read More.
Ivan Teh is the managing director of Fusionex, an award-winning company specializing in business intelligence, big data and analytics. Ivan has over 17 years of experience in the ICT industry. Previously, Ivan managed teams... Read More.
Kai Xin Thia is a data scientist at Lazada. He specializes in behavioral analytics and has an interest in large recommendation systems. He has been building behavioral models for three years and is in the... Read More.
Kathleen Ting is a technical account manager at Cloudera where she helps strategic customers deploy and use the Apache Hadoop ecosystem in production. She’s a frequent conference speaker, has contributed to several projects in the... Read More.
Wee Hyong Tok is a principal data science manager at Microsoft, where he works with teams to cocreate new value and turn each of the challenges facing organizations into compelling data stories that can be... Read More.
Whye Loon comes from a computing background, and received his bachelor degree in Computer Engineering and doctoral degree in Computational Intelligence from the Nanyang Technological University (NTU) in 2000 and 2004, respectively. He started... Read More.
Vinod Venkatraman is currently helping build the marketplace technology platform at Flipkart. Vinod’s specialties are Core Java, Java concurrency, SOA, JMS, web services, JTA, web development, Spring, Struts, JDBC, SQL,... Read More.
Vishnu Vettrivel is a developer and architect with around 20 years of experience building and scaling successful big data and AI platforms. He is the architect and co-founder of the Nephos project that is an... Read More.
Vladimir is an accomplished Information Management professional with a broad experience in a variety of environments and industries. Specialising in architecting and delivering business solutions, Vladimir is an achievements-orientated professional who actively demonstrates a high-performing... Read More.
Skye Wanderman-Milne is an engineer on the Impala team at Cloudera.
Wei Wang is a Ph.D. student in the computer science department of the National University of Singapore. Currently, he is working on an Apache incubator project (SINGA) for developing a general distributed deep learning... Read More.
Melanie Warrick is a senior developer advocate at Google with a passion for machine learning problems at scale. Melanie’s previous experience includes work as a founding engineer on Deeplearning4j and as a data scientist and... Read More.
Currently work at Baidu Big Data Group, focusing on big data infrastructure
Hee Sun Won is a principal researcher at the Electronic and Telecommunications Research Institute (ETRI) and leads the Collaborative Analytics Platform for BDaaS (big data as a service) and analytics for the Network Management... Read More.
Reynold Xin is a cofounder and chief architect at Databricks as well as an Apache Spark PMC member and release manager for Spark’s 2.0 release. Prior to Databricks, Reynold was pursuing a PhD at... Read More.
Fangjin Yang is a coauthor of the open source Druid project and a cofounder of Imply, a data analytics startup based in San Francisco. Previously, Fangjin held senior engineering positions at Metamarkets and Cisco Systems.... Read More.
©2015, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.