Presented By O’Reilly and Cloudera
Make Data Work
March 5–6, 2018: Training
March 6–8, 2018: Tutorials & Conference
San Jose, CA

Meta your data; drain the big data swamp

9:30am–10:00am Tuesday, March 6, 2018

Data lakes have enabled organizations to consolidate data across functional boundaries to drive insights from analytics. However, a lack of focus on an enterprise-wide strategy and architecture has kept the otherwise capable data lake from solving the many enterprise-wide problems that depend on horizontal or vertical views of the data, views that can in turn inform actionable strategies for the firm.

When BP set out to implement an operational data lake, it was driven by the desire to reduce operational risk, improve predictability and operational awareness, prepare for and invest in the future, and gain a competitive advantage. To achieve these goals, BP developed a comprehensive strategy and approach for a sustainable operational data lake that included data and information governance in a bill of data; a lightweight, portable, and scalable process design; metadata management technology for big data ecosystems; and an Agile approach to delivering business results.

Madhav Madaboosi and Meenakshisundaram Thandavarayan offer an overview of BP’s self-service operational data lake, which improved operational efficiency by boosting productivity through fully identifiable data and reducing the risk of a data swamp. They cover the path and big data technologies that BP chose, lessons learned, and pitfalls encountered along the way.

Madhav Madaboosi


Madhav Madaboosi is a digital business and technology strategist within the Strategy, Architecture, and Planning Group at BP, where he leads a number of global innovation initiatives in the areas of robotic process automation, AI, big data, data lakes, and the industrial IoT. Previously, Madhav was the interface to several business portfolios within BP as a business information manager. Prior to BP, he worked in management consulting for a number of Fortune 100 firms. Madhav holds a degree in business and has completed executive programs at the Kellogg Institute of Management.

Meenakshisundaram Thandavarayan


Meena Thandavarayan is a practice lead at Infosys, where he focuses on leveraging technical advancements and industry reference architectures to define a data delivery platform. Meena has extensive experience leading application, technology, data, and infrastructure teams in developing strategy, architecture, implementation, and IT operational services. A big data and analytics evangelist, he specializes in strategy for accelerating the digitization journey for oil and gas clients. Most recently, he delivered functional and technical architecture for a one-stop self-service data and information portal.