Presented By
O’Reilly + Cloudera
Make Data Work
29 April–2 May 2019
London, UK
Please log in

The vindication of big data: How Santander UK uses Hadoop to defend privacy

Maurício Lins (everis NTT DATA UK), Lidia Crespo (Santander UK)
16:3517:15 Wednesday, 1 May 2019
Case studies, Strata Business Summit
Location: Capital Suite 12
Average rating: ****.
(4.50, 4 ratings)

Who is this presentation for?

  • Those interested in GDPR and big data architecture



Prerequisite knowledge

  • Familiarity with big data technology and GDPR

What you'll learn

  • Explore new patterns and an approach to deal with obscuring and privacy challenges


Big data is usually regarded as a menace to data privacy. But with data privacy principles and a customer-first mindset, it can be a game changer. Maurício Lins and Lidia Crespo explain how Santander UK applied this model to comply with GDPR, using graph technology, Hadoop, Spark, and Kudu to drive data obscuring, data portability, and machine learning exploration.

Traditionally, Santander UK worked with static systems, where an in-depth one-off analysis was performed as a project. However, the variety of the data in its big data platform and the speed of growth pushed the company to look for new models and technologies to overcome the challenge of data privacy.

In an environment where new data is loaded every day, making sure all data is catalogued and the inventory is up to date when rules or scope changes required a new approach to metadata management. Configurable and flexible solutions are one of the key investments for the privacy solution. In addition, the processing power necessary to scan all the data lake contents daily to identify personal information and the complexity of a self-service model with multiple silos and governance paths presented new technology challenges for the GDPR teams.

Maurício and Lidia explore some of these challenges and outline the architecture and technology Santander UK implemented to make sure its customer data is identified, protected, and provided to customers adequately.

Photo of Maurício Lins

Maurício Lins

everis NTT DATA UK

Mauricio Lins is a Data & Analytics Technical Manager at everis NTT DATA UK, where he is currently responsible for the AI and ML initiatives coordinating the integration between solutions with other offices around Europe and America.

He has 15 years of experience in the IT universe, mostly with application development and data platforms. In the last 5 years he has been working with Big Data implementation with a big variety of architectures and business segments. He is also a develper certified by Cloudera as a Spark specialist.

He had some works published in the CIACA Conference in Portugal around the Big Data area in the streaming and batch architectures.

Photo of Lidia Crespo

Lidia Crespo

Santander UK

Lidia Crespo is a technical business manager on the CDO team at Santander UK, where she leads the company’s big data governance activities. She and her team have been instrumental in the adoption of the technology platform by creating a sense of trust and with their deep knowledge of the data of the organization. With her experience in complex and challenging international projection projects and a background in audits, IT, and data, Lidia brings a combination difficult to find.