Big data is usually regarded as a menace to data privacy. But with data privacy principles and a customer-first mindset, it can be a game changer. Maurício Lins and Lidia Crespo explain how Santander UK applied this model to comply with GDPR, using graph technology, Hadoop, Spark, and Kudu to drive data obscuring, data portability, and machine learning exploration.
Traditionally, Santander UK worked with static systems, where an in-depth one-off analysis was performed as a project. However, the variety of the data in its big data platform and the speed of growth pushed the company to look for new models and technologies to overcome the challenge of data privacy.
In an environment where new data is loaded every day, keeping all data catalogued and the inventory up to date whenever rules or scope change required a new approach to metadata management. Configurable, flexible solutions were one of the key investments for the privacy solution. In addition, the processing power needed to scan the entire data lake daily to identify personal information, together with the complexity of a self-service model with multiple silos and governance paths, presented new technology challenges for the GDPR teams.
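To make the idea of a configurable daily scan for personal information concrete, here is a minimal sketch in plain Python. It is not Santander UK's implementation: the rule names, the regular expressions, the 0.8 threshold, and the functions `classify_column` and `scan_table` are all hypothetical, and a production scanner would run distributed (for example on Spark) with a much richer rule set.

```python
import re

# Hypothetical, configurable detection rules (assumptions for illustration only).
PII_PATTERNS = {
    "email": re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$"),
    "uk_postcode": re.compile(r"^[A-Z]{1,2}\d[A-Z\d]?\s*\d[A-Z]{2}$", re.IGNORECASE),
}


def classify_column(values, threshold=0.8):
    """Return the rule names matched by at least `threshold` of the non-empty values."""
    non_empty = [str(v).strip() for v in values if v is not None and str(v).strip()]
    if not non_empty:
        return []
    hits = []
    for name, pattern in PII_PATTERNS.items():
        matched = sum(1 for v in non_empty if pattern.match(v))
        if matched / len(non_empty) >= threshold:
            hits.append(name)
    return hits


def scan_table(rows, threshold=0.8):
    """Scan a list of row dicts and return {column: [rule names]} for flagged columns."""
    columns = {}
    for row in rows:
        for col, value in row.items():
            columns.setdefault(col, []).append(value)
    flagged = {}
    for col, values in columns.items():
        hits = classify_column(values, threshold)
        if hits:
            flagged[col] = hits
    return flagged
```

Because the rules live in a single dictionary, a change in rules or scope only requires editing `PII_PATTERNS` and rescanning, which is the kind of flexibility the paragraph above describes.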
Maurício and Lidia explore some of these challenges and outline the architecture and technology Santander UK implemented to make sure its customer data is identified, protected, and provided to customers adequately.
Maurício Lins is a Data & Analytics Technical Manager at everis NTT DATA UK, where he is currently responsible for AI and ML initiatives, coordinating the integration of solutions with other offices around Europe and America.
He has 15 years of experience in IT, mostly in application development and data platforms. For the last 5 years he has worked on big data implementations across a wide variety of architectures and business segments. He is also a developer certified by Cloudera as a Spark specialist.
He has published work on big data streaming and batch architectures at the CIACA conference in Portugal.
Lidia Crespo is a technical business manager on the CDO team at Santander UK, where she leads the company’s big data governance activities. She and her team have been instrumental in the adoption of the technology platform, creating a sense of trust and drawing on their deep knowledge of the organization’s data. With her experience in complex and challenging international projects and a background in audits, IT, and data, Lidia brings a rare combination of skills.
©2019, O’Reilly UK Ltd • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com