Presented By O'Reilly and Cloudera
Make Data Work
22–23 May 2017: Training
23–25 May 2017: Tutorials & Conference
London, UK

Continuous analytics: Integrating the data hub in a DevOps pipeline

Arturo Bayo (Synergic Partners), Alvaro Fernandez Velando (Santander Spain)
16:35–17:15 Thursday, 25 May 2017
Level: Intermediate
Average rating: 4.50 (6 ratings)

Who is this presentation for?

  • C-level technology executives, data scientists, and big data architects and engineers

Prerequisite knowledge

  • Intermediate knowledge of big data architecture

What you'll learn

  • Learn how to use big data technologies and resources that are already established in the enterprise to implement a data hub that supports production use cases
  • Understand how to manage the application lifecycle effectively with a DevOps-oriented pipeline
  • Learn how to use predictive analytics to estimate the resources that will be needed in the future
  • Explore a production business case from Santander Spain

Description

Most organizations have already implemented a data-driven strategy or are planning to invest in one. However, these organizations often struggle to design an effective and efficient infrastructure to support use cases and derive business value through data. As a result, many organizations focus on the business case and build monolithic architectures that isolate their data from the rest of their applications, giving rise to siloed structures and adding complexity to the application lifecycle.

Arturo Bayo and Alvaro Fernandez Velando explain how a data hub strategy helps clarify data sharing and governance in an organization and share one way to implement a data hub architecture using big data technology and resources that are already established in the enterprise. They then outline the design of a DevOps pipeline built on top of Docker containers capable of delivering continuous analytics through application lifecycle management tools. They conclude by exploring an analytic business case in production at Santander Spain that generates client networks connected by data stored in the data hub.
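The abstract does not say how the client networks are built. As a rough illustration only, the sketch below assumes that two clients are linked whenever they share an attribute value (for example, a joint account), and then groups linked clients into networks. The record layout, the identifiers, and the function names `build_client_network` and `connected_groups` are all hypothetical and are not taken from the Santander use case.

```python
from collections import defaultdict

# Hypothetical records: (client_id, shared_attribute).
# In this sketch an attribute could be a joint account or a contact number.
records = [
    ("c1", "acct-100"),
    ("c2", "acct-100"),
    ("c2", "phone-555"),
    ("c3", "phone-555"),
    ("c4", "acct-200"),
]


def build_client_network(records):
    """Link two clients whenever they share an attribute value (assumed rule)."""
    clients_by_attr = defaultdict(set)
    for client, attr in records:
        clients_by_attr[attr].add(client)

    graph = defaultdict(set)
    for client, _ in records:
        graph[client]  # make sure clients with no links still appear
    for members in clients_by_attr.values():
        for client in members:
            graph[client] |= members - {client}
    return dict(graph)


def connected_groups(graph):
    """Return the connected components of the network as sorted lists."""
    seen = set()
    groups = []
    for start in sorted(graph):
        if start in seen:
            continue
        stack, group = [start], set()
        while stack:
            node = stack.pop()
            if node in seen:
                continue
            seen.add(node)
            group.add(node)
            stack.extend(graph[node] - seen)
        groups.append(sorted(group))
    return groups


network = build_client_network(records)
groups = connected_groups(network)
```

With the sample records, `c1`, `c2`, and `c3` end up in one network through the shared account and phone number, while `c4` stays on its own. A production version would read from the data hub and would likely run on a distributed engine such as Spark rather than in-memory Python.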

Arturo Bayo

Synergic Partners

Arturo Bayo is a team leader and senior data engineer at Synergic Partners, where he specializes in banking and finance projects. He has broad knowledge of database administration (SQL, MongoDB, and Cassandra) and big data (Hadoop, R, Hive, and Spark). Arturo holds a bachelor of science degree in computer engineering from the Universidad Autónoma de Madrid and a bachelor of business administration (BBA) from UNED.

Alvaro Fernandez Velando

Santander Spain

Alvaro Fernandez Velando is a chemical engineer with postgraduate training in data science, artificial intelligence, and robotics. He has 20 years of experience in the financial sector at several leading banks (BBVA, HSBC, La Caixa, and Santander). He joined Santander in 2007 as CRM director and was recently named chief risk data officer. He is responsible for methodology and modeling, big data in risk, and risk management information.

Comments

Andrzej Jędrzejewski | WEBOPS ENGINEER
29/05/2017 13:01 BST

Hi Guys,

Could you upload your slides, please?