There tends to be a lot of talk about enabling next-generation use cases and advanced analytics with big data technologies such as Hadoop. While those applications should play a role in any enterprise’s data strategy, much of the initial value of a modern data platform often comes from simply offloading the analytic workloads you’re running today. But where do you start when developing an off-load strategy? Which workloads are better suited for Hadoop? And how do you off-load complex analytic workloads?
At Visa, optimizing the enterprise data warehouse and consolidating data marts by migrating these analytic workloads to Hadoop has played a key role both in the adoption of the platform and in how data has transformed Visa as an organization. Nandu Jayakumar and Ewa Ding share Visa’s journey along with some best practices for organizations migrating workloads to Hadoop.
In particular, Nandu and Ewa discuss Visa’s approach to bringing Hadoop into its business, its off-load strategy, and the lessons learned from the experience. They also explain how Visa is using new technologies to better understand the legacy workloads running today and to prioritize migrations that continue to reduce the cost of and pressure on these systems, and they offer a glimpse at what’s in store for the future.
Nandu Jayakumar is a software architect and engineering leader at Visa, where he is currently responsible for the long-term architecture of data systems and leads the data platform development organization. Previously, as a senior leader of Yahoo’s well-regarded data team, Nandu built key pieces of Yahoo’s data processing tools and platforms over several iterations, which were used to improve user engagement on Yahoo websites and mobile apps. He also designed large-scale advertising systems and contributed code to Shark (SQL on Spark) during his time there. Nandu holds a bachelor’s degree in electronics engineering from Bangalore University and a master’s degree in computer science from Stanford University, where he focused on databases and distributed systems.
Justin Erickson is a senior director of product management leading Cloudera’s platform team, which is responsible for the components in Cloudera Distribution including Hadoop (CDH) above storage. Prior to joining Cloudera, he led the high-availability and disaster-recovery areas of Microsoft SQL Server.
©2017, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org