Businesses are clamoring to capture all data possible and harness it as a revenue driver. The challenge is bringing the data together. Companies that can capture and harness this data can benefit accordingly.
When it comes to data management in Hadoop, the architecture foundation makes all the difference for performance. Jake Dolezal shares research into the performance of data quality and data management workloads on Hadoop clusters. Jake discusses a YARN-based approach to data management and outlines highly effective IT resource utilization techniques to achieve extreme agility for organizations and performance gains in Hadoop.
This session is sponsored by RedPoint Global.
Jake Dolezal is a practice lead for McKnight Consulting Group Global Services. Jake has over 17 years of experience in information management, with expertise in business intelligence, analytics, data warehousing, statistics, data modeling and integration, data visualization, master data management, and data quality across a broad array of industries, including healthcare, education, government, manufacturing, engineering, hospitality, and gaming. Previously, Jake was the senior director of information management at the Choctaw Nation of Oklahoma—the third-largest Native American tribe in the United States, with over 200,000 members worldwide—where he championed and developed an enterprise-wide information management initiative from the ground up across the organization’s commercial, government, healthcare, social service, and education divisions. He was also involved with the organization’s core CRM and ERP systems. Jake is the author of two books due to be published this year. He holds a PhD in information management from Syracuse University and is a certified business intelligence professional through TDWI with an emphasis in data analysis. He is also a certified leadership coach and has helped clients accelerate their careers and earn several executive promotions.
©2016, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.