Presented By O'Reilly and Cloudera
Make Data Work
September 26–27, 2016: Training
September 27–29, 2016: Tutorials & Conference
New York, NY

Gaining extreme agility and performance using a Spark-free approach to data management

Jake Dolezal (McKnight Consulting Group Global Services)
4:35pm–5:15pm Wednesday, 09/28/2016
Sponsored
Location: 1B 03/04
Average rating: *****
(5.00, 1 rating)

What you'll learn

  • Learn an effective method for democratizing data access and business intelligence
  • Understand what it takes to break through the traditional trade-offs in managing big data and achieve both agility and performance without the use of code-based languages like Spark or MapReduce
  • Discover how to achieve performance in Hadoop that is 5.5x faster than Spark and 19x faster than MapReduce
  • Description

    Businesses are clamoring to capture all data possible and harness it as a revenue driver. The challenge is bringing the data together. Companies that can capture and harness this data can benefit accordingly.

    When it comes to data management in Hadoop, the architecture foundation makes all the difference for performance. Jake Dolezal shares research into the performance of data quality and data management workloads on Hadoop clusters. Jake discusses a YARN-based approach to data management and outlines highly effective IT resource utilization techniques to achieve extreme agility for organizations and performance gains in Hadoop.

    Topics include:

    • Managing complex, high-volume data with identity and entity resolution in the most demanding applications, such as customer data quality
    • Device and entity matching in IoT data
    • Pure YARN integration
    • How to leverage MDM capabilities to ensure precise data via business user data access and management

    This session is sponsored by RedPoint Global.

    Photo of Jake Dolezal

    Jake Dolezal

    McKnight Consulting Group Global Services

    Jake Dolezal is a practice lead for McKnight Consulting Group Global Services. Jake has over 17 years of experience in information management, with expertise in business intelligence, analytics, data warehousing, statistics, data modeling and integration, data visualization, master data management, and data quality across a broad array of industries, including healthcare, education, government, manufacturing, engineering, hospitality, and gaming. Previously, Jake was the senior director of information management at the Choctaw Nation of Oklahoma—the third-largest Native American tribe in the United States, with over 200,000 members worldwide—where he championed and developed an enterprise-wide information management initiative from the ground up across the organization’s commercial, government, healthcare, social service, and education divisions. He was also involved with the organization’s core CRM and ERP systems. Jake is the author of two books due to be published this year. He holds a PhD in information management from Syracuse University and is a certified business intelligence professional through TDWI with an emphasis in data analysis. He is also a certified leadership coach and has helped clients accelerate their careers and earn several executive promotions.