As healthcare data becomes more digitized, the opportunity to leverage electronic medical records, prescription data, medical billings, hospital, and other healthcare datasets to help improve health outcomes and lower the cost of care for patients in near real time is becoming a possibility. However, processing terabytes and petabytes of de-identified healthcare data requires the application of complex and ever-changing business rules. This impacts the ability to generate near-real-time insights and conduct research studies that could potentially influence how patients are treated.
Today, the analysis of databases of this magnitude can take days or even weeks of processing; to be more effective for improving patient care, researchers need to be able to run processes on demand, returning result sets instantaneously. Navdeep Alam shares his experience at IMS Health in realizing this opportunity to influence patient health outcomes in minutes to seconds and reviews current and emerging technologies in the marketplace that handle working with unbounded, de-identified patient datasets in the billions of rows in an efficient and scalable way.
Navdeep (Nav) Alam brings more than 15 years of experience in software engineering, databases, data warehousing, analytics, architecture, and development to his role as the director of global data warehousing at IMS Health, where he is charged with managing the global data warehousing organization as a center of excellence and defining and executing its future roadmap, which includes next-generation massive parallel processing (MPP), low-latency data warehousing systems. Nav is also a graduate teaching assistant at Boston University, where he assists in teaching graduate students on enterprise computing, advanced databases, data mining, and business intelligence.
Previously, Nav was the director of analytics and prediction for Empirix, where he led a global team in the architecture and development of its next-generation analytics platform, IntelliSight; director of data architecture for Mzinga’s social intelligence applications as part of its OmniSocial SaaS platform; principal software engineer for KnowledgePlanet, Mzinga’s predecessor, where he was the principal architect and developer of Firefly Simulation Developer; in the Information Technology and Application Support group at Calgary’s Nova Chemical Research and Technology Center, where he provided Y2K support and developed an interactive laboratory information management system web tutorial application; and a Unix administrator managing the Oracle data systems for Syncrude Canada and EI Processing, building filtering algorithms to scrub noise for seismic data processing. Nav holds an MS in computer science from Boston University and a bachelor’s degree in computer science from the University of Calgary.
©2016, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.