Applying natural language processing in practice is nontrivial for two reasons. First, human language is nuanced, fuzzy, and highly contextual, so most tasks require training domain-specific models. Second, NLP is usually just one part of a larger machine learning or information retrieval pipeline that addresses a real business use case. Putting together a complete, scalable, performant, measurable, and reproducible pipeline traditionally requires significant engineering compromises.
David Talby and Santosh Kulkarni explain how Kaiser Permanente uses the open source NLP library for Apache Spark to tackle one of the most common challenges in applying natural language processing in practice: integrating domain-specific NLP into a scalable, performant, measurable, and reproducible machine learning pipeline. Kaiser Permanente applies this approach to improve the accuracy of forecasting the demand for hospital beds. Accurate forecasting is critical to ensuring that enough beds and nurses are available to take care of incoming patients. While some predictive features are structured, many relevant features are locked in free-text clinical notes. Along the way, David and Santosh explain how Kaiser Permanente’s systems meet the highest standards of robustness, scale, and compliance.
David Talby is a chief technology officer at Pacific AI, helping fast-growing companies apply big data and data science techniques to solve real-world problems in healthcare, life science, and related fields. David has extensive experience in building and operating web-scale data science and business platforms, as well as building world-class, Agile, distributed teams. Previously, he was with Microsoft’s Bing Group, where he led business operations for Bing Shopping in the US and Europe, and worked at Amazon in both Seattle and the UK, where he built and ran distributed teams that helped scale Amazon’s financial systems. David holds a PhD in computer science and master’s degrees in both computer science and business administration.
Santosh Kulkarni is a product leader at Kaiser Permanente, where he is responsible for driving the development of its intelligence platform systems. Santosh has a deep passion for healthcare and technology and has been an active member of many of the industry forums that have shaped healthcare in recent years. An experienced healthcare thought leader, Santosh has advised and supported some of the top global healthcare players in defining and building next-generation healthcare products and solutions. Previously, he spent more than a decade providing strategic, product, and digital transformation consulting and services to healthcare organizations, with a focus on digital health and consumer and population health management, and was part of the initial architecture team that built Siemens’s flagship EHR platform, Soarian. Santosh holds a master’s degree in business administration and a bachelor’s degree in computer science and engineering.
©2018, O'Reilly Media, Inc.