Presented By O’Reilly and Cloudera
Make Data Work
March 5–6, 2018: Training
March 6–8, 2018: Tutorials & Conference
San Jose, CA
David Talby

David Talby
CTO, Pacific AI

Website | @davidtalby

David Talby is a chief technology officer at Pacific AI, helping fast-growing companies apply big data and data science techniques to solve real-world problems in healthcare, life science, and related fields. David has extensive experience in building and operating web-scale data science and business platforms, as well as building world-class, Agile, distributed teams. Previously, he was with Microsoft’s Bing Group, where he led business operations for Bing Shopping in the US and Europe, and worked at Amazon both in Seattle and the UK, where he built and ran distributed teams that helped scale Amazon’s financial systems. David holds a PhD in computer science and master’s degrees in both computer science and business administration.

Sessions

1:30pm5:00pm Tuesday, March 6, 2018
Data science and machine learning
Location: LL20 C Level: Intermediate
David Talby (Pacific AI), Claudiu Branzan (G2 Web Services), Alexander Thomas (Indeed)
Average rating: *****
(5.00, 1 rating)
Natural language processing is a key component in many data science systems. David Talby, Claudiu Branzan, and Alex Thomas lead a hands-on tutorial on scalable NLP, using spaCy for building annotation pipelines, Spark NLP for building distributed natural language machine-learned pipelines, and Spark ML and TensorFlow for using deep learning to build and apply word embeddings. Read more.
11:50am12:30pm Wednesday, March 7, 2018
Data science and machine learning
Location: Expo Hall 1 Level: Intermediate
Secondary topics:  Expo Hall
David Talby (Pacific AI), Santosh Kulkarni (Kaiser Permanente)
Average rating: ***..
(3.50, 2 ratings)
David Talby and Santosh Kulkarni explain how Kaiser Permanente uses the open source NLP library for Apache Spark to tackle one of the most common challenges with applying natural language process in practice: integrating domain-specific NLP as part of a scalable, performant, measurable, and reproducible machine learning pipeline. Read more.
11:50am12:30pm Thursday, March 8, 2018
Data-driven business management, Strata Business Summit
Location: 210 A/E Level: Intermediate
David Talby (Pacific AI)
Average rating: ***..
(3.50, 4 ratings)
Machine learning and data science systems often fail in production in unexpected ways. David Talby shares real-world case studies showing why this happens and explains what you can do about it, covering best practices and lessons learned from a decade of experience building and operating such systems at Fortune 500 companies across several industries. Read more.