Data science is a field where the expectations are high but the guidance around how to deliver “data science impact” can be low. What are the most important projects for a data science team to work on? How can people with technical context and people with business context bring their priorities together in a common discussion? How does a data science team ensure that all team members get their best ideas heard by the organization?
Civis Analytics’s data science research and development team consists of data scientists working on a wide variety of data science software and consulting tasks. One challenge the team confronted together (as it tripled in size) was how to prioritize projects in the company’s data science portfolio. Collectively, the team has better ideas and more experience than any single member alone, which suggests a bottom-up approach to sourcing project ideas. However, the team found that delivering a few high-quality, high-impact deliverables is better for the organization than lots of smaller, disorganized projects, which invites a more top-down approach.
Katie Malone and Skipper Seabold share a framework and best practices for quickly and collaboratively proposing, discussing, selecting, and managing high-impact data science projects.
Katie Malone is director of data science at data science software and services company Civis Analytics, where she leads a team of diverse data scientists who serve as technical and methodological advisors to the Civis consulting team and write the core machine learning and data science software that underpins the Civis Data Science Platform. Previously, she worked at CERN on Higgs boson searches and was the instructor of Udacity’s Introduction to Machine Learning course. Katie hosts Linear Digressions, a weekly podcast on data science and machine learning. She holds a PhD in physics from Stanford.
Skipper is Director of Data Science R&D and a Product Lead at Civis Analytics in Chicago. He leads a team of data scientists from all walks of life from physicists and biologists to statisticians and computer scientists. Together they drive the data science behind the products Civis offers and push the capabilities of solutions that Civis provides to its clients. He is an economist by training and has a decade of experience working in the Python data open source community. He started and led the statsmodels Python project, was formerly on the core pandas team, and has contributed to many projects in Python data stack. He holds strong opinions about writing and barbecue.
©2018, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org