JupyterCon brings together data scientists with a vision of a shared environment for interactive computing and reproducible research. What if we could expand this vision to include computational infrastructure and data itself? There are significant challenges in making this vision a reality.
Wes McKinney makes the case for a shared infrastructure for data science, discusses the open source community’s efforts on Apache Arrow, and offers a vision for seamless computation and data sharing across languages.
Wes McKinney is a software architect at Two Sigma Investments. He is the creator of Python’s pandas library and a PMC member for Apache Arrow and Apache Parquet. He wrote the book Python for Data Analysis. Previously, Wes worked for Cloudera and was the founder and CEO of DataPad.
©2017, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org