Although it was invented decades ago, the Unix command line is an amazing environment for efficiently performing tedious but essential data science tasks. By combining small, powerful command-line tools (like parallel, jq, and csvkit), you can quickly scrub and explore your data and hack together prototypes.
Join Jeroen Janssens for a hands-on workshop based on his book Data Science at the Command Line. Using a real-world use case, you’ll learn how to build fast data pipelines, leverage R and Python at the command line, and quickly visualize and model data.
You’ll leave with a solid understanding of how to integrate the command line in your data science workflow. Even if you’re already comfortable processing data with R or Python, the ability to leverage the power of the command line will make you a more effective and efficient data scientist.
Jeroen Janssens is the founder, CEO, and an instructor of Data Science Workshops, which provides on-the-job training and coaching in data visualization, machine learning, and programming. Previously, he was an assistant professor at Jheronimus Academy of Data Science and a data scientist at Elsevier in Amsterdam and startups YPlan and Outbrain in New York City. He’s the author of Data Science at the Command Line (O’Reilly). Jeroen holds a PhD in machine learning from Tilburg University and an MSc in artificial intelligence from Maastricht University.
For exhibition and sponsorship opportunities, email strataconf@oreilly.com
For information on trade opportunities with O'Reilly conferences, email partners@oreilly.com
View a complete list of Strata Data Conference contacts
©2018, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com