Presented By O'Reilly and Cloudera
Make Data Work
Dec 4–5, 2017: Training
Dec 5–7, 2017: Tutorials & Conference
Singapore

Making R go faster and bigger

Jared Lander (Lander Analytics)
12:05pm12:45pm Wednesday, December 6, 2017
Average rating: ***..
(3.33, 3 ratings)

Who is this presentation for?

  • Data scientists, engineers, and analysts

Prerequisite knowledge

  • A basic understanding of R, split-apply-combine, data frames, and databases

What you'll learn

  • Learn how to use R for high-performance computing while ensuring speed and scale

Description

R is mistakenly seen as merely a prototyping tool when in reality more and more companies are using R in production. One common (but false) knock against R is that it doesn’t scale well. Jared Lander shows how to use R in a performant matter both in terms of speed and data size and offers an overview of packages for running R at scale. Jared covers three keys areas for improving R’s computing capabilities: smarter computing (dplyr, data.table, Rcpp), parallel programming (foreach, multidplyr), and external memory (bigmemory, ff, dbplyr, sparklyr). Each allows for faster and bigger computing in R.

Photo of Jared Lander

Jared Lander

Lander Analytics

Jared P. Lander is chief data scientist of Lander Analytics, where he oversees the long-term direction of the company and researches the best strategy, models, and algorithms for modern data needs. He specializes in data management, multilevel models, machine learning, generalized linear models, data management, visualization, and statistical computing. In addition to his client-facing consulting and training, Jared is an adjunct professor of statistics at Columbia University and the organizer of the New York Open Statistical Programming Meetup and the New York R Conference. He is the author of R for Everyone, a book about R programming geared toward data scientists and nonstatisticians alike. Very active in the data community, Jared is a frequent speaker at conferences, universities, and meetups around the world and was a member of the 2014 Strata New York selection committee. His writings on statistics can be found at Jaredlander.com. He was recently featured in the Wall Street Journal for his work with the Minnesota Vikings during the 2015 NFL Draft. Jared holds a master’s degree in statistics from Columbia University and a bachelor’s degree in mathematics from Muhlenberg College.