Presented By O'Reilly and Cloudera
Make Data Work
March 13–14, 2017: Training
March 14–16, 2017: Tutorials & Conference
San Jose, CA
Edgar Ruiz

Solutions Engineer, RStudio

@theotheredgar

Edgar Ruiz is a solutions engineer at RStudio with a background in deploying enterprise reporting and business intelligence solutions. He is the author of multiple articles and blog posts that share analytics insights and cover server infrastructure for data science. Recently, Edgar authored the “Data Science on Spark using sparklyr” cheat sheet.

Sessions

2:40pm–3:20pm Wednesday, March 15, 2017
Spark & beyond
Location: LL21 C/D
Level: Beginner
Secondary topics: R
Edgar Ruiz (RStudio)
Average rating: 4.80 (5 ratings)
Sparklyr makes it easy and practical to analyze big data with R: you can filter and aggregate Spark DataFrames to bring data into R for analysis and visualization, and you can use R to orchestrate distributed machine learning in Spark using Spark ML and H2O Sparkling Water. Edgar Ruiz walks you through these features and demonstrates how to use sparklyr to create R functions that access the full Spark API.