- 我们会介绍微软R服务器的设计原则和架构，以及它和Apache Spark的集成。
- 演示如何使用R服务器来进行在Apache Spark上的可扩展的机器学习，以及使用R语言来分析T字节级数据。
R is a popular data science tool for data analysis. However, it has many drawbacks, such as its memory utilization and single-thread design, that limit its usage for big data analysis. Xiaoyong Zhu explains how to use R to analyze terabytes of data, covering the design principles and the architecture of Microsoft R Server and its integration with Apache Spark and leading a demo on how to utilize it to perform scalable machine learning on top of Apache Spark.
Xiaoyong Zhu is a program manager at Microsoft focusing on scalable machine learning and advanced analytics.
For exhibition and sponsorship opportunities, email firstname.lastname@example.org
For information on trade opportunities with O'Reilly conferences, email email@example.com
View a complete list of Strata Data Conference contacts Strata Data Conference contacts
©2017, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org