Presented By O’Reilly and Cloudera
Make Data Work
September 11, 2018: Training & Tutorials
September 12–13, 2018: Keynotes & Sessions
New York, NY
Toru Sasaki

Toru Sasaki
System Infrastructure Engineer, NTT DATA Corporation

Toru Sasaki is a system infrastructure engineer and leads OSS professional services team at NTT Data Corporation.
He is interested in open-source distributed computing systems, such as Apache Hadoop, Apache Spark and Apache Kafka.
He has designed and developed many clusters utilizing these products to solve his customer’s problems.
He is a co-author of one of the famous Apache Spark book written in Japanese.

Sessions

4:20pm–5:00pm Thursday, 09/13/2018
Location: 1A 23/24 Level: Beginner
Secondary topics:  Data Integration and Data Pipelines
Kenji Hayashida (Recruit Lifestyle co., ltd.), Toru Sasaki (NTT DATA Corporation)
Recruit Group and NTT DATA Corporation developed a platform based on "Datahub" utilizing Apache Kafka. This platform should handle around 1TB/day application logs generated by a lot of services in Recruit Group. Some of the best practices and know-hows, such as schema evolution and network architecture, learned during this project are explained in this session. Read more.