Brought to you by NumFOCUS Foundation and O’Reilly Media Inc.
The official Jupyter Conference
August 22-23, 2017: Training
August 23-25, 2017: Tutorials & Conference
New York, NY

Accelerating data-driven culture at the largest media group in Latin America with Jupyter

Diogo Munaro Vieira (Globo.com), Felipe Ferreira (Globo.com)
11:05am–11:45am Friday, August 25, 2017
Usage and application
Location: Nassau Level: Beginner

Who is this presentation for?

  • Developers, data scientists, and managers

Prerequisite knowledge

  • A basic understanding of the Jupyter Notebook and Spark

What you'll learn

  • Learn how Globo.com uses Jupyter and JupyterHub

Description

JupyterHub is an important tool for research and data-driven decisions at Globo.com. Diogo Munaro Vieira and Felipe Ferreira explain how data scientists at Globo.com—the largest media group in Latin America and second largest television group in the world—use Jupyter notebooks for data analysis and machine learning, making decisions that impact 50 million users per month.

By using Jupyter notebooks combined with JupyterHub, Globo.com’s data scientists can analyze company products metrics, recommendation systems results, and A/B test results and build their own machine learning models. All of the studies support OAuth 2, Spark jobs monitoring, Python, R, PySpark, and SparkR without installation and configuration issues. Globo.com’s data science platform empowers data scientists and managers. Running on two machines, each with 32 CPU cores and 125 GB of memory, the platform is supported by JupyterHub and is used to filter billions of metric events a day.

Photo of Diogo Munaro Vieira

Diogo Munaro Vieira

Globo.com

Diogo Munaro Vieira is a big data engineer at Globo.com. He is experienced in web development, P2P networking, collaborative systems, recommendation systems, open source software, and business intelligence. Diogo holds a bachelor’s degree in biological science and bioinformatics and a master’s degree in artificial intelligence from Universidade Federal do Rio de Janeiro.

Photo of Felipe Ferreira

Felipe Ferreira

Globo.com

Felipe Ferreira is a big data engineer at Globo.com, where he focuses on big data platform analytics using Hadoop and associated ecosystem tools. Felipe is an analytical, performance-focused engineer with over 12 years of experience in enterprise systems development and architectural design using JEE technology combined with the ability to drive user-centric solutions, define strategy, and lead data management.

Leave a Comment or Question

Help us make this conference the best it can be for you. Have questions you'd like this speaker to address? Suggestions for issues that deserve extra attention? Feedback that you'd like to share with the speaker and other attendees?

Join the conversation here (requires login)

Comments

Picture of Felipe Ferreira
Felipe Ferreira | BIG DATA ENGINEER
09/01/2017 5:17pm EDT

An updated version of the slides is available at http://bit.ly/2ewtpmN

Picture of Felipe Ferreira
Felipe Ferreira | BIG DATA ENGINEER
08/30/2017 6:28pm EDT

Slides are here: https://we.tl/EKgBqkOiHc