Brought to you by NumFOCUS Foundation and O’Reilly Media Inc.
The official Jupyter Conference
August 22-23, 2017: Training
August 23-25, 2017: Tutorials & Conference
New York, NY

Accelerating data driven culture at the largest media group of Latin America with Jupyter

Diogo Munaro Vieira (globo.com), Felipe Ferreira (globo.com)
11:05am–11:45am Friday, August 25, 2017
Usage and application
Location: Nassau Level: Beginner

Who is this presentation for?

Developers, Data Scientists, Managers

Prerequisite knowledge

Know how to use Jupyter Notebook Know basic about Spark

What you'll learn

JupyterHub Configuration, JupyterHub case, Monitoring Spark Jobs at Jupyter, Basics about Spark

Description

Nowadays, to work with big data and to configure environments for all applications is very difficult, but using Jupyter Notebooks combined with JupyterHub our data scientists can analyze company products metrics, recommendation systems results, AB test results and build their own machine learning models. All of studies are made with support of login (oAuth2), Spark jobs monitoring and the ability to work with Python, R, PySpark or SparkR without installation and configuration issues within a common data science platform. This platform is safe and empower data scientists and managers on their research and data driven business choices. Running on 2 machines, each with 32 CPU cores and 125GB of memory, our data science platform is supported by Jupyterhub and has been used for hundreds of data scientists to filter billions of metric events a day. We will show how Jupyter is changing the culture of Globo.com – the largest media group of Latin America and second largest television group of the world – allowing data scientists to research our users and creating a great impact on our business.

Photo of Diogo Munaro Vieira

Diogo Munaro Vieira

globo.com

Bachelor’s in Biological Science – Biophysics (Bioinformatic) from UFRJ (2011) and master’s at Artificial Intelligence from PPGIUFRJ (2014). Has experience in Computer Science, acting on WEB Development, P2P Network, Collaborative Systems, Recommendation Systems, Open Source, BI (Business Inteligence) and loves Big Data.

Photo of Felipe Ferreira

Felipe Ferreira

globo.com

Analytical, performance focused engineer with over 12 years experience in enterprise systems development and architectural design using JEE technology. Specialized in Big Data platform analytics using Hadoop and associated ecosystem tools. Exceptional technology skills combined with ability to drive user-centric solutions, define strategy and lead data management.

Leave a Comment or Question

Help us make this conference the best it can be for you. Have questions you'd like this speaker to address? Suggestions for issues that deserve extra attention? Feedback that you'd like to share with the speaker and other attendees?

Join the conversation here (requires login)