Presented By O'Reilly and Cloudera
Make Data Work
September 26–27, 2016: Training
September 27–29, 2016: Tutorials & Conference
New York, NY

Successful open data science on Hadoop: From sandbox to production

Peter Wang (Anaconda)
4:35pm–5:15pm Wednesday, 09/28/2016
Sponsored
Location: 1 E 09

What you'll learn

  • Explore the next generation of hardware and cloud topologies
  • Learn how Anaconda, an open data science platform, continuously incorporates the latest innovations data scientists need for their work while preserving IT's ability to manage and operate the production environment

Description

    More and more frequently, owners of Hadoop deployments find themselves facing the challenge of supporting data science ecosystems like Python and R, both adjacent to and within their Hadoop infrastructure. Although these technologies promise powerful data science insights, they can also be complex to manage and deploy. As people build out data science sandboxes and production environments, they discover a number of challenges ranging from basic package management and data lineage to reproducibility and governance of data science artifacts.

    Peter Wang distills the vast array of Hadoop and data science tools and architectures down to the essentials that deliver a powerful and lightweight stack quickly so that you can accelerate time to value while meeting your data science, governance, and IT needs. Throughout the discussion, Peter highlights challenges and best practices drawn from real-world customer use cases.

    Topics include:

    • The next generation of hardware and cloud topologies
    • How Anaconda, an open data science platform, continuously incorporates the latest innovations data scientists need for their work while preserving IT's ability to manage and operate the production environment

    This session is sponsored by Continuum Analytics.

    Peter Wang

    Anaconda

    Peter Wang is the cofounder and CTO of Anaconda, where he leads the product engineering team for the Anaconda platform and open source projects including Bokeh and Blaze. Peter’s been developing commercial scientific computing and visualization software for over 15 years and has software design and development experience across a broad variety of areas, including 3-D graphics, geophysics, financial risk modeling, large data simulation and visualization, and medical imaging. As a creator of the PyData conference, he also devotes time and energy to growing the Python data community by advocating, teaching, and speaking about Python at conferences worldwide. Peter holds a BA in physics from Cornell University.