Presented By O’Reilly and Cloudera
Make Data Work
21–22 May 2018: Training
22–24 May 2018: Tutorials & Conference
London, UK

Cloud-native data science with Anaconda, Docker, and Kubernetes (sponsored by Anaconda)

Mathew Lodge (Anaconda)
12:05–12:45 Wednesday, 23 May 2018
Location: Capital Suite 4
Average rating: 4.50 (4 ratings)

What you'll learn

  • Learn how to do cloud-native data science with Anaconda, Docker, and Kubernetes


Big data architectures like Hadoop and Spark solve the distributed database problem well, but they treat it as an article of faith that moving compute closer to data is important for performance. They also assume your code is written in Java or another JVM-based language like Scala.

The big problem? First, data science, predictive analytics, and ML don’t happen in JVM-based languages. They happen in Python, R, and, to a lesser extent, C/C++. Second, today’s data center networks have 1,000x the bandwidth at a lower total cost than in 2005, when Hadoop was first conceived, so data locality matters much less. Lastly, all the major players (AWS, Microsoft, Google, IBM, Red Hat, and Docker) are lined up behind Kubernetes. Containers and Kubernetes make great language-agnostic distributed computing clusters.

Mathew Lodge demonstrates that it’s just as easy to deploy Python as it is Java, walking you through doing cloud-native data science with Anaconda, Docker, and Kubernetes. Welcome to the future.

This session is sponsored by Anaconda.

Mathew Lodge


Mathew Lodge is senior vice president of product and marketing at Anaconda. Mathew has well over 20 years’ diverse experience in cloud computing and product leadership. Previously, he was chief operating officer at container and microservices networking and management startup Weaveworks; vice president of VMware’s Cloud Services Group and cofounder of what became VMware’s vCloud Air IaaS service; and senior director of Symantec’s $1B+ Information Management Group. Early in his career, Mathew built compilers and distributed systems for projects like the International Space Station, helped connect six countries to the internet for the first time, managed a $630M router product line at Cisco, and attempted to do SDN 10 years too early at CPlane.

Comments on this page are now closed.


Mathew Lodge
27/05/2018 11:26 BST

Yes, I’ll get them posted on slideshare. In the meantime, please email me at mlodge at anaconda com

Mark Atterbury
26/05/2018 10:58 BST

Are the slides from this session going to be shared? Thanks, Mark