Build Systems that Drive Business
Sep 30–Oct 1, 2018: Training
Oct 1–3, 2018: Tutorials & Conference
New York, NY

Speakers

Hear from a wide range of talented senior engineers, systems practitioners, and technical managers who are doing amazing things in distributed systems and DevOps. More speakers will be announced; please check back for updates.

Naoman Abbas is a software engineer on the visibility team at Pinterest, where he leads the charge on Pintrace, their distributed tracing system. Prior to Pinterest, Naoman worked at Netflix and Microsoft as a software engineer building cloud platform components.

Presentations

Using distributed trace data to solve performance and operational challenges Session

Distributed tracing is the only method available today to trace requests across several systems. Data gathered with this technique can help solve a wide range of performance and operational challenges. In this talk, I’ll share how Pinterest uses trace data to solve a myriad of technical and analytical challenges, and how you can, too.

Sean T. Allen is vice president of engineering at Wallaroo Labs and a member of the Pony core team. His turn-ons include programming languages, distributed computing, Hiwatt amplifiers, and Fender Telecasters. His turn-offs include mayonnaise, stirring yogurt, and sloppy code. He is one of the authors of Storm Applied.

Presentations

Pat Helland and me: How to build stateful distributed applications that can scale almost infinitely Session

In 2007, Pat Helland published "Life Beyond Distributed Transactions: An Apostate’s Opinion," in which he conducts a thought experiment on how to design a distributed database that can scale almost infinitely. While the paper explicitly addresses distributed database design, Sean Allen shows that the ideas are far more widely applicable, particularly in scaling stateful applications.

Preetha Appan is a software engineer on the Nomad team at HashiCorp, most recently working on scheduler internals. She previously worked on various Consul features leading up to Consul 1.0. Before joining HashiCorp, she was an early engineer at Indeed.com, where she built distributed systems for search and recommendations from the ground up.

Presentations

Who guards the guardians? Designing for resilience in cluster orchestrators Session

In this talk, we will look at various failure modes ranging from network failures to entire server failures in Nomad, an open source scheduler that supports heterogeneous workloads.

I am a dual French-Australian citizen with a passion for IT automation, snowboarding, squash and much more!

I have extensive technical abilities and a passion for delivering services which improve the usability of systems. Much of my experience in recent years has been in designing, developing, implementing and testing infrastructure and application automation, with a strong focus on delivering CI/CD pipelines to dev teams.

I pride myself on my capacity to help organizations and development teams build end-to-end, secure, reliable, performant delivery pipelines and abstraction engines that let software engineers focus on what they do best: write and ship code. This is what I am currently doing at Airbus, where I work as a DevOps Principal Engineer and DevOps Evangelist.

Presentations

Regulated organizations and continuous delivery: a theory of opposite attraction Session

Continuous Delivery is hard. Continuous Delivery in highly regulated environments is even harder. With often thousands of applications running in production, tens of thousands of employees, heterogeneous technologies, and legacy systems, some organizations face challenges in embracing continuous delivery principles. This session aims to help regulated firms get started with CI/CD.

Hooman Beheshti is VP of technology at Fastly, where he develops web performance services for the world’s smartest CDN platform. A pioneer in the application acceleration space, Hooman helped design one of the original load balancers while at Radware and has held senior technology positions with Strangeloop Networks and Crescendo Networks. He has worked on the core technologies that make the Internet work faster for nearly 20 years and is an expert and frequent speaker on the subjects of load balancing, application performance, and content delivery networks.

Presentations

Revisiting HTTP/2 Session

Now that adoption is ramped up and HTTP/2 is being regularly used on the Internet, it's a good time to revisit the protocol and its deployment.

Kristina Bennett has worked at Google since 2009. Although she recently joined the Customer Reliability Engineering team in their mission to “SRE everyone else!”, she previously spent five years working on data integrity across Google.

Presentations

Tradeoffs in resiliency: managing the burden of data recoverability Session

Recommending practices for practical data recoverability and shining light onto some of the pitfalls awaiting the unwary, based on the lessons of five years of data integrity tooling and consulting across Google.

Presentations

Keynote by Michael Bernstein Keynote

Keynote by Michael Bernstein

Aaron has been building, breaking, and fixing systems for over a decade, from tiny startups to serving over 100M users at Netflix. He is presently applying his passion for empiricism and system design to multi-region high-availability architecture and operations on the Traffic team at Netflix. Previously, Aaron coauthored Chaos Engineering (O’Reilly, 2017).

Presentations

Availability, Latency and Cost: Withstanding regional outages Session

Multi-region deployments can improve availability and latency and can cost way less than you think.

Bill Boulden is the chief technology officer of ClearView Social, where he has migrated a VM-driven infrastructure to an autoscaling application fleet with serverless components. Bill has been developing software since the age of 6. Previously, he was an API architect at Delaware North Companies. Running serverless applications in production has given him a unique perspective on architecture and application delivery for modern companies. In his spare time, he’s a pink-haired house music DJ by the name of Spruke, who enjoys EDM and generative ambient music.

Presentations

Serverless APIs with AWS Lambda and API Gateway Tutorial

Serverless architectures remove load from web servers and scale flawlessly to handle any volume while keeping you from paying for an instant of wasted idle time. Bill Boulden walks you through creating a functioning serverless API that coexists alongside conventionally served web pages using AWS Lambda and API Gateway.

Michael Brunton-Spall is an independent security consultant. Previously, Michael was deputy director for technology and operations and head of cybersecurity at the UK Government Digital Service and held a number of jobs ranging from creating low-level embedded hardware to gaming development on consoles to scaling and operating the Guardian newspaper. He is a regular conference speaker, the author of Agile Application Security, and an enthusiastic Agilist and security geek.

Presentations

Attack trees: Security modeling for Agile teams Tutorial

Traditional security approaches to threat and risk management are highly optimized to work within a traditional software development lifecycle. Michael Brunton-Spall shares a new approach to reviewing systems along with real-life examples to help you prioritize where to focus security efforts and what sorts of security threats you should worry about.

Tammy Butow is a principal SRE at Gremlin, where she works on chaos engineering—the facilitation of controlled experiments to identify systemic weaknesses. Gremlin helps engineers build resilient systems using their control plane and API. Previously, Tammy led SRE teams at Dropbox responsible for the databases and storage systems used by over 500 million customers and was an IMOC (incident manager on call), where she was responsible for managing and resolving high severity incidents across the company. She has also worked in infrastructure engineering, security engineering, and product engineering. Tammy is the cofounder of Girl Geek Academy, a global movement to teach one million women technical skills by 2025. Tammy is an Australian and enjoys riding bikes, skateboarding, snowboarding, and surfing. She also loves mosh pits, crowd surfing, metal, and hardcore punk.

Presentations

Chaos Engineering Bootcamp Tutorial

A hands-on chaos engineering workshop, covering the tools and practices you need to implement chaos engineering in your organization.

Francesc Campoy Flores is vice president of developer relations at source{d}, a startup applying ML to source code and building the platform for the future of developer tooling. Previously, he was a developer advocate for Google’s Cloud Platform and Go teams. Francesc is passionate about programming and programmers, especially Go and gophers, and he’s given many talks and workshops at conferences like Google I/O, Gophercon(s), GOTO, and OSCON. When he’s not on stage, he’s probably coding, writing blog posts, or working on his JustForFunc YouTube series, in which he hacks while cracking bad jokes.

Presentations

Keynote by Francesc Campoy Keynote

Keynote by Francesc Campoy

Ian Coldwater is a DevOps Engineer at Jamf Software and a grown teenage hacker who is passionate about information security. She spends a lot of time breaking vulnerable software and teaching developers how to improve their security practices. In her spare time, she likes to go on cross-country road trips, capture flags and eat a lot of pie. She lives in Minneapolis and tweets at @IanColdwater.

Presentations

Ship of Fools: Shoring Up Kubernetes Security Session

Ian Coldwater gives you practical advice about securing your Kubernetes clusters, from an attacker’s perspective.

Molly Crowther is a Senior Technical Program Manager at Pivotal, working on security strategy and vulnerability management for Cloud Foundry. She also leads security workshops for open- and closed-source Cloud Foundry teams in North America and Europe.

Presentations

Faster is Safer: Security in the Enterprise Session

This talk is about how the enterprise can use cloud platforms to make security move at the pace of business and not the other way around.

Anil Dash is an entrepreneur and activist known as one of the most prominent advocates for a more inclusive and ethical technology industry. He is CEO of Fog Creek Software, the creators of Glitch, the friendly community that helps everyone make the app of their dreams.

Dash was an advisor to the Obama White House’s Office of Digital Strategy and was formerly a monthly columnist for Wired. Today, he serves as a board member for Stack Overflow, the Data & Society Research Institute, and the Lower East Side Girls Club and serves as an advisor to startups like Medium and non-profits like Donors Choose. The New Yorker described Dash as a “blogging pioneer” for his Webby-recognized personal website which began in 1999, and for his seminal work in helping create some of the first blogging and social media publishing tools.

As a public speaker, Dash has taken the stage at events ranging from the Obama Foundation Summit to the Aspen Ideas Festival to SXSW. He has been a guest on media and podcasts ranging from Vice’s Desus and Mero to Krista Tippett’s On Being, and collaborated with “Hamilton” creator Lin-Manuel Miranda to create one of the most popular Spotify playlists of 2018. Time named @anildash one of the best accounts on Twitter, and it is the only account ever retweeted by Bill Gates and Prince, a succinct encapsulation of Dash’s interests.

Presentations

Keynote by Anil Dash Keynote

Keynote by Anil Dash

Bart De Vylder is a data scientist at CoScale. Previously, Bart was active in software engineering and architecture, with a focus on distributed systems. His interests lie in machine learning and building reliable, scalable data processing systems. Bart holds a PhD in Artificial Intelligence from the Free University of Brussels.

Presentations

Kafka Streams in practice, what works and what doesn’t (yet) Session

In this session, Bart De Vylder shares his experience of migrating an existing codebase and production environment to Kafka Streams, a relatively new and promising streaming library. He’ll explain the choice of Kafka Streams, highlight aspects that worked remarkably well, and address the challenges he ran into.

Rocio Delgado is a Sr. Engineering Manager at Slack. Previously, she worked as a Sr. Engineering Manager at GitHub, at GE, and as a founding engineer at WorkMarket. She’s been a backend engineer, tech lead, and manager for 13+ years. She’s interested in performance, scalability, resilience, distributed systems, building and growing teams, and diversity and inclusion in tech. Her hobbies are yoga, dancing, and being a karaoke junior performer. She lives in Brooklyn via México.

Presentations

Communicating and managing change Session

Evolving teams and evolving companies are a constant in the career of a leader. Helping your team navigate that change is critical to your success as a manager and to the organization.

Jaana B. Dogan is a software engineer at Google, where she works on observability of Go production services. She has a decade of experience building developer platforms and tools.

Presentations

Keynote by Jaana Burcu Dogan Keynote

Keynote by Jaana Burcu Dogan

Throughout his career, Allan has worked on large distributed systems containing hundreds to thousands of servers in production. He has built scalable applications on various platforms ranging from large supercomputing centers to production clusters in the enterprise. He is currently managing distributed systems at scale for Bloomberg, where he oversees the company’s Hadoop infrastructure.

Presentations

Autoscaling Containers... with Math Session

Write your own autoscaler using simple math concepts from control theory.

For 25 years, Bret Fisher has built and operated distributed systems as a sysadmin and helped over 50,000 people learn dev and ops topics. He is a Docker Captain, the author of the wildly popular Docker Mastery series on Udemy, and also provides DevOps-style consulting and live workshops with a focus on immutable infrastructures, containers, and orchestration. Bret’s also an occasional shell, web, and JavaScript developer. He spends his free time in Virginia’s local, thriving tech scene helping lead local Code for America and Docker Meetups. Bret basically spends his days helping people and giving high fives. He lives at the beach, writes at bretfisher.com, prefers dogs over cats, and tweets at @bretfisher.

Presentations

Docker Zero to Hero: Docker, Compose and Production Swarm 2-Day Training

Containers will soon be the standard way to build, ship, and run most software. Get onboard fast with an end-to-end workshop by a Docker Captain! Start with the "why" of Docker, then create images, containers, and more. Dive into Docker Compose, the developer's best friend! Then gain DevOps skills by spending a whole day deploying multi-tier apps on server clusters with Swarm and other popular tools.

Liz is a Staff Site Reliability Engineer at Google and works on the Google Cloud Customer Reliability Engineering team in New York. She lives with her wife, metamour, and two Samoyeds in Brooklyn. In her spare time, she plays classical piano, leads an EVE Online alliance, and advocates for transgender rights.

Presentations

Building Successful SRE in Large Enterprises Session

Implementing Site Reliability Engineering doesn't have to be intimidating, and isn't only for cloud native organizations; we present eight key lessons from the Google Customer Reliability Engineering team's experience helping large enterprises adopt SRE as an operations engineering model.

Sebastien Goasguen is a twenty-year open source veteran. A member of the Apache Software Foundation, he worked on Apache CloudStack and Libcloud for several years before diving into the container world. He is the founder of Skippbox, a Kubernetes startup acquired by Bitnami. An avid blogger, he enjoys spreading the word about new cutting-edge technologies. Sebastien is the author of the O’Reilly Docker Cookbook and 60 Recipes for Apache CloudStack and the coauthor of Kubernetes Cookbook.

Presentations

Certified Kubernetes Application Developer (CKAD) Prep + Exam 2-Day Training

Are you proficient in developing and maintaining applications using Kubernetes? That’s the question more employers are asking these days. Prove you know your stuff by becoming a Certified Kubernetes Application Developer. In this 2-day package, you get a full day of test prep from O’Reilly’s top Kubernetes trainer Sebastien Goasguen, plus the opportunity to take the exam on-site.

Alice Goldfuss is a systems punk currently helping GitHub run their cutting-edge container platform. She loves kernel crashes, memory design, and performance hacks.

Alice has consulted on some books (Docker: Up & Running, Effective DevOps, Site Reliability Engineering vol 2), presented at some conferences (SREcon, Velocity, Container Summit), and run some others (LISA17, DevOps Days Portland). You can follow her on Twitter (@alicegoldfuss), but you’ll probably regret it.

Presentations

The Container Operator's Manual Session

In this talk, we’ll discuss some of the advantages and disadvantages of running containers, in production, at scale.

I am a software engineer with over 15 years of experience. I was Director and Principal Engineer at Getty Images, working on Sports, News and Entertainment tools, and then moved Getty’s Asset Management Platform to the cloud (https://speakerdeck.com/mhamrah/the-new-enterprise-rabbitmq-akka-and-aws-at-getty-images). I was later a Senior Software Engineer at Uber, working on metrics and monitoring (https://speakerdeck.com/mhamrah/uber-anomaly-detection-at-scale, which I presented at Velocity NYC). I am now at Namely, where I lead development of Namely’s platform.

Presentations

Frankenstein's Microservices: How To Avoid The Monster Session

Many companies adopt microservices to break down monoliths and soon uncover a hidden cost: how does one manage all these new interconnected things popping up? Avoid creating Frankenstein's Monster by understanding elements of a Microservice platform so you can sleep at night.

Michael Hausenblas is a developer advocate for Go, OpenShift, and Kubernetes at Red Hat, where he helps app ops engineers build and operate distributed services. Michael shares his experience with distributed systems and large-scale data processing through demos, blog posts, and public speaking engagements and contributes to open source software such as OpenShift and Kubernetes. Previously, Michael was a developer advocate at Mesosphere, chief data engineer at MapR Technologies, and a research fellow at the National University of Ireland, Galway, where he researched large-scale data integration and the internet of things and gained experience in advocacy and standardization (World Wide Web Consortium, IETF). In his free time, Michael contributes to open source software (mainly using Go), blogs, and hangs out on Twitter too much.

Presentations

Troubleshooting Kubernetes Applications Session

In this hands-on talk we address troubleshooting applications running in Kubernetes, from application-level debugging to distributed tracing to chaos engineering.

Presentations

Chaos Engineering Bootcamp Tutorial

A hands-on chaos engineering workshop, covering the tools and practices you need to implement chaos engineering in your organization.

Stephanie Hurlburt is a graphics engineer and the owner of Binomial. Currently, she’s building Basis, which is both a proprietary texture compressor and a new file format that will be an open standard for texture compression in the industry. Previously, Stephanie worked on graphics engineering and engine programming at Oculus and Unity.

Presentations

Keynote by Stephanie Hurlburt Keynote

Keynote by Stephanie Hurlburt

Kavya Joshi is a backend and systems developer at Samsara. Her favorite aspects of being a programmer are reasoning about systems at scale and delving into the inner workings of sophisticated software. When not programming, Kavya tends to spend her time on rock walls and mountaintops.

Presentations

Keynote by Kavya Joshi Keynote

Keynote by Kavya Joshi

I am an engineer who prides himself on building reliable, scalable infrastructure.

I specialise in maintaining large system infrastructure as demonstrated by work at LinkedIn (applications) and at The University of Queensland (networks). I possess high-level skills in maintaining Linux and Windows servers and their respective infrastructure services. My interpersonal skills allow me to interact with clients and colleagues in a professional manner using exemplary communication skills.

Previous work experience has included building small satellites at NASA and writing thermal environments software at Rio Tinto.

Presentations

Monitoring containers correctly Tutorial

In this hands-on workshop, you will build a small monitoring utility for cgroup containers in order to learn best practices in container monitoring. Using a small Java process, we'll walk through how to monitor the various cgroup constraints and demonstrate that understanding how applications (particularly JVM-based applications) perform in containers can be a surprisingly difficult task.

Nigel Kersten is chief technical strategist at Puppet. Nigel was responsible for the development of the initial versions of Puppet Enterprise and has since served in a variety of roles at the company, including head of product, CTO, and CIO. He has been deeply involved in Puppet’s DevOps initiatives and regularly speaks around the world about the adoption of DevOps in the enterprise and IT organizational transformation. Previously, Nigel worked at Google, where he was responsible for the design and implementation of one of the largest Puppet deployments in the world.

Presentations

Why are we all such hypocrites when it comes to DevOps? Session

There is often a large disconnect between how senior execs represent transformations internally and externally and how the people on the ground feel. Drawing on cognitive biases, communication theory, and real-world experience working with companies undergoing change, we'll explain why these situations occur and how we can mitigate them to close the DevOps hypocrisy gap.

Ameet Kotian is a Senior Storage Operations Engineer at Slack and responsible for operating the infrastructure used to store all of Slack’s data. Previously, he was one of the first Site Reliability Engineers at Twitter where he worked, for close to five years, on the graph storage service and Twitter’s internal distributed database called Manhattan. His work related to large scale automated deployments was featured on Twitter’s engineering blog.

Presentations

Smooth Scaling: Slack’s journey towards a new database Session

Slack’s rapid growth over the last few years outpaced our original database’s scaling capacity, which negatively impacted our customers and engineers. A small team of engineers embarked on a journey to find the right database solution and eventually steered toward Vitess, an open-source cluster database.

Bridget Kromhout is a principal cloud developer advocate at Microsoft. Her CS degree emphasis was in theory, but she now deals with the concrete (if the cloud can be considered tangible). After 15 years as an operations engineer, Bridget traded being on call for being on a plane. A frequent speaker and program committee member for tech conferences, she leads the devopsdays organization globally and the DevOps community at home in Minneapolis. She podcasts with Arrested DevOps, blogs at Bridgetkromhout.com, and is active in a Twitterverse near you.

Presentations

Kubernetes 101 Tutorial

In this hands-on Kubernetes workshop, Bridget Kromhout walks you through launching clusters and details all the moving parts you need to know about to use Kubernetes in production.

Bridget Lane is a Software Developer for Gannett & USA Today, where her day-to-day job involves deep-diving into Golang APIs, API management, and cache setup. In her free time, Bridget enjoys cooking, playing board games, and slaying ferocious beasts as a sorcerer in the distant realm of Dungeons and Dragons.

Presentations

From silos to a single pane of glass at USA TODAY NETWORK Session

Three years ago our technical teams were completely siloed, making improvements and troubleshooting difficult and often blind to the rest of the technical organization. We will talk about how drastically our tool belt, thought process, and goals have changed to help us to be a single pane of glass today.

I’m currently a backend engineer at Slack in San Francisco working to scale our Enterprise product to support some of the world’s largest companies. I spend most of my time chasing down people making network calls in a loop, refactoring unwieldy chunks of code, and consolidating redundant database schemas.

Presentations

How to Get Away with Refactoring Session

How do you refactor major, core functionality in a million-line codebase without disrupting the entire system? Learn how we overhauled channels at Slack and the many obstacles we overcame to boost both application performance and company-wide developer productivity (with only a few hiccups).

Moishe Lettvin has been writing software since the 1980s, when he worked on a DOS TSR written in a combination of Pascal and Assembly. Since then, he’s worked on projects from Microsoft Outlook to Google App Engine, and he currently works on backend projects at MailChimp.

Once upon a time, Moishe made a CRT explode with a software bug.

Presentations

Strategies for Better Technical Interviews Session

Technical interviewing is profoundly important, but unfortunately easy to do poorly and very difficult to do well; in this talk I’ll discuss some strategies for reducing bias and increasing the fidelity of your technical interviews.

Idit Levine is the founder and CEO of solo.io, where she is aiming to streamline the cloud stack. Prior to founding solo.io, Idit was the CTO of the cloud management division at EMC and a member of its global CTO Office. At EMC, Idit led, designed, and implemented project UniK, an open source platform for automating unikernel compilation and deployment, and project layer-x, an open source framework for cross-cluster scheduling. At solo, Idit recently released Squash, an open source platform for debugging microservices applications, and Gloo, the function gateway.

Presentations

Debugging microservices apps via service mesh, open tracing & Squash Session

Debugging microservices apps via service mesh, open tracing & Squash

Alicia Liu is the VP of Engineering at Nava, the first public benefit corporation to be a federal prime contractor, working with US government agencies to modernize digital services and infrastructure. Previously, Alicia was the CTO of Coach.me, helping people reach their goals through behavior design, community, and personal coaching. Alicia writes, mentors, and speaks on topics ranging from engineering best practices to diversity and engineering culture in the tech industry. Outside of work, she can be found reluctantly running, or trying to read with a cat or two on her lap.

Presentations

Go Slow To Go Fast: Building Strong Foundations for Leadership Session

Principles and practices to build strong foundations for leadership, and to improve mental health and resilience while dealing with the challenges of being in a new role.

Nikki McDonald is a content director at O’Reilly Media, where she writes, edits, and works with the industry’s leading practitioners to develop books, online courses, and training videos to help engineers and developers collaborate more effectively and create and deploy complex distributed systems. She also cochairs O’Reilly’s Velocity Conference, held annually in San Jose, New York, and London. Nikki started out as a features editor at MacUser magazine back when people were still dialing up to the internet with AOL. She lives in Ann Arbor, MI.

Presentations

Closing Remarks Keynote

Closing Remarks

Closing Remarks Keynote

Closing Remarks

Tuesday Opening Welcome Keynote

Tuesday Opening Welcome

Wednesday Opening Welcome Keynote

Wednesday Opening Welcome

Jessica is a software engineer from Cambridge, MA. She is an organizer for the Boston Python Meetup, an organizer and lecturer for the Boston Python Workshop for women, and a STEM volunteer in local schools.

Presentations

Keynote by Jessica McKellar Keynote

Keynote by Jessica McKellar

Presentations

Chaos Engineering Bootcamp Tutorial

A hands-on chaos engineering workshop, covering the tools and practices you need to implement chaos engineering in your organization.

James is a site reliability engineer at Quantopian, a Boston startup making algorithmic trading accessible to everyone. Past roles have seen him responsible for processing MRI scans at the Center for Brain Science at Harvard University, sales engineering and developer evangelism at AppNeta, and release engineering during the Romney for President 2012 campaign. Between NYSE trading days, he organizes DevOpsDays Boston and conducts Ansible trainings on O’Reilly’s Safari platform. What free time remains is dedicated to cooking, sci-fi, permadeath video games, and Satanism.

Presentations

Ansible for SRE Teams Tutorial

Ansible is a "batteries included" automation, configuration management, and orchestration tool that's fast to learn and flexible enough for any architecture. This workshop will help you learn Ansible with an eye towards sustainable development in cloud environments.

Sell Cron, Buy Airflow: Modern Data Pipelines in Finance Session

Quantopian integrates financial data from vendors around the globe. As the scope of our operations outgrew cron, we turned to Apache Airflow, a distributed scheduler and task executor. In less than six months we were able to rearchitect brittle crontabs into resilient, recoverable pipelines defined in code to which anyone could contribute.

Anubhav Mishra is a developer advocate at HashiCorp. Previously, he worked at Hootsuite, where he built distributed systems and a microservice delivery platform. Anubhav loves open source software and is constantly finding ways to contribute to projects that excite him and helping developers and operators do better. In his free time, he DJs, makes music, and plays football. He’s a huge Manchester United supporter.

Presentations

Smart Networking with Service Meshes Tutorial

Over the past year, service meshes have gained significant interest. Most service meshes have two components: a control plane and a data plane. Anubhav Mishra explains what it takes to build a scalable control and data plane. This talk also discusses how HashiCorp Consul provides features such as a distributed key-value store and service discovery that make it ideal for a control plane.

Christian is the Infrastructure Lead for HealthCare.gov’s core systems. He is also the cofounder of Nava PBC (https://www.navapbc.com/), a startup working across numerous federal agencies to radically improve how government serves people through technology. He lives in Washington DC.

Presentations

Migrating HealthCare.gov to Terraform: Lessons Learned Session

How we successfully migrated America's largest government website to the cloud infrastructure provisioning tool Terraform, the lessons we learned along the way, and how you can effectively use Terraform for your next project.

Aviran Mordo is the head of engineering at Wix. In his 20+ years in the software industry, Aviran has held a number of engineering roles and leading positions, from designing and building the US National Archives Electronic Records Archives prototype to building large search engine infrastructures. Aviran has vast knowledge of internet technologies, software development, and continuous delivery. He is a technology blogger as well as a dev-centric culture advocate.

Presentations

You got arrested by the CAP, you have the right to remain consistent Session

Challenges and real-life use-cases of handling data in a distributed environment

Leemay leads the Recommendations and Targeting engineering efforts at Comcast, and sets the strategic direction for Content Personalization for Comcast’s Xfinity consumer facing video products. Leemay also leads efforts with A/B testing, Testing and Targeting, and producing the metrics to measure successful customer outcomes.

Presentations

Migrating a recommendations platform from bare metal to the cloud Session

In this talk we'll walk through the importance of data collection pipelines and of storing data sets efficiently, with the intention of making them easily accessible to downstream machine learning platforms.

I enjoy solving problems, particularly those related to scale. I’ve worked in all parts of the application development lifecycle—from inception to deployment—and enjoy the challenges of creating secure, performant, resilient, and maintainable code and infrastructure. I’m currently a Site Reliability Engineer for Stack Overflow.

Presentations

Bulk Image Processing Using Kubernetes Session

How we used Kubernetes and Google Cloud to burst and extend the capacity of our physical infrastructure to optimize almost ten million images in less than two weeks.

Amy Nguyen is a software engineer at Stripe on the observability team and she works to make data accessible for everyone. Outside of work, Amy writes about the tech industry, loves baking, and reads too many self-improvement books.

Presentations

How to Break Up With Your Vendor Session

How do you make a deadline for an infrastructure-critical software migration while making sure everyone's requirements are met and no data has been lost?

Victoria Nguyen is a network systems engineer at Fastly. She loves rock climbing and Halloween.

Presentations

Networks, echolocation, and fish GIFs Session

Victoria Nguyen explains how Fastly overhauled the monitoring and data collection of its globally distributed network without its caches noticing.

Kris Nova is a Senior Developer Advocate for Heptio with an emphasis on containers, infrastructure, and Kubernetes. She is an ambassador for the Cloud Native Computing Foundation.

Prior to Heptio, Kris worked as a developer advocate for Microsoft, as well as an engineer on Kubernetes in Azure. She has a deep technical background in the Go programming language, and has authored many successful tools in Go. She is a Kubernetes maintainer, and the creator of kubicorn, a successful Kubernetes infrastructure management tool. She organizes a special interest group in Kubernetes, and is a leader in the community. Kris understands the grievances with running cloud native infrastructure via a distributed cloud native application, and recently authored an O’Reilly book on the topic called “Cloud Native Infrastructure”. Kris lives in Seattle, WA and spends her free time mountaineering.

Presentations

Moving an enterprise monolith to Kubernetes Session

In this demo we look at the true and painful story of what it is like to move a monolithic enterprise app to running in a container in Kubernetes. We prototype a production environment that matches true stories, and liberate the application into a scalable and modern cloud native environment.

Heather Osborn has been working in technology as a system and operations engineer for the last 25 years. Although not common in the tech world, she’s stayed with Ticketmaster for the last 20 years through the various incarnations – partly because of multiple technology reinventions and unique challenges, and partly because she wants to see what will happen next. She’s looking forward to this new era of public cloud and container orchestration.

Heather is an avid long distance runner who has lots of time to think about these things while pounding the pavement.

Presentations

Archaic to Orchestrated: Ticketmaster's hybrid devops transformation Session

If a company whose technology and human infrastructure have grown up organically around a custom-written VAX operating system can make the move to public cloud native applications and begin a rapid march to a hybrid cloud solution, so can you.

Jerome Petazzoni is a devops advocate and international speaker. He was born and raised in France, where he worked on geographic information systems, voice-over-IP, video streaming and encoding, and started a cloud hosting company back when EC2 wasn’t an Amazon product yet. He moved to California to build and scale the dotCloud PaaS, which eventually gave birth to Docker. He worked for Docker for 7 years, during which he represented the famous container company at hundreds of conferences and events, and trained thousands of engineers to use Docker, Swarm, and Kubernetes. He’s fluent in many languages (mostly programming ones, though), owns a dozen musical instruments, and can play the theme of Zelda on most of them.

Presentations

Kubernetes Bootcamp: Deploying and Scaling Microservices 2-Day Training

This hands-on training is targeted to people who don't have prior experience with Kubernetes. You will learn about its concepts and architecture, and how to use it to deploy and scale your applications. Each section features explanations, demos, and exercises. Each student is given access to a k8s cluster during the training, and will get first-hand experience by completing the exercises.

Neil Peterson is a datacenter and cloud enthusiast. With 15 years’ experience in large datacenter deployment, management, and maintenance operations, Neil now works as a senior content engineer delivering technical documentation and samples with a focus on Azure and containers.

Presentations

Consuming cloud services with the Kubernetes Service Catalog Session

This talk will be a technical deep dive on using the Kubernetes Service Catalog to dynamically provision and consume managed cloud services.

Guy Podjarny (@guypod) is Snyk’s co-founder and CEO, focusing on using open source and staying secure. Guy was previously CTO at Akamai following their acquisition of his startup, Blaze.io, and worked on the first web app firewall & security code analyzer. Guy is a frequent conference speaker & the author of O’Reilly’s “Securing Open Source Libraries”, “Responsive & Fast”, and “High Performance Images”.

Presentations

Securing Serverless: By Breaking In Session

Serverless shuffles security priorities, naturally addressing certain risks while elevating others, as this live hacking session will vividly demonstrate. We will break into a vulnerable demo serverless app while explaining each security mistake, its impact, and how it can be avoided. By the end, you'll better appreciate the need for keeping your functions secure, and be ready to handle it.

Alex Rasmussen is the VP of Engineering at Freenome, an AI genomics company with a unique approach to detecting cancer at its earliest stages and helping physicians optimize the next generation of precision therapies. He holds a Ph.D. from University of California San Diego, where his dissertation focused on highly-efficient large scale data processing systems. While at UCSD, he led the TritonSort project, which set several world records in large scale sorting.

Presentations

How do we solve the world's spreadsheet problem? Session

In this talk, we'll explore some of the data integrity problems that come up often when dealing with spreadsheet data, ask whether those problems are inescapable, and look at ongoing work to mitigate them.

Dave Rensin is the Director of Customer Reliability Engineering (CRE) at Google. His team takes Google SREs focused on the reliability and availability of internal Google systems and focuses them on the reliability and availability of customer production systems running on Google Cloud. His mission is to teach Google customers how to design, build, and run highly available systems using Google SRE practices and tools.

He is the author of several books, including two for O’Reilly, and holds more than a dozen patents in distributed systems, data acquisition, access control, and pattern matching.

Presentations

Building Successful SRE in Large Enterprises Session

Implementing Site Reliability Engineering doesn't have to be intimidating, and isn't only for cloud native organizations; we present eight key lessons from the Google Customer Reliability Engineering team's experience helping large enterprises adopt SRE as an operations engineering model.

Liz Rice is the technology evangelist at container security specialists Aqua Security. She has a wealth of software development, team, and product management experience from her years spent working on network protocols and distributed systems and in digital technology sectors such as VOD, music, and VoIP. When not building startups and writing code, Liz loves riding bikes in places with better weather than her native London, or racing in virtual reality on Zwift.

Presentations

A programmer's guide to secure connections Session

Beyond looking out for a little green padlock in the browser bar, what do you need to know about secure connections as a programmer? What do people mean by terms like authentication, verifying a certificate, or signing a message? This talk aims to demystify HTTPS, TLS, X.509 and more!

Matt has been a Ruby/Rails developer since 2006 and was CTO of several startups (all using Rails), most recently Rails Machine, a Rails hosting company. Now, as co-founder and CEO of ReactiveOps, a DevOps-as-a-Service and consulting company, he has led growth, completely bootstrapped, from zero to 20 people (and growing!), built a company/product/strategy, managed P&L, and (so far) has kept it from cratering into the ground.

Presentations

How NTSB Air Disaster Analysis Can Help You in an Emergency Session

In this talk, we’ll examine the failures of modern airline operations - from United Airlines flight 173 in 1978, to Air France flight 447 in 2009 and First Air flight 6560 in 2011 - and how the NTSB investigations and their aftermath have dramatically improved flight safety.

As an Executive Manager and Senior Architect, Casey manages teams to tackle Big Data, architects solutions to difficult problems, and trains others to do the same. He seeks opportunities to leverage his experience with distributed systems, artificial intelligence, translating novel algorithms and academia into working models, and selling a vision of the possible to clients and colleagues alike. For fun, Casey models human behavior using personality profiles in Ruby, Erlang, Elixir, Prolog, and Scala.

Presentations

Deprecating Simplicity Session

Use Chaos Engineering to embrace complexity and navigate it, rather than rejecting complexity and trying to erase it.

Presentations

Rebuilding the airplane in flight...safely Session

Rewriting the key software component of your platform from scratch is always intimidating, especially when you guarantee 100% uptime, your platform is in the critical application delivery path, and your environment is highly distributed. Shannon Weyrick discusses NS1's recent DNS server rewrite and the steps the company took to roll it out across its globally distributed network with no downtime.

Priyanka Sharma is an entrepreneur with a passion for building developer products and growing them through open source communities. Priyanka heads marketing and partnerships at LightStep and also works on the OpenTracing project, an instrumentation standard for distributed tracing. In her copious spare time, she advises startups at Heavybit Industries, an accelerator for developer products. Priyanka cofounded WakaTime, an open source time tracker for developers. She holds a BA in political science from Stanford University.

Presentations

Tracing Polyglot Systems: An OpenTracing Tutorial Tutorial

Understand how distributed tracing works and how to employ it in the development and operations of your applications in the programming language of your choice: Java, Go, Python, Node.js, C#, and C++.

Yuri is a software engineer at [Uber Technologies](http://eng.uber.com/), working on distributed tracing, reliability, monitoring, and efficiency. He is the founder of [CNCF Jaeger](http://jaegertracing.io), a distributed tracing system, and a co-founder of [CNCF OpenTracing](http://opentracing.io), a vendor-neutral instrumentation API for distributed tracing.

Presentations

Tracing Polyglot Systems: An OpenTracing Tutorial Tutorial

Understand how distributed tracing works and how to employ it in the development and operations of your applications in the programming language of your choice: Java, Go, Python, Node.js, C#, and C++.

Ines Sombra is director of engineering at Fastly, where she spends her time helping the web go faster. Ines holds an MS in computology with an emphasis on cheesy ’80’s rock ballads. She has a fondness for steak, fernet, and a pug named Gordo. In a previous life, she was a data engineer.

Presentations

Closing Remarks Keynote

Closing Remarks

Closing Remarks Keynote

Closing Remarks

Tuesday Opening Welcome Keynote

Tuesday Opening Welcome

Wednesday Opening Welcome Keynote

Wednesday Opening Welcome

Sharon Steed is a corporate empathy and communications consultant as well as an international keynote speaker. Sharon is a subject matter expert on communicating with empathy. She has spoken at companies on improving team communication and collaboration; at tech conferences on vulnerability as an asset; and has given a TEDx talk on empowering insecurities.

A lifelong stutterer, she uses her speech impediment to teach both what empathy is and how to be empathetic. Sharon’s course Communicating with Empathy is available on both LinkedIn Learning and Lynda.com. She lives in Pittsburgh, PA, USA.

Presentations

Creating cultures of empathy Session

This talk explains what empathy is (and is not) and gives individuals the tools needed to cultivate an empathy mindset at work and in life.

James Thompson is a staff software engineer at Nav, where he is committed to helping engineering teams become more deliberate in how they build software through developing strong learning cultures, principled engineering practices, and holistic architectural thinking. He has worked with web technologies since 2003. He also loves barbecue.

Presentations

Beyond Accidental Architecture Session

Accidental architecture is a product of circumstances rather than of deliberate development towards a goal, and it is best addressed by equipping teams to make more deliberate and informed technical decisions.

Laura Thomson is Senior Director of Engineering, Firefox Engineering Operations at Mozilla, and is on the board of ISRG, the non-profit behind Let’s Encrypt. Laura has worked in various engineering roles at Mozilla for the last ten years, and previously worked in jobs including Principal/VP at OmniTI, engineer, consultant, and computer science academic. She has written several books on various open source technologies, including “PHP and MySQL Web Development” (with Luke Welling, her husband). Laura is from Melbourne, Australia, but now lives with her family on a rustic horse farm in Maryland, USA, where she relaxes by indulging in manual labor and other rural pursuits.

Presentations

Keynote by Laura Thomson Keynote

Keynote by Laura Thomson

James Turnbull is the CTO of Empatico. A longtime member of the open source community, James is the author of 10 technical books about open source software: The Terraform Book, The Art of Monitoring, The Logstash Book, The Docker Book, Pro Puppet, Pulling Strings with Puppet, Pro Linux System Administration, Pro Nagios 2.0, and Hardening Linux. He was formerly CTO at Kickstarter and an advisor at Docker. James likes food, wine, books, photography, and cats. He is not overly keen on long walks on the beach or holding hands.

Presentations

Closing Remarks Keynote

Closing Remarks

Closing Remarks Keynote

Closing Remarks

Tuesday Opening Welcome Keynote

Tuesday Opening Welcome

Wednesday Opening Welcome Keynote

Wednesday Opening Welcome

Seth Vargo is a Developer Advocate at Google. Previously, he worked at HashiCorp, Chef Software, CustomInk, and a few Pittsburgh-based startups. He is the author of Learning Chef. Seth is passionate about reducing inequality in technology. When he is not writing, working on open source, teaching, or speaking at conferences, Seth enjoys spending time with his friends and advising nonprofits. He loves all things bacon.

Presentations

Modern Security Best Practices for Microservices and Distributed Systems Session

This talk outlines the key principles for securing microservices and distributed systems in the modern world where applications run in cloud or hybrid cloud infrastructure.

Husband, Father, Go Developer, DevOps Sorcerer, Manager, and coffee enthusiast… Not necessarily in that order. A man of many hats, Kris has been everything from an IT Director of a local non-profit, to a DevOps engineer for a cloud based telephony company, and he now manages the Developer Solutions team at Gannett (USAToday Network). In his professional life, he’s passionate about automation, clean code, and comfortable collaborative work environments. Outside of work, he’s passionate about his wife and kids, rock climbing, off-roading, politics, STEAM educational programs, comic book super heroes, coffee, and archery.

Presentations

From silos to a single pane of glass at USA TODAY NETWORK Session

Three years ago our technical teams were completely siloed, making improvements and troubleshooting difficult and often blind to the rest of the technical organization. We will talk about how drastically our tool belt, thought process, and goals have changed to help us to be a single pane of glass today.

A proven, high energy business executive in Information Technology with over 20 years of progressive experience exemplified by successful assignments in operations, sales and technology. An inspirational leader, who thrives in a dynamic, fast paced environment with the intellectual agility to prioritize and make decisions in challenging, difficult situations. A visionary, who creates a competitive advantage by leveraging technology with innovative solutions and approaches. Experienced start up technologist as founder of 4 startups including online marketing, gaming and web based application firms.

Presentations

Deploying Large Scale Microservices in China Using DC/OS Session

Lessons learned from building a large-scale DC/OS microservices architecture within the confines of mainland China's cloud services, Tencent and AliCloud.

Heidi Waterhouse is a developer advocate at LaunchDarkly. She works at the intersection of cutting-edge technology, usability, and possible disaster.

Presentations

Disaster Resilience the Waffle House Way: Flat-tops, feature flags, etc Session

This talk explores lessons about state that we can adapt from the world outside computers, how to quantify them using a finite state machine, and then how to explore implementing them automatically while we are in a less-than-perfect condition.

Sarah Wells has been a developer for 15 years, leading delivery teams across consultancy, financial services and media. Over the last few years she has developed a deep interest in operability, observability and devops, and this has recently led to her taking a new role as Technical Director for Operations & Reliability at the Financial Times.

Before that, she led work at the FT on building a semantic publishing platform, making it easy to discover and access all the FT’s published content via APIs in a common and flexible format. That project meant a focus on Go, microservices, containerisation, Kubernetes, and how to influence teams to do the right things.

Presentations

Switching Horses Midstream: The Challenges of Migrating 150+ Microservices to Kubernetes Session

The Financial Times recently migrated their content platform to Kubernetes. Find out what it takes to migrate 150+ microservices from one container stack to another, without affecting the existing production users and while the rest of your teams are working on delivering new functionality.

Shannon Weyrick is VP of Architecture at NS1. A 20-year veteran of internet infrastructure, Shannon is an accomplished technical architect, developer, and leader whose experience encompasses both development and operations of globally distributed platforms. Previously, Shannon worked at Internap and F5 Networks. A regular open source contributor, he has led and worked on a wide range of infrastructure projects, from high-performance servers to novel programming languages and runtimes, and enjoys writing and speaking at industry conferences.

Presentations

Rebuilding the airplane in flight...safely Session

Rewriting the key software component of your platform from scratch is always intimidating, especially when you guarantee 100% uptime, your platform is in the critical application delivery path, and your environment is highly distributed. Shannon Weyrick discusses NS1's recent DNS server rewrite and the steps the company took to roll it out across its globally distributed network with no downtime.

Jamie Wilkinson is a Site Reliability Engineer at Google. A contributing author to the “SRE Book”, he has presented on contemporary topics at prominent conferences such as linux.conf.au, Monitorama, PuppetConf, Velocity, and SRECon. His interests began in monitoring and automation of small installations, and continue with human factors in automation and systems maintenance on large systems. Despite over 15 years in the industry, he is still trying to automate himself out of a job.

Presentations

SLO Burn Session

A brief introduction to the theory of SLOs, a practical guide to implementing sustainable SLO-based alerting for systems of any size, and a discussion on the tooling required to supplement the system in the absence of cause-based alerting.

Adam Wolfe Gordon is a software engineer on the storage team at DigitalOcean, working primarily on block storage orchestration. He likes building elegant microservices, continuous deployment, and occasional forays into low-level software such as Ceph and QEMU.

Presentations

Managing Multiple Sources of Truth in Distributed Applications Session

When building distributed applications, it is highly desirable to maintain a single source of truth, such as a database, for all application state. Unfortunately, for some applications multiple sources of truth are unavoidable. This talk offers strategies, learned from real-world experience, for managing multiple sources of truth without sacrificing consistency and usability.

Martin Woodward is the principal program manager for DevOps at Microsoft, where he focuses on Visual Studio Team Services and Team Foundation Server. Previously, Martin was executive director of the .NET Foundation, helping drive Microsoft’s move to open source, and was responsible for the Java, Linux, and Mac tooling in the Developer division, where he helped introduce Git into Microsoft.

Presentations

60,000 tests in six minutes: Creating a Reliable Pipeline, Eliminate Flaky Tests and Deploy Safely but Quickly Session

Details on which types of tests to run at which stage of your DevOps pipeline, and why.

Jason is a technical evangelist at Datadog, where he works to inspire developers and ops engineers with the power of metrics and monitoring. Previously, he was the community manager for DevOps & Performance at O’Reilly Media and a software engineer at MongoDB. He’s currently exploring the world while living as a nomad and would love to hear about the part of the world that you call home.

Presentations

Canary Deploys with Kubernetes and Istio Session

I'll show how you can more easily test code in production while isolating the effect of potential issues using container orchestration and service meshes.