4–7 Nov 2019

Speakers

Hear from innovative engineers, talented leaders, and senior developers who are doing amazing things around systems engineering and cloud native infrastructure. More speakers will be announced; please check back for updates.

Grid view List view

Jibby Ayo-Ani is a DevOps engineer II at Welkin, a series-A health tech startup, where she leads the DevOps space to ultimately foster long-term solutions to patients with chronic diseases. She lives to fight the good fight; an experienced Silicon Valley software and DevOps engineer, Jibby has worked across multiple industries addressing social issues in both large and small technology companies as well as for the Marshallese government as a consultant to prevent bad guys from laundering money through off-shore bank accounts. With cloud computing as her core skill set, she’s become a subject matter expert on site reliability and disaster recovery aiming to build out scalable AWS cloud solutions.

Presentations

Reliability and resiliency in application development in the cloud Session

Look through the lens of the DevOps engineer with Jibby Ayo-Ani to discover the power of cloud computing while using it to its utmost power to protect your company against everything from security vulnerabilities to natural disasters.

Oscar Barlow is a senior consultant at Infinity Works. He has been a philosopher, marketer, and filmmaker. He’s seen all of IMDb’s top 250 movies, and once built a boat more or less because people told him he couldn’t.

Presentations

Dude, where's my yogurt? Algorithms and competition in UK retail Session

Oscar Barlow walks you through a case study in productionizing a system for algorithmically determining your competitive strengths and weaknesses in UK retail.

Daniel Barthelmes is the head of operations for Otto.de, the ecommerce platform of OTTO. During his nearly 10 years at OTTO, Daniel has helped change and develop Otto.de from a customized off-the-shelf software running in on-premises date centers to a state-of-the-art cloud native, distributed microservice platform, gradually transitioning more and more responsibilities from the centralized operations to the decentralized development teams. He enjoys working with people just as much as working with technology. At work he loves wearing a suit and a tie.

Presentations

No limits: When a migration to the cloud transforms your tech teams Session

When the development and engineering teams got the green light from management to go ahead and migrate OTTO's ecommerce platform to the cloud, the teams had no clue how to even approach such an undertaking. So they did what they always do. Daniel Barthelmes dives into the amazing results.

Nikhil Barthwal is a tech lead in the Google Cloud Platform at Google, working on Knative, a Kubernetes-based platform to build, deploy, and manage modern serverless workloads and is passionate about building distributed systems. He has several years of work experience in big companies and smaller startups and also acts as a mentor to several startups. Outside of work, he speaks at local meetups as well as international conferences on several topics related to distributed systems and programming languages. Learn more via his homepage.

Presentations

Knative: A Kubernetes framework to manage serverless workloads Session

Knative is a Kubernetes-based platform to build, deploy, and manage modern serverless workloads. It provides a set of middleware components that are essential to build modern, source-centric, and container-based applications that can run anywhere. Join Nikhil Barthwal to explore using Knative to build and deploy modern serverless workloads in a vendor neutral fashion.

Laurent Bernaille is a staff engineer on the compute team at Datadog, where he’s responsible for setting up and scaling Kubernetes platforms. Laurent has given several talks on the topic of application deployment and containers in conferences such as DockerCon, Open Source Summit, and EuroBSDCon.

Presentations

Kubernetes the very hard way Session

Laurent Bernaille examines the lessons he learned operating large Kubernetes clusters.

Julia Biro is a reliability engineer at Contentful, working with highly available systems serving mass customer traffic. She’s especially interested in the concepts that allow high availability—everything from horizontal scalability to the process around incident response. She sees systems, people, and the tooling that allows interaction between them as one whole that should be managed as such.

Presentations

Making S3 even more resilient using Lambda@Edge Session

Julia Biro explains technical solutions and insights from building a true multiregion active-active file service using Lambda@Edge and S3 (with buckets in multiple AWS regions).

Zack Bloom is a director of product at Cloudflare, where he works to build a better internet. Previously, he was the cofounder and CEO of Eager, a company acquired by Cloudflare in 2016, and an engineer at HubSpot, where he worked on open source software.

Presentations

Isolate computing Session

The way a process works hasn't materially changed since the mainframe, but with the internet and serverless, what we need from a computer has changed dramatically. If we take technology invented for running code in a browser, and move it into the network, many of our problems may disappear. Let Zack Bloom show you why.

Peter Bourgon is a principal engineer at Fastly. He’s also a distributed systems engineer and open source aficionado.

Presentations

Infinite parallel universes: Modeling state at the edge Session

Peter Bourgon dives into how Fastly is thinking about state at the edge and provides the first public look into the architecture and communication model of a global-scale data system that Fastly is currently prototyping.

Janna Brummel is an IT chapter lead (a line manager who still does day-to-day work) on the SRE team at ING in Amsterdam. Previously, Janna worked as a business manager for ING’s CIO and as a dev engineer developing software for ING’s debit and credit card backend systems.

Presentations

Revolutionizing a bank: Introducing service mesh and a secure container platform Session

For years, Janna Brummel and Robin van Zijll have been told no to any external hosting. They've always lost time by not being able to use open source and cloud native products without adjustments. All because they work for a bank. Things are changing now: Janna and Robin are proving it's possible to run APIs in a secure container platform in the public cloud.

Juan Pablo Buriticá is the vice president of engineering at Splice, where he leads a distributed team throughout the US and Latin America building a cloud platform for music creation, collaboration, and sharing. Juan Pablo has built effective software engineering organizations by emphasizing open source software values, technical excellence, trust, and empathy. He has organized five global software engineering conferences, spoken at multiple events, and founded and led the growth of Colombia’s JavaScript community, the largest Spanish-speaking JS community in the world, with more than 5,000 members.

Presentations

Accelerating engineering delivery tempo Session

The Splice engineering team grew almost 10 times in 18 months. The delivery practices that worked when it was 5 people broke way before it got to 50. Juan Pablo Buritica explains how the engineering team accelerated delivery using industry insights and data.

Ingrid Burrington writes, makes maps, and tells jokes about places, politics, and the weird feelings people have about both. Much of her work focuses on mapping, documenting, and studying the often-overlooked or occluded landscapes of the internet (and the ways in which the entire planet has become, in effect, a “landscape of the internet”). By examining the political geography and embodied realities of living on a networked planet, she seeks to demystify these technologies for nontechnical audiences and to reframe technology’s underlying politics and power dynamics. Her writing has appeared in the Atlantic, the Nation, Popula, e-flux journal, and other outlets. She’s also the author of Networks of New York: An Illustrated Field Guide to Urban Internet Infrastructure. Previously, Ingrid taught at Rhode Island School of Design, the Cooper Union, and the School for Poetic Computation. Her work has previously been supported by Eyebeam, Data & Society Research Institute, USC’s Annenberg Innovation Lab, and the Center for Land Use Interpretation.

Presentations

How to deploy infrastructure in just 13.8 billion years Keynote

The reliability of cloud services tends to operate in the perpetual present tense—focused more on maintaining systems right now more than preparing for a far future. Ingrid Burrington explores how reframing the time scales of computation can change and maybe improve the way your build infrastructure.

Jesse Butler is a principal cloud developer advocate with Oracle, where he’s an advocate and evangelist for all things cloud native, including Kubernetes, containers, and serverless. He’s been working with containers for several years, first in Solaris and later in Linux.

Presentations

Learning from the past: The cloud native systems analyst (sponsored by Oracle Cloud Infrastructure) Session

Learn from the systems analysts of yore and how their practices apply to today's cloud native stack. Jesse Butler gleans some powerful ideas from the past to enhance your observability, resilience, and security in the cloud.

Paris Buttfield-Addison is a cofounder of Secret Lab, a game development studio based in beautiful Hobart, Australia. Secret Lab builds games and game development tools, including the multi-award-winning ABC Play School iPad games, the BAFTA- and IGF-winning Night in the Woods, the Qantas airlines Joey Playbox games, and the Yarn Spinner narrative game framework. Previously, Paris was a mobile product manager for Meebo (acquired by Google). Paris particularly enjoys game design, statistics, blockchain, machine learning, and human-centered technology. He researches and writes technical books on mobile and game development (more than 20 so far) for O’Reilly; he recently finished writing Practical AI with Swift and is currently working on Head First Swift. He holds a degree in medieval history and a PhD in computing. Paris loves to bring machine learning into the world of practical and useful. You can find him on Twitter as @parisba.

Presentations

Replacing your gearbox at 100 mph: How live games monitor and change with millions playing Session

Games can teach the software world a lot about performance, monitoring, change, and beyond. Jon Manning and Paris Buttfield-Addison review current best practices for gathering large amounts of data from ongoing user activity, finding the best modifications to make in the analysis of that data, and how to roll out changes while hundreds of thousands of users are right in the middle of something.

David Cheney is a software engineer at VMWare. David has been involved with the Go project for more than eight years. He is a regular contributor to the language, focusing on Go on ARM processors. Previously, he ported Go to FreeBSD/ARM and Solaris/AMD64 and is working on a port to Linux/ARM64. David writes frequently about Go on his blog and has spoken locally and internationally.

Presentations

5 things Go taught me about open source? Keynote

This talk is about the unexpected things Dave learned along the way trying to convince programmers to try Go and how they might translate to the experiences that all have working in an ecosystem of open source projects.

Lita Cho is a senior software engineer on the networking team at Lyft, where she builds out the service mesh to handle both Kubernetes and legacy systems. She also maintains the tracing infrastructure at Lyft. While at Lyft, she’s worked on building out the API infrastructure using protocol buffers, creating systems that generate code and bring type safety to Lyft’s polyglot microservice architecture. Previously, she was at DreamWorks Animation for four years, working on production and pipeline tools for artists and animated movies.

Presentations

Deploying hybrid topologies with Kubernetes and Envoy: A look at service discovery Session

New software in production comes with a lot of risks, especially for companies with high availability requirements. However, it’s possible to make significant infrastructure changes while maintaining your company’s reliability. Jose Nino and Lita Cho outline deploying hybrid topologies with Kubernetes and Envoy.

Devesh Chourasiya is a technical lead on the transaction core team at Yelp, providing the commerce platform for enabling consumer transactions at Yelp. His passions include building scalable, performant, and well-monitored systems. He holds a master’s degree in computer science from the University of Arizona.

Presentations

Building autoscaling systems: A case study using Step Functions autoscaler Session

The challenging problem in scaling resources dynamically is to maintain a healthy system while limiting expense from unused resources. Devesh Chourasiya walks you through the challenges and major design considerations of any autoscaler system through a production case study of AWS Step Functions autoscaler at Yelp.

Jesus Climent is a senior site reliability engineer on the CRE team at Google, where he helps companies meet their reliability requirements. Previously, he spent eight years at Nokia as a system administrator and architecture engineer.

Presentations

SRE classroom: How to design a reliable application in three hours (sponsored by Google) Tutorial

Jesus Climent explores the key concepts behind microservices. With his guidance, you'll work through a problem and apply these concepts to evaluate and build systems of your own.

Kevin Crawley is a developer evangelist at Instana, an APM and container monitoring service provider, a local organizer for DevOpsDay Nashville and Docker Nashville, and speaks globally on topics including distributed computing, microservices, containers, monitoring, logging, deployment automation, observability, public speaking, alert fatigue, and human ops. He’s been distinguished by his peers as a Docker Captain for his outstanding work with the community and the successful implementation of Docker in Production over the past five years. He’s passionate about sharing the stories of success (and failure) of the teams he works with and his own experiences. Previously, he designed and built distributed scheduling systems using Symfony (PHP), Amazon SQS, DynamoDB, and Redshift. Most recently, he was responsible for designing, building, and delivering a modern software delivery platform using Go and Angular, which was integrated with Docker Swarm and GitLab. This application is capable of delivering phoenix environments and a fully configurable production pipeline. This application manages the SDLC of over 1,000 containers and dozens of engineers building on a Kafka, Cassandra, and Spring Boot ecosystem.

Presentations

Observability workshop: Instrumenting cloud native applications Tutorial

Kevin Crawley walks you through how to configure a Kubernetes cluster in the Google Kubernetes Engine (GKE), deploy Prometheus and Jaeger to monitor a distributed microservice application, and instrument that application by introducing libraries and tooling to support capturing business metrics. You'll configure, update, and deploy a cloud native application using Gitlab and Gitlab-CI.

Aish Dahal is a software engineer at Slack, where he works on scaling Slack for its largest customers. Previously, he’s worked for PagerDuty and Goldman Sachs.

Presentations

Automated refactoring and safety in large repositories Session

Aish Dahal explains how Slack uses compiler tools with the HipHop Virtual Machine (HHVM) in scaling and autorefactoring its large Hack code base.

Jennifer Davis is a cloud operations advocate at Microsoft. Previously, she was a principal site reliability engineer at RealSelf and developed cookbooks to simplify building and managing infrastructure at Chef. Jennifer is the coauthor of Effective DevOps and speaks about DevOps, tech culture, and monitoring. She also gives tutorials on a variety of technical topics. When she’s not working, she enjoys learning to make things and spending quality time with her family.

Presentations

Prioritizing trust while creating applications Session

Time and money are generally the resources we focus on when building applications. Yet we can’t buy trust; it builds slowly and can be broken quickly when we don’t factor it in to our development process. Jennifer Davis examines how to leverage security practices to enable an all-team approach to security.

The ultimate guide to complicated systems Keynote

Building and maintaining distributed systems is hard. Industry tools and recommended practices are evolving at an ever-increasing velocity. New platform choices reduce infrastructure management and add operational complexity obscuring the value of operation skills. Often, bureaucratic decisions drive practices and tool choices.

Dmitrii Dolgov is a software engineer at Zalando and a PostgreSQL contributor.

Presentations

PostgreSQL at low level: Stay curious Session

Dmitrii Dolgov takes a deep dive into how to troubleshoot intricate performance issues in PostgreSQL using such tools as strace, perf, extended Berkeley Packet Filter (eBPF). And stay curious.

Daniel Drozdzewski is a technical lead at Scott Logic, a bespoke software consultancy based in the UK. He’s sometimes funny (always trying!) and a Polish (by birth and formative years), British (by where home is), and Irish (by marriage) cultures blender. He took his first steps in technology by writing Minesweeper in C64 Basic back in the early 90s. After formal schooling and studies revolving around digital technology, he entered the world of software development and has been riding Moore’s Law wave for last 13 years. Previously, his career took him to startups and big corporations, developing software for mobile, desktop, and backend in a number of different technologies, but with very strong presence in Java and distributed and data processing. He has also been involved in setting up the Agile Yorkshire meetup back in 2007 (it was called eXtreeme Programming Club back then) and is the owner of (very quite recently) the Agilist Google User group. He’s a father of two girls and a husband to one very understanding Irish lady, a passionate cook, very keen but bad dancer, loves to travel and meet people, read books and watch movies.

Presentations

Performance beyond improv Session

In the world of software development or technology in general, performance often gets overlooked or is looked at late. Daniel Drozdzewski examines the philosophical aspects and benefits of keeping performance at the forefront of your mind.

Gilles Dubuc is a senior performance engineer and is a member of the performance team at the Wikimedia Foundation, where he focuses on bridging the gap between performance metrics and real user performance perception.

Presentations

How to make sense of real user performance metrics Session

Gilles Dubuc takes a deep dive into how Wikipedia interprets large amounts of real user performance data and the many pitfalls you can fall into when doing so.

Ismail Elouafiq is a data scientist and developer at Sweden’s Television. Ismail started as a developer by making and selling video games at a young age. He’s worked with data, machine learning, and artificial intelligence before it was popular. Ismail also writes online about data science, security, and machine learning. He also speaks five languages, and when he’s not looking at a screen, you’ll most likely find him rock climbing, freediving, or practicing partner acrobatics.

Presentations

Building a data ecosystem at Sweden's Television: Lessons and pitfalls Session

Sweden's Television manages online products that range from providing news to TV series and are used by millions of people. To make sure that it creates content that engages, entertains, and educates, it started its own platform for collecting and analyzing user data. Ismail Elouafiq highlights the architectural choices the company made and the lessons it learned in building its data ecosystem.

Liz Fong-Jones is a developer advocate, labor and ethics organizer, and site reliability engineer (SRE) with 15+ years of experience at Honeycomb. Previously, she was an SRE working on products ranging from the Google Cloud Load Balancer to Google Flights. She lives in Brooklyn with her wife, metamours, and a Samoyed/Golden Retriever mix, and in San Francisco and Seattle with her other partners. She plays classical piano, leads an EVE Online alliance, and advocates for transgender rights as a board member of the National Center for Transgender Equality.

Presentations

Cultivating production excellence: Taming complex distributed systems Session

Join Liz Fong-Jones to learn how taming the complex distributed systems you're responsible for requires changing not just the tools and technical approaches you use, it also requires changing who's involved in production, how they collaborate, and how you measure success.

Observing and understanding distributed systems with OpenTelemetry Tutorial

Liz Fong-Jones and Yoshi Yamaguchi walk you through a microservice-based system with distributed tracing using OpenTelemetry. You'll progress from inspecting raw trace data to visualizing the distributed traces with open source engines such as Jaeger and Zipkin, and you'll export the instrumented data to SaaS products such as Honeycomb and Stackdriver.

Lorenzo Fontana is an open source software engineer at Sysdig, where he primarily works on Falco, a Cloud Native Computing Foundation (CNCF) project that does container runtime security and anomaly detection. He’s passionate about distributed systems, software-defined networking, the Linux kernel, and performance analysis. He’s the maintainer of the IO Visors Project’s kubectl-trace.

Presentations

eBPF-powered distributed Kubernetes performance analysis Session

Extended Berkeley Packet Filter (eBPF) is one of the most exciting topics in the performance analysis space. IOVisor (Linux Foundation’s eBPF project) makes eBPF tools. Lorenzo Fontana is here to to explain how eBPF works, what purpose it serves, and the tooling available for Kubernetes.

Karthik Gaekwad is a cloud native developer advocate at Oracle Cloud Infrastructure. A veteran engineer, he enjoys building software products from scratch using cloud and container technologies. Previously, he worked in both large enterprises and startups, including National Instruments, Mentor Graphics, Signal Sciences, and StackEngine (acquired by Oracle). He organizes several conferences including Devopsdays Austin and Container Days, and he’s an accomplished author for LinkedIn Learning. Karthik holds an MS in computer engineering from the University of Arizona. In his free time, he enjoys spending time with family in his hometown of Austin, Texas, dabbling in new product ideas, and blogging .

Presentations

Kubernetes at scale: The good, the bad, and the ugly (sponsored by Oracle Cloud Infrastructure) Keynote

Karthik Gaekwad explores why the Oracle Container Engine for Kubernetes (OKE) is one of Oracle Cloud's most popular platforms. You'll learn the good and some ugly lessons learned along the way on how to manage, operate, and scale Kubernetes at a cloud provider scale.

Sébastien Goasguen built his first compute cluster while working on his PhD in the late ‘90s when they were still called Beowulf clusters; he’s been working on making computing a utility since then. He’s been focused on containers and container orchestration, creating a Kubernetes startup Skippbox where he created kompose, Cabin, and kubeless. Active in the serverless community, he cofounded TriggerMesh, a serverless management platform that builds on top of Kubernetes and Knative. He can be found hiking the Jura or at open source conferences. He’s the author of the Docker Cookbook and coauthor of the Kubernetes Cookbook.

Presentations

Certified Kubernetes Application Developer (CKAD) prep for exam 2-Day Training

Can you develop and maintain applications using Kubernetes? That’s the question more employers are asking these days. Take the next step in your career by becoming a Certified Kubernetes Application Developer at Velocity. You get a full day of test prep from Sébastien Goasguen, O’Reilly’s top Kubernetes trainer, and the opportunity to take the exam onsite.

Deploying serverless applications to any cloud with Knative Tutorial

Priyanka Sharma and Sébastien Goasguen break the shackles vendor lock-in by teaching you to deploy serverless functions to any cloud provider of your choice.

Christian Grabowski is a backend software engineer at NS1, a next-generation DNS and traffic management company. Christian has worn many engineering hats over the course of his career and has worked on a variety of software but loves getting into the nitty-gritty, low-level code the most. When he’s not developing fast, intelligent DNS services, he’s rather active in the open source community, contributing to projects such as gobpf, BCC, and Kubernetes.

Presentations

The golden idol swap: Pragmatic database migration Session

Switching databases requires a lot of effort from engineering teams, and Christian Grabowski walks you through steps you can take to reduce the amount of work needed to achieve payoff. NS1 created an abstraction layer of wire protocols between old and new databases, which allowed it to develop advanced functionality in new services, while legacy services required minimal changes.

Chris Guzikowski is the senior content director at O’Reilly, where he manages the acquisition and development of content in software architecture and software development. He’s also the cochair of the O’Reilly Software Architecture Conference. Chris has been working on technical content and technology marketing for more than 30 years. He and his family reside in suburban Boston, MA.

Presentations

Thursday Opening Welcome Keynote

Program chairs Chris Guzikowski, Ines Sombra Turnbull, and James Turnbull open the second day of keynotes.

Wednesday Opening Welcome Keynote

Program chairs Christopher Guzikowski, James Turnbull, and Ines Sombra open the first day of keynotes.

Jon Hall is a principal product manager at BMC. He’s been working in the technology sector, particularly in the field of IT service management (ITSM), for two decades, in roles ranging from development through implementation and process consultancy to product management. A strong DevOps advocate in the ITSM community, his work focuses on the alignment of the two ways of thinking in enterprises, particularly as DevOps scales beyond the scope of ad hoc support. He’s spoken at a large number of ITSM and DevOps conferences, including DevOps Enterprise Summit USA, Configuration Management Camp, and a number of devopsdays events.

Presentations

Swarming, Cynefin, and avoiding the problems of becoming a third-line support team Session

Jon Hall explores the challenges faced by DevOps practitioners as their role and scale grows in established enterprises and they get drawn into existing support structures. Join him to learn why swarming is a better alternative for DevOps than traditional tiered support.

Michael Hausenblas is a developer advocate at AWS, part of the container service team, focusing on container security. Michael shares his experience around cloud native infrastructure and apps through demos, blog posts, books, and public speaking engagements as well as contributes to open source software. Previously, was at Red Hat, Mesosphere, MapR, and in two research institutions in Ireland and Austria.

Presentations

GitOps 101 Tutorial

GitOps is the practice of continuous delivery using Git repos as the single source of truth, managing infrastructure and applications in an immutable and declarative manner. Michael Hausenblas motivates the model and shows it in action, using Kubernetes and a number of tools.

Mike Hobbs is a senior ops engineer at MOO and is a seasoned Linux ops engineer with nearly 20 years experience in this field. Having worked in small companies and large enterprises alike, Mike has seen many patterns emerge and repeat over the years and feels it’s about time he shared some of these experiences with a wider audience. He has a particular passion for the DevOps culture, feeling that the unification of disciplines within technology has the potential to deliver great benefits to business and higher job satisfaction for those working within it. Supporting data-driven decision making and all things monitoring, metrics, and observability are the areas in which Mike currently favors most.

Presentations

Helping your dev teams succeed at ops, post-Kubernetes Session

Michael Hobbs takes a look at how best to ensure your service owners can succeed with responsibilities and concerns that were traditionally the domain of ops teams prior to the deployment of Kubernetes for production load within a business.

Bastian Hofmann is a senior system engineer at SysEleven, a full-service hosting company, where he focuses on bringing the power of Kubernetes to its customers. Previously, he was focused on performance, monitoring, web security, and developer productivity at ResearchGate, the social network for researchers and scientists. When he’s not developing stuff or looking at graphs, he likes to cycle, do yoga, or go to beer gardens, and he frequently speaks at international conferences on software architecture, scaling web applications, and open standards and protocols.

Presentations

Highly available cross-region deployments with Kubernetes Session

Out-of-the-box Kubernetes makes it easy to deploy and scale your applications within one Kubernetes cluster in one single region. But it's also possible to deploy an application over multiple clusters in different regions, so it becomes truly highly available even if a complete region fails. Learn how to deploy one application across multiple federated Kubernetes clusters with Bastian Hofmann.

Nic Jackson is a developer advocate and polyglot programmer at HashiCorp and the author of Building Microservices in Go. In his spare time, Nic coaches and mentors at Coder Dojo teaching kids 7–14 an introduction to coding, teaches at Women Who Go and GoBridge, speaks and evangelizes good coding practice, process, and technique, and works to raise money for a charity he runs with his wife.

Presentations

Bridging the gap between brownfield and greenfield applications with a service mesh Tutorial

Dynamic cloud-based infrastructure has forced us to reevaluate how we route and secure traffic in our internal networks; a popular solution for this is a service mesh. The service mesh doesn't apply only to modern infrastructure. Erik Veld and Nic Jackson walk you through how to bridge the gap between brownfield and greenfield applications.

Peter Jausovec is a software engineer with more than 10 years of experience in the field of software development and tech. He’s spent time in various roles, starting with QA then moving to software engineering and leading tech teams. His early career was mostly focused on developer and cloud tooling; however, in recent years he’s been focusing on developing distributed systems and cloud native solutions.

Presentations

Building cloud native applications 2-Day Training

Peter Jausovec details fundamental building blocks of the next-generation cloud native apps. You'll learn to decide on containers versus functions, which communication patterns to use, how to set up development and debugging environments, CI/CD pipelines, and to monitor and observe your applications using key metrics through theory, demos, and exercises, using Kubernetes.

Nathanael Jean-Francois is the senior network architect at NS1, where he oversees the global network. He’s been in the internet infrastructure industry for about 10 years and has worked with companies from ad tech, CDN, and hosting, both in a network engineering capacity as well as system administration.

Presentations

BGP edge optimization with active measurement Session

The internet is underpinned by one critical protocol—Border Gateway Protocol (BGP). It's famously simple—by design. That simplicity helped it withstand the test of time and led many organizations to build complicated systems to drive their routing and shape their traffic. It's good enough for almost anything. Go beyond "good enough" with Nathanael Jean-Francois and take BGP to the next level.

Jonathan Johnson is an independent software architect with a concentration on helping others unpack the riches in the cloud native and Kubernetes ecosystems. Jonathan is halfway into his second score of engineering commercial software, driven by his desire to design helpful software to move us forward. His applications began with laboratory instrument software and managing its data. Jonathan was enticed by the advent of object-oriented design to develop personal banking software. Banking soon turned to the internet, and enterprise applications took off. Java exploded onto the scene, and since then he has inhabited that ecosystem. At 454 Life Sciences and Roche Diagnostics, Jonathan returned to laboratory software and leveraged Java-based state machines and enterprise services to manage the terabytes of data flowing out of DNA sequencing instruments. Then, as a hands-on architect at Thermo Fisher Scientific, he applied the advantages of microservices, containers, and Kubernetes to the company’s laboratory management platform. Jonathan enjoys comparing and sharing his adventures with peers. He shares ways to modernize application architectures while adhering to the fundamentals of high modularity and low coupling.

Presentations

Understanding Kubernetes Tutorial

Jonathan Johnson introduces you to Kubernetes for software engineers through concepts and a hands-on tutorials using KataCoda.com/javajon.

David Jungwirth is the head of DevOps consulting (EMEA) at Enterprise Studio by HCL Technologies. His interest and expertise include the design of automation solutions, driving cultural change, and streamlining organizational processes. David has held leadership positions in product management, architecture, technology consulting and IT management consulting. Collectively, his teams have delivered more than 10 thousand days of services over the past few years for customers across all industries and at every level of IT sophistication. David holds master’s degrees from Vienna University of Technology (medical informatics), University of Vienna (informatics management), and University of Salzburg (communication science), as well as several executive education certificates from MIT Sloan, Hernstein among others. He’s also earned project management and technical delivery certifications from APIM, SCRUM alliance, SAFe, the DevOps Institute, and AWS. You can find him on LinkedIn and Xing.

Presentations

Complex change through simple steps: 3 principles to reduce deployment time by 99% Session

While it’s great to think big, it's important to start small and sensibly. David Jungwirth explains how Enterprise Studio by HCL Technologies helped an enterprise achieve a deployment time reduction of 99%, double its releases, and massively reduce its overhead costs for each release with few small improvements over a period of two and a half years.

Park Kittipatkul is a software engineer at SignalFx, where he works on product engineering and enjoys exploring technologies, including serverless architecture and monitoring. He has over 10 years of experience in diverse areas, and while at SignalFx, he’s won multiple hackathons with innovative projects and presentations. Previously, he was at Twitter working on their feeds and timeline product, and he spent five years at Salesforce.com building sales, support, and communities products.

Presentations

Building maintainable, observable applications on multicloud serverless architecture Session

Serverless computing has a number of benefits over traditional application infrastructure. However, implementing maintainable and scalable applications using serverless computing services poses challenges. Park Kittipatkul details best practices for building, maintaining, and instrumenting applications on multicloud serverless architecture.

Ellen Körbes works with developer relations at Garden. They code, write, speak, teach Go, make videos, and dabble with Kubernetes. A native of Brazil, they’re deeply involved with diversity and inclusiveness in tech. They’re also an avid gopher—responsible for the most comprehensive Go course in Portuguese. They first got acquainted with Kubernetes while writing code for kubectl, in a SIG-CLI internship. They’ve spoken at world-famous events, and at countless local meet-ups. Ellen is a proud recipient of a “Best Hair” award.

Presentations

The state of Kubernetes development tooling Session

Developers working with Kubernetes still wonder what the optimal development workflow looks like. Join Ellen Korbes to take a look at the capabilities of the tooling available in the current landscape and see if it can offer end-to-end workflows that perform effectively in the real world.

Ignat Korchagin is a systems engineer at Cloudflare, where he works mostly on platform and hardware security. Ignat’s interests are cryptography, hacking, and low-level programming. Previously, he was senior security engineer for Samsung Electronics’s Mobile Communications Division, and his solutions can be found in many older Samsung smartphones and tablets. Ignat started his career as a security researcher in the Ukrainian government’s communications services.

Presentations

The definitive guide to making software fail on ARM64 Session

Even software, written in high-level cross-platform language with no assembly can fail multiple ways when ported to a different CPU architecture. Ignat Korchagin examines the issues Cloudflare encountered when porting its software stack to ARM64.

Alexandros Kosiaris is a principal site reliability engineer at the Wikimedia Foundation, where he’s pushed forward for more virtualization and better orchestrated microservices and environments for their execution. Previously, he was a Linux sysadmin, turned FreeBSD sysadmin, turned Linux sysadmin, turned systems engineer (somewhere along that path there’s a DevOps hat as well). Alexandros has been in the space since 1999, starting as a hobbyist, then a professional. The Kubernetes project is a current passion.

Presentations

What happens when you type de.wikipedia.org? Session

The Wikimedia Foundation runs the world’s favorite encyclopedia and one of the top 10 websites on the internet. Effie Mouzeli and Alexandros Kosiaris provide an overview of how Wikipedia is delivered to you.

Akshay is a Senior SRE on Cloud Bigtable, Google’s petabyte-scale NoSQL database. Before this, he’s worked as an engineer on Google Search and as an options trader at JPMorgan. He enjoys learning new things and scaling himself sub-linearly.

Presentations

SRE classroom: How to design a reliable application in three hours (sponsored by Google) Tutorial

Jesus Climent explores the key concepts behind microservices. With his guidance, you'll work through a problem and apply these concepts to evaluate and build systems of your own.

Andy Kwiatkowski is a senior production engineer at Shopify, where he helps drive capacity planning, autoscaling, and job infrastructure. Being a hyper-growth company, the infrastructure needs of Shopify are constantly changing, and Andy works with the world-class team at Shopify to keep up with ever-growing demand. Previously, he was an engineering manager at D2L, a leader in cloud-based learning management systems, and he spent 12 years as developer in the video game industry, working for large companies such as Electronic Arts, Rockstar Games, and his own development studio.

Presentations

Autoscaling in reality: Lessons learned from adaptively scaling Kubernetes Session

Andy Kwiatkowski takes a deep dive into how Shopify saved a million dollars a year in infrastructure costs by rolling its own autoscaler.

Danielle Lancashire is a senior software engineer at HashiCorp where she works on the Nomad cluster orchestrator. Previously, she was a staff software engineer at CircleCI where she worked on build infrastructure and was a maintainer of CocoaPods and fastlane. She’s also been described as the queer who yells at clouds.

Presentations

Stateful systems in the time of orchestrators Session

As organizations increasingly move workloads to cluster orchestrators, they frequently run into issues when trying to manage their stateful services. Danielle Lancashire demystifies how orchestrators interact with storage providers, explores common issues, and guides you through how to reliably run stateful workloads in cluster orchestrators.

Homin Lee is a data scientist at Datadog, where he writes algorithms that process trillions of data points a day. Previously, Homin built large-scale machine learning systems at several startups. Homin has a PhD from Columbia University in computational learning theory and was a Computing Innovation Fellow at the University of Texas at Austin.

Presentations

The observability graph: Knowledge graphs for automated infrastructure observability (sponsored by Datadog) Session

Homin Lee details constructing and using knowledge graphs to help DevOps teams make sense of the overwhelming volume of metric, log, trace, and event data generated by today's observability systems.

Jan Lehnardt is the CEO of Neighbourhoodie. He makes @couchdb @hoodiehq @jsconf EU @greenkeeperio. and @offlinefirst. He’s dissatisfied with the status quo, a duct tape artist, and feminist.

Presentations

CouchDB 4.0: 1.0 + 2.0 = 4.0 Session

Jan Lehnardt examines the challenges and solutions of changing the storage and distributed systems underpinnings from AP to CP (as per CAP) while retaining the unique multimaster replication APIs that CouchDB is renowned for.

Ho-Ming Li is the lead solutions architect at Gremlin. Previously, he worked at Amazon Web Services with many customers, providing guidance around architectural and operational best practices. He takes a strategic approach to deliver holistic solutions, often diving into the intersection of people, process, business, and technology. His goal is to enable everyone to build more resilient software by means of chaos engineering practices.

Presentations

Chaos engineering: When the network breaks (sponsored by Gremlin) Session

Ho-Ming Li outlines how to use chaos engineering to accelerate your understanding of how your network can break (packet loss, black hole attacks, latency injection, and packet corruption) and impact your services.

Jon Manning is the cofounder of independent game development studio Secret Lab. He’s working on the critically acclaimed award-winning adventure game Night in the Woods, which includes his interactive dialogue system Yarn Spinner, and Button Squid, a top-down puzzler. He’s written a whole bunch of books for O’Reilly about iOS development and game development. Jon holds a PhD about jerks on the internet. He’s currently writing Practical AI with Swift for O’Reilly.

Presentations

Replacing your gearbox at 100 mph: How live games monitor and change with millions playing Session

Robin Marx is a web performance researcher at Hasselt University, Belgium. He’s the author of the academic paper on HTTP/2 prioritization and has been a contributor in the QUIC working group for multiple years. He focuses on creating tools for the QUIC and HTTP/3 protocols that helps others debug their behavior and performance. He often talks about QUIC and web performance at international conferences, trying to make the complex situations more insightful to the wider public. On the weekends, Robin likes to hit other people on the head with longswords.

Presentations

Fixing HTTP/2 and preparing for HTTP/3 over QUIC Session

Deploying HTTP/2 correctly can be challenging in practice, and HTTP/3 will make things even more difficult as the underlying QUIC protocol runs over user datagram protocol (UDP). Robin Marx explores practical proxying, caching, load balancing, and routing issues and how to overcome them.

Josh Michielsen is a senior software engineer for the platform engineering team at Condé Nast International, where he helps to drive the vision of a truly global platform to house some of the world’s largest online publications. He specializes in container orchestration, software development, continuous delivery, and cloud operations. When he’s not wrangling Kubernetes or checking if err != nil in Go, he’s a hobbyist data scientist, photographer, cyclist, and doge owner living in Cambridge, UK.

Presentations

Operating a global cloud native platform Session

Operating cloud native infrastructure is more than just spinning up a container orchestrator. Auxiliary services are required in order to operate effectively and provide developers with a true platform experience. Josh Michielsen explores how Condé Nast operates multiple Kubernetes clusters across the world, with a focus on observability, testing, app delivery, and developer experience.

James Mickens is an associate professor of computer science at Harvard University. His research focuses on the performance, security, and robustness of large-scale distributed web services. James earned a BS degree in computer science from the Georgia Institute of Technology and a PhD in computer science from the University of Michigan. Previously, he spent six years as a researcher at Microsoft. He currently does not have a Wikipedia article, mainly because Wikipedia servers do not accept 17 GB text files entitled “Let Me Sing The Song Of James Mickens (This Song Was Uploaded From An I.P. Address Owned By James Mickens).”

Presentations

My love letter to computer science is very short and I also forgot to mail it Keynote

Details to come.

Russ Miles is CEO of ChaosIQ.io, where he and his team build commercial and open source products and provide services to companies applying chaos engineering to build confidence in the resilience of their production systems. Russ is an international consultant, trainer, speaker, and author. His most recent book, Antifragile Software: Building Adaptable Software with Microservices, explores how to apply chaos engineering to construct and manage complex, distributed systems in production with confidence.

Presentations

Fast track to chaos engineering 2-Day Training

Build confidence in your systems' behavior and identify weaknesses before they happen. Join Russ Miles on a deep dive into chaos engineering and learn how to apply it in your organization.

Effie Mouzeli is a site reliability engineer at the Wikimedia Foundation, where she’s one of the newer members of the SRE team. She studied physics and scientific computing but decided to follow neither. Instead she became a sysadmin, later a systems engineer, now an SRE. She worked in a number of startups and small organizations where her responsibilities were usually automation, infrastructure architecture, and working closely with developers. Away from work, she loves camping, concerts, and dressmaking.

Presentations

What happens when you type de.wikipedia.org? Session

Jose Nino is the lead for core server networking group at Lyft. Jose has been instrumental in creating systems to scale Lyft’s Envoy production environment for increasingly large deployments and engineering orgs. He’s worked as an open source Envoy maintainer and has nurtured Envoy’s growing community. Recently, Jose has moved on to scaling Lyft’s network load tolerance systems. Jose has spoken about Envoy and other related topics at several venues, most recently at KubeCon EU 2018 and at the main stage at KubeCon NA 2018.

Presentations

Deploying hybrid topologies with Kubernetes and Envoy: A look at service discovery Session

Ana Oprea specializes in site reliability engineering, security, and planning and strategy for Google’s technical infrastructure—a role that follows naturally from her previous experience as a software developer, technical consultant, and network admin. After working and studying in Germany, France, and Romania, she accounts for different cultural approaches when facing any challenge.

Presentations

Secure reliable systems Keynote

Ana Oprea examines SRE and security best practices for designing, operating, and scaling dependable infrastructure.

Paul Osman is an engineering manager at Under Armour Connected Fitness. He’s been building external and internal platforms for over 10 years. From public APIs targeted at third parties to internal platform teams, he’s helped build distributed systems that power large-scale consumer applications. He’s also managed teams of engineers to deliver service-based software systems rapidly and with confidence.

Presentations

Measuring chaos: Chaos engineering and team health Session

Once restricted to companies like Netflix, chaos engineering is becoming a common practice in organizations of all sizes. Paul Osman outlines techniques Under Armour uses to measure service health with chaos engineering. He details its operational maturity model and how the company uses it to blamelessly identify teams that need additional help and action items to improve resiliency and happiness.

Manuel Pais is an independent DevOps consultant, focused on team design, practices and flow. He helps organizations define and adopt DevOps and continuous delivery (both from technical and human perspectives) via strategic assessments, practical workshops and coaching. Manuel is also coauthor of the book Team Topologies: Organizing Business and Technology Teams for Fast Flow (IT Revolution Press, 2019). He’s also an InfoQ lead editor and answers by @manupaisable on Twitter and Medium.

Presentations

The elephant in the Kubernetes room: Team interactions Session

Regardless of all the technical benefits that Kubernetes brings, team interactions are still key for successfully delivering and running services. Manuel Pais explores how team design affects the success of Kubernetes adoption.

Jérôme Petazzoni is a DevOps advocate and international speaker. He was born and raised in France, where he worked on geographic information systems, voice over IP, video streaming, and encoding and started a cloud hosting company back when EC2 wasn’t an Amazon product yet. In California he built and scaled the dotCloud PaaS, which eventually gave birth to Docker. While at Docker, he represented the company at hundreds of conferences and events and trained thousands of engineers to use Docker, Swarm, and Kubernetes. He’s fluent in many languages (mostly programming ones), owns a dozen musical instruments, and can play the theme of Zelda on most of them.

Presentations

Deploying and scaling applications with Kubernetes 2-Day Training

Jerome Petazzoni dives into Kubernetes' concepts and architecture and explains how you can use it to deploy and scale your applications, featuring theory and practice with demos and exercises. You'll be given access to a Kubernetes cluster during training, and get your hands dirty completing the exercises.

Björn Rabenstein is an engineer at Grafana Labs and a Prometheus developer. Previously, he was a production engineer at SoundCloud, a site reliability engineer at Google, and a number cruncher for science.

Presentations

What remains of dashboards and metrics without the hype and anti-patterns Session

Open source tools for dashboarding and metrics have seen massive adoption in recent years. Riding the hype, the new, shiny tools are inevitably confronted with overblown expectations and problematic usage patterns, causing frustration and criticism. Björn Rabenstein outlines how to use dashboards and metrics effectively rather than condemning them altogether.

Lena Reinhard is the director of engineering at CircleCI, the leader in continuous integration and delivery for developer teams. After a career in finance, arts, and media, Lena found herself working in tech, and at age 26 cofounded her first software company and became a CEO. Over the years, she’s supported distributed engineering organizations around the world, helping them deliver great products while continuously learning through inclusive cultures and growth-oriented feedback. Lena is convinced that only diverse teams in inclusive organizations can build appropriate solutions for the challenges that humanity is facing. She’s also a proficient writer, photographer, and forever a learner.

Presentations

Building high-performing engineering teams, 1 pixel at a time Keynote

Psychological safety is one of the leading indicators of a high-performing team. Yet, Lena Reinhard explains, forging deep human relationships and building trust can be difficult when your team is distributed or largely interacts on screens.

Alois Reitbauer is a chief technology strategist at Dynatrace. Alois is fanatic about monitoring, DevOps, and application performance and has spent most of his professional career building monitoring tools and speeding up applications. He’s a regular conference speaker, blogger, book author, and sushi maniac.

Presentations

Don't let your delivery pipelines become your next legacy code Session

The "you build it, you run it" DevOps movement has made developers write a lot of ad hoc automation code. Alois Reitbauer explains why you need to make sure this code is modular, extensible, and maintainable.

Liz Rice is the technology evangelist at container security specialists Aqua Security and coauthor of the O’Reilly report Kubernetes Security. She has a wealth of software development, team, and product management experience from her years spent working on network protocols and distributed systems and in digital technology sectors such as video on demand (VOD), music, and voice over internet protocol (VoIP). When not building startups and writing code, Liz loves riding bikes in places with better weather than her native London or racing in virtual reality on Zwift.

Presentations

A beginner's guide to eBPF Session

Liz Rice empowers you to start writing extended Berkeley Packet Filter (eBPF) programs so you can unleash what has been described as superpowers for Linux.

Tess Rinearson works on Tendermint, a proof-of-stake consensus engine which powers a variety of blockchain projects, including the Cosmos Network. She’s worked in the blockchain and cryptocurrency space since early 2015, focusing largely on storage and protocol problems for permissioned blockchains (and writing a popular bitcoin explainer—using emoji—along the way). Before falling into the blockchain hole, she was a full stack engineer at Medium.

Presentations

Consensus is for everybody Session

Maybe you've tried to get a group of computers to come to a consensus. Computer scientists have studied this problem for decades, but it’s gained attention again recently as blockchains and cryptocurrencies rely on consensus algorithms to operate safely. Tess Rinearson introduces you to distributed consensus with an emphasis on the wide variety of algorithms used in blockchains.

Baruch Sadogursky (JBaruch) is a developer advocate at JFrog. His passion is speaking about technology. Well, speaking in general, but doing it about technology makes him look smart, and 17 years of high-tech experience sure helps. When he’s not on stage (or on a plane to get there), he learns about technology, people, and how they work or—more precisely—don’t work together. He’s a CNCF ambassador, Developer Champion, and a professional conference speaker on DevOps, Java, and Groovy topics. He’s also a regular at the industry’s most prestigious events, such as JavaOne (where he was awarded a Rock Star award), DockerCon, Devoxx, devopsdays, OSCON, and Qcon. You can follow him on Twitter at @jbaruch.

Presentations

DevOps patterns and anti-patterns for continuous software updates (sponsored by JFrog) Session

Baruch Sadogursky analyzes real-world software update fails and how multiple DevOps patterns that fit a variety of scenarios could have saved the developers. Manually making sure that everything works before sending an update and expecting the user to do acceptance tests before they update is most definitely not on the list of such patterns.

Lorenzo Saino is an engineering manager at Fastly, where he leads the team responsible for building the systems that control Fastly network infrastructure.

During his tenure at Fastly he built systems solving problems including fault tolerance, load balancing and network observability.

Prior to Fastly, he received a PhD from University College London. His thesis investigated design issues in networked caching systems and he was awarded the Fabrizio Lombardi prize in 2016 for his research.

He served on the program committee of O’Reilly Velocity EU in 2017 and 2018 and he is a regular reviewer for journals and conferences in the field of computer networks and systems.

Presentations

The power of good abstractions in systems design Keynote

This talk shows how good abstractions make it possible to identify and apply solutions to seemingly unrelated problems from different disciplines to build better systems with less effort.

Eleanor Saitta is a practice lead at Systems Structure Ltd, a security architecture and strategy consultancy with media, finance, healthcare, infrastructure, and software clients across the US and Europe. She’s worked in security for 16 years, covering everything from core security engineering and architecture work for Fortune 50 software firms to cross-domain security for news organizations and NGOs targeted by nation states. She’s a cofounder and developer for Trike, an open source threat modeling methodology and tool that partially automates the art of security analysis, and has contributed to the Briar and Mailpile secure messaging projects. She’s also a regular speaker at industry conferences; past venues include the O’Reilly Velocity Conference, Kiwicon, ToorCon, CCC, Hack In The Box, and HOPE, among others. You can find her on twitter as @dymaxion and at https://dymaxion.org. SSL lives at https://structures.systems.

Presentations

Taking the ops out of DevOps Session

Infrastructure as code and immutable images can make your systems radically more secure and improve team resilience, but Eleanor Saitta outlines why taking advantage of them requires rethinking the role of ops both technically and philosophically.

Priyanka Sharma is the director of Cloud Native Alliances at GitLab, the only cloud-agnostic single application for the entire DevOps lifecycle. Priyanka has worked on several Cloud Native Computing Foundation (CNCF) projects, with her deepest expertise being the OpenTracing standard. She’s worked on Jaeger tracing and Vitess and given talks on Kubernetes, Prometheus, Envoy, and the Secure Production Identity Framework for Everyone (SPIFFE)/SPIRE. A former entrepreneur with a passion for building developer products and growing them through open source communities, Priyanka advises startups at HeavyBit Industries, an accelerator for developer products. She holds a BA in political science from Stanford University.

Presentations

CD with Kubernetes: The prequel (sponsored by GitLab) Session

If you're interested in learning a framework of reference to enable continuous deployment to Kubernetes for business-critical production applications, join in. Priyanka Sharma bridges the gap between how to make large-scale migrations of production applications and the nitty gritty details that engineering managers and leaders need to consider.

Emily is a Senior Software Engineer at Fastly, where she works on the platform for delivering core CDN configurations and develops in Perl using speech recognition. In a past life, she worked in HR at mobile gaming companies. Emily holds a BA in Architecture from UC Berkeley, and in her spare time likes to hang out in parks with her dog, named Chicken.

Presentations

Voice-driven development: Who needs a keyboard anyway? Session

Keyboards are the way most developers program computers, and being unable to use a keyboard for extended periods of time can seem like a career-ending limitation. Emily Shea leads a demo-driven showcase of the speech recognition technology she's been using for the last year, starting from a solution to repetitive strain injury (RSI) and developing into an efficient way to develop software.

Kelly Shortridge is the vice president of product strategy at Capsule8. Kelly is known for research into the applications of behavioral economics to information security and has spoken at conferences internationally, including Black Hat USA, AusCERT, Hacktivity, TROOPERS, and ZeroNight. Previously, Kelly served in product roles at SecurityScorecard and BAE Systems after cofounding IperLane, a security startup that was acquired. Kelly began their career as an investment banking analyst at Teneo, covering the data security and analytics sectors.

Presentations

Controlled chaos: The inevitable marriage of DevOps and security Keynote

Software is eating the world, and security will be eaten as well if it doesn't evolve. Kelly Shortridge exposes why chaos and resilience engineering represents the future of security programs—and why it catalyzes the dawn of defensive innovation. You'll examine how adopting distributed, immutable, and ephemeral infrastructure (the "DIE" triad) can create powerful security benefits.

Andreas Sieferlinger is a cloud platform engineer at Scout24, where he works on building CI/CD systems. He focuses on simplifying workflows by making it easy to follow best practices. Previously, he worked on designing scalable AWS architectures for video streaming and developer tooling to simplify cloud migrations. Aside from IT, he spends most of his time in Boy Scout-ing, outdoor activities, and fiddling around with synthesizers. You can find him on Twitter as @webratz.

Presentations

The deputy shot the sheriff: Privilege escalation in build pipelines Session

Build pipelines are commonly used in the industry to build and roll out changes to cloud accounts. Typically, wide permissions are granted to those systems, making them an interesting attack vector. Take a look with Andreas Sieferlinger at typical vulnerabilities and examine the case of the confused deputy—a trusted third-party party—and how these vulnerabilities can be mitigated in real-life.

Rob Skillington is the chief technology officer at Chronosphere. Previously, he was on the monitoring team at Uber where he created M3DB, an open source time series database built for M3 to scale to the needs of Uber’s ever-growing metrics footprint of more than ten billion metrics. He’s also a member of OpenMetrics, an effort to create an open standard for transmitting metrics at scale.

Presentations

M3 and Prometheus: Monitoring at planet scale for everyone Session

Rob Skillington and Łukasz Szczęsny explore scaling monitoring, alerting, and configurational complexity for a single view of your applications, databases, infrastructure, and operations across all regions using M3 and Prometheus.

Ines Sombra is director of engineering at Fastly, where she spends her time helping the web go faster. Ines holds an MS in computology with an emphasis on cheesy ’80s rock ballads. She has a fondness for steak, fernet, and a pug named Gordo. In a previous life, she was a data engineer.

Presentations

Thursday Opening Welcome Keynote

Program chairs Chris Guzikowski, Ines Sombra Turnbull, and James Turnbull open the second day of keynotes.

Wednesday Opening Welcome Keynote

Program chairs Christopher Guzikowski, James Turnbull, and Ines Sombra open the first day of keynotes.

James Stewart is an independent consultant helping senior leaders embed modern technology and security in their strategies and lead transformational change. Previously, James was a cofounder of the UK Government Digital Service and served as deputy CTO of the UK government, where he was instrumental in the UK government’s use the public cloud, embrace of open source, and changing approach to security, all with the goal of increasing government’s ability to focus on user needs. James speaks regularly around the world on organizational transformation, technology strategy, and cybersecurity.

Presentations

Solving problems with principles Session

Leading organizations focus their efforts on meeting user needs and avoiding the trap of large-scale enterprise architecture plans. With competing technology demands, it's hard to establish a common vision, let alone dependencies between teams and systems. James Stewart and Dafydd Vaughan explain how developing a solid set of principles can help you solve problems across complicated organizations.

Jennifer “Jenn” Strater is a developer advocate for Gradle. She’s the owner of the Groovy Community slack and a contributor to various open source projects including Spring REST Docs, Codenarc, Groovy, and Gradle.

Presentations

The path to build happiness Session

Jenn Strater walks you through the best practices she's learned since transitioning from a software engineer at various product companies to working for a company that focuses on build automation.

Molly Struve is the lead site reliability engineer at DEV. During her time in the software industry, she’s had the opportunity to work on some challenging problems. These include scaling Elasticsearch, sharding MySQL databases, and creating an infrastructure that can grow as fast as a booming business. When not making systems run faster, she can be found fulfilling her need for speed by riding and jumping her show horses.

Presentations

Creating a scalable monitoring system that everyone will love Session

Molly Struve gives you the tools and strategies you need to build a monitoring system that will scale with your team and your infrastructure.

Łukasz Szczęsny has been an infrastructure engineer in many roles. As an early SRE at Uber on the observability team he developed parts of M3 and other monitoring infrastructure.

Presentations

M3 and Prometheus: Monitoring at planet scale for everyone Session

Yulia Trakhtenberg is a software engineering manager with more than seven years’ experience in software team building and management. With 15 years of experience in software development (both backend and frontend), including requirements, architecture, design, coding, testing, quality, and customer issues. Yulia’s expertise is with seeing the broader picture and the small details, combining deep technical knowledge with understanding of product, business and customer needs, all based on a solid understanding and proven experience in Agile management methodologies. Some of the technologies and methodologies and other -logies Yulia has experience with include: C++, Ruby, web, big data, Java, PHP, NoSQLs, Redis, MySQL, automation, management, recruitment, and team empowerment.

Presentations

A GDPR retrospective: Implementation by a large-scale data organization in reality Session

GDPR was likely one of the biggest challenges in data management that occurred in 2018. Yulia Trakhtenberg dives into a one-year retrospective about how it was executed in reality at a large-scale data organization.

James Turnbull is VPE at Glitch. A longtime member of the open source community, James is the author of a number of books about open source software. Previously, he was a CTO in residence at Microsoft, founder and chief technology officer at Empatico and Kickstarter, VPE of Venmo, and an adviser at Docker. James likes food, wine, books, photography, and cats. He is not overly keen on long walks on the beach or holding hands.

Presentations

Thursday Opening Welcome Keynote

Program chairs Chris Guzikowski, Ines Sombra Turnbull, and James Turnbull open the second day of keynotes.

Wednesday Opening Welcome Keynote

Program chairs Christopher Guzikowski, James Turnbull, and Ines Sombra open the first day of keynotes.

Robin van Zijll is a site reliability engineer and product owner on the SRE team at ING in Amsterdam. He has years of experience being on call for all functionalities used by retail banking customers.

Presentations

Revolutionizing a bank: Introducing service mesh and a secure container platform Session

Dafydd Vaughan (he/him) is an affiliate at Public Digital and a digital technology specialist with extensive experience in transforming complex legacy IT and coaching technology teams as they build new digital services. Previously, he was a senior technical architect at the Government Digital Service, where he led teams that transitioned Driver and Vehicle Licensing Agency’s (DVLA‘s) IT services back in-house, and delivered new digital services such as viewing your driving license, buying and selling vehicles, and taxing your vehicle; he spent two years leading the team that operated Gov.uk; and he set up a consultancy, Cedyrn, to advise large organizations on the replacement of legacy technology, adoption of public cloud services, and how to build strong digital teams.

Presentations

Solving problems with principles Session

Erik Veld is a developer advocate at HashiCorp and has over 15 years’ experience working with cloud and infrastructure. Previously, he was a consultant at Xebia and also founded Instruqt, a hands-on learning platform that uses real infrastructure to teach cloud and DevOps tooling.

Presentations

Bridging the gap between brownfield and greenfield applications with a service mesh Tutorial

Rosemary Wang is a developer advocate for HashiCorp. She interfaces with users and partners to find creative open source solutions for infrastructure. An infrastructure automation explorer and cloud enthusiast, Rosemary works to bridge the technical and cultural barriers between infrastructure engineers and application developers. She has a talent for solving intractable problems with code, whether this means helping an infrastructure engineer learn to code or an application developer troubleshoot infrastructure failures. When Rosemary isn’t drawing on whiteboards, she valiantly attempts to hack stacks of various infrastructure systems on her laptop while foraging for tasty victuals (especially pizza) around the world.

Presentations

Test-driven development (TDD) for infrastructure Session

In software development, test-driven development (TDD) is the process of writing tests and then developing functionality to pass the tests. Rosemary Wang explores methods of adapting and applying TDD to configuring and deploying infrastructure as code.

Heidi Waterhouse is a developer advocate with LaunchDarkly. She delights in working at the intersection of usability, risk reduction, and cutting-edge technology. One of her favorite hobbies is talking to developers about things they already knew but had never thought of that way before. She sews all her conference dresses so that she’s sure there is a pocket for the mic.

Presentations

Everything is a little bit broken; or, The illusion of control Keynote

We never change the amount of work or technical debt; we just shift it, and with it, we change how it emerges and appears. Heidi Waterhouse explains how you can handle this level of uncertainty.

Free as in puppies: Toward a better build versus buy understanding Session

Join Heidi Waterhouse to learn about how it's easy to miss several important parts of the calculation when you make a build versus buy calculation, including maintenance, updating, security, availability, and finding operators.

Shannon Weyrick is vice president of architecture at NS1. A 20-year veteran of internet infrastructure, Shannon is an accomplished technical architect, developer, and leader whose experience encompasses both development and operations of globally distributed platforms. Previously, Shannon worked at INAP and F5. A regular open source contributor, he has led and worked on a wide range of infrastructure projects from high-performance servers to novel programming languages and runtimes, and he enjoys writing and speaking at industry conferences.

Presentations

Chaos engineering and war gaming with a flamethrower Session

DDoS mitigation is an ever-evolving art. Architectures change, attackers get more creative, and keeping the team and platform ahead of the curve is a constant battle. Chaos engineering to the rescue. Shannon Weyrick examines using DDoS war games as a means of keeping the platform resilient, the team’s skill set polished, their tools in top shape, and their spirits and confidence high.

Jamie Wilkinson is a site reliability engineer at Google. He’s a contributing author to the SRE Book and has presented on contemporary topics at prominent conferences such as Linux.conf.au, Monitorama, PuppetConf, Velocity, and SRECon. His interests began in monitoring and the automation of small installations and have continued with human factors in automation and systems maintenance on large systems. Despite his more than 15 years in the industry, he’s still trying to automate himself out of a job.

Presentations

Configuration is riskier than code Session

Config changes cause more outages than code changes. Jamie Wilkinson introduces you to the UTM theorem, which shows that your application is a terrible software interpreter, and you should apply the same risk mitigation as for code deployment. This is harder with config languages; an application isn't a full interpreter. You can address this risk with progressive rollouts and config minimization.

Sabine Wojcieszak is the enthusiastic Agile and DevOps enabler at getNext IT, a Kiel, Germany, based consultancy. As a coach, she helps teams and organizations to improve their teamwork and their communication in order to grow. She enables people to work in an agile way or to adopt the DevOps mind-set and focuses on the human part of such evolutionary process. Her personal interest is in the topics of New Work and the new approach of dealing with failure. Sabine is a well-known speaker at international tech conferences, author of several articles, and one of the devopsdays Kiel organizers. She’s one of the founders of the Kiel region community of Softwerkskammer.org—the German Software Craftsmenship community—and cofounder of the Slack Friday and Scalable Thinking meetups. For the Industrial DevOps open source research project Titan, Sabine is the community manager. She also lectures on topics of Agile project management, DevOps, and open source at the University of Applied Science in Kiel.

Presentations

A fairy tale about habits; Or what we can learn from Cinderella and her peers in DevOps Session

Like Cinderella's "The good in the potty, the bad in the croppy," Sabine Wojcieszak explains why you should take a closer look at your habits and decide which of them will support your DevOps endeavors and which will harm them.

Yoshi Yamaguchi is a developer advocate on Google Cloud Platform at Google, where he works on system observability. Yoshi has been leading and supporting developer communities over 10 years.

Presentations

Observing and understanding distributed systems with OpenTelemetry Tutorial

Christine Yen is the cofounder of Honeycomb, a startup with a new approach to observability and debugging systems with data. Christine has built systems and products at companies large and small and likes to have her fingers in as many pies as possible. Previously, she built Parse’s analytics product (and leveraged Facebook’s data systems to expand it) and wrote software at a few now-defunct startups.

Presentations

Observability: Understanding production through your customers' eyes Keynote

By empowering you to ask new questions of your software, observability fuels curiosity about the world as it is, not how you expect it to be. In the end, after all, Christine Yen explains, "Nines don't matter if your users aren't happy."

Brian Young is the vice president of technology at Penguin Random House. He’s a 20-year veteran of the software industry, with startup and enterprise tech leadership experience across a range of businesses.

Presentations

Serverless, automated, personalized book recommendations at scale on AWS Session

The Momentum Engine is a serverless application built on AWS using Lambda, DynamoDB, Kinesis, SQS, EMR and data pipelines, and S3, allowing Penguin Random House (PRH) to use continually updated, data-driven analytical models to automatically compose and send personalized book recommendations emails to its subscriber base—at scale. Brian Young details PRH's journey with this system.