Rowan Cota explains how BuzzFeed created a strong SRE team by growing the engineers it needed instead of waiting for them to fall out of the sky—and how you can too. Rowan turns narrative examples into a framework that anyone can use to harness the power of growing potential to diversify and strengthen their teams.
In deep learning, individual "neuronal" computational actions are simple but aggregated over several neurons and iterated through several computation cycles can solve complex problems. What if we swapped out neurons for people? What kinds of complex problems could we solve? Omoju Miller offers a new vision for the global brain, where we harness human action for a better future.
Beyond looking out for a little green padlock in the browser bar, what do you need to know about secure connections as a programmer? What do people mean by terms like authentication, verifying a certificate, or signing a message? Join Liz Rice as she demystifies HTTPS, TLS, X.509, and more.
A skilled incident commander improves time to resolution and decreases stress all around. Join Beth Long and Elisa Binette to learn how to build strong incident management skills at the individual level and shape organizational processes to drive down MTTR, making both customers and engineers happier.
Amy Boyle walks you through building, scaling, and monitoring a stream processing pipeline.
John Chapin explains how—in this brave new world of managed services and platforms—you can use serverless technologies and an infrastructure-as-code mindset to architect, build, and operate resilient systems that survive even massive vendor outages.
Can you develop and maintain applications using Kubernetes? That’s the question more employers are asking these days. Take the next step in your career by becoming a Certified Kubernetes Application Developer. You get a full day of test prep from O’Reilly’s top Kubernetes trainer and the opportunity to take the exam onsite, leaving an official Certified Kubernetes Application Developer.
Elisa Heymann and Bart Miller explain how they performed an in-depth assessment of software controlling maritime container shipping, exposing opportunities for an attacker to smuggle goods or divert shipments and even damage personnel and ships.
Crystal Hirschorn explores incident planning, postmortem-driven development, chaos engineering, and observability practices and details the exponential effect they can have on leveling up your engineering organization, one controlled chaos experiment at a time.
Hiring is time consuming and infrequent enough that it's hard to get much practice, but it has far-reaching consequences for your team's happiness and productivity. Marisa Brandt draws on a broad mix of best practices and personal experience to detail an approach to developing a concrete strategy that addresses your unique hiring needs.
Looking at a service in isolation in a multiservice architecture simply does not give you enough information. Distributed tracing tools shine a light on the relationship between components. José Carlos Chávez explains how distributed tracing works, what you can use it for, and how tools like Zipkin can help.
Euan Finlay shares practical tips and advice on setting up an incident response framework, what to do when "everything is on fire," and how to improve things afterward—along with some horror stories of his own.
Simon Lasselsberger shares diagrams that illustrate the evolution of the architecture of the eight-year-old Runtastic microservice backend.
Data friction was a major roadblock for Choice Hotels, preventing the company from achieving its strategic vision for growth and innovation. Propagating the right data to the right teams was a costly and inefficient process that took weeks of manual effort—especially during peak seasons. Nick Suwyn explains why and how Choice Hotels adopted a DevOps and DataOps approach to solve this issue.
Tracy Miranda discusses GitOps, Jenkins X, and what the future of CI/CD for Kubernetes should look like.
Jermila Paul Dhas explains how the cloud enablement team at the Financial Times proactively validates the security and compliance of the FT's entire cloud estate.
Join Paul Johnston to explore and better understand distributed architectures and automation. Is it about pragmatism and simplicity? What are the skills that we have dismissed as only relevant for certain scenarios that should be used within a serverless environment? How do you hire for this new approach? Is it about code challenges or more abstract problem solving?
Serverless introduces a number of challenges to existing tools for observability, so you need to adapt your practices to fit this new paradigm. Yan Cui explains how to build observability into a serverless application, demonstrating how to implement log aggregation, distributed tracing, and correlation IDs through both synchronous and asynchronous events.
If you are here, then you’ve likely made it past the breakwaters of your career. You face vast stretches of career that can be met with little more than what you already have. And therein lies the threat. Katrina Owen explains why you must design your own crucible.
The transition from perpetuating exclusionary behaviour to actively working for inclusion is easier than you think. We'll explore how to contribute in valuable ways to discussions on culture and inclusivity, as well as how not to do it.
Taylor Thomas explains why Kubernetes is not a developer tool and cuts through other common misconceptions. Along the way, you'll learn how Kubernetes provides powerful abstractions for running and operating applications.
The excessive and dirty energy use of data centers is one of the biggest ethical issues facing the tech industry today—and one that, ironically, Kubernetes was originally invented to help address. However, that's not how it's being used. Anne Currie dives into the ethics of data centers and explores Kubernetes's role.
Imagine having millions of years of experience in developing and operating complex distributed systems? What if we could reverse-engineer nature's strategies for high-performance, resilient, and secure systems? From organisms to ecosystems, Claire Janisch explores some of the best biomimicry opportunities inspired by nature's software and wetware.
Maxime Petazzoni explains why monitoring custom application metrics is essential for visibility into the internal workings of a system and shares a framework for properly instrumenting them, along with a number of relevant use cases.
Adrian McMichael explores property portal Rightmove's structured approach to logging and monitoring across more than 50 microservices, showing you how to get to the bottom of production issues and helping you drive improvement and a sense of ownership in your projects.
Your company may not have a DBA explicitly in the org chart, but that just means everyone on the team takes over the role at some point. Silvia Botros explains why that’s ok (all the DBAs she knows were accidental too) and shares tips and tricks for keeping your MySQL databases running smoothly and protecting your company’s most important asset.
In the vastness of the Cloud Native ecosystem, we find ourselves overwhelmed with complexity and a promise that efficiency will one day outweigh this complexity. In this presentation, we take a look at the new era of the Cloud Native space and the kernel that has made this all possible: Kubernetes.
Bridget Lane and Marcelo Mandolesi share the USA TODAY NETWORK's two-year journey learning, migrating to, and running Kubernetes, exploring the hidden costs, expectations broken, the company's process of evangelizing, and why they would do it all over again.
Data scientist Jane Adams examines the ways in which these strategies actually fail to achieve the intended result, and more importantly how they perpetuate discriminatory hiring practices.
Félix López Luis offers an introduction to gossip protocols, using a simulator to demonstrate how they behave when there are challenges like network partitions and faulty nodes.
We all know how to build web apps around a central server. Increasingly, we want to develop apps in which the user can still get work done while offline and that sync their data the next time an internet connection is available. Martin Kleppmann shares recent computer science research that is helping develop the abstractions and APIs for the next generation of applications.
The Kubernetes API is extensible, allowing you to create your own resources that behave like native ones. Philippe Martin explores the tools, concepts, and a real example of a custom resource that simplifies the deployment of a complete content delivery network (CDN).