September 19–20, 2016: Training
September 20–22, 2016: Tutorials & Conference
New York, NY

Speaker slides & videos

Presentation slides will be made available after the session has concluded and the speaker has given us the files. Check back if you don't see the file you're looking for—it might be available later! (However, please note some speakers choose not to share their presentations.)

Accelerated Mobile Pages (AMP) deliver a fast user experience for mobile devices by placing restrictions on the AMP markup and its ability to run code. Without code, how can we gather analytics? How can we validate that our AMP pages are fast? Nic Jansma and Nigel Heron explain how to gather RUM for AMP.
As Uber broke its monolith into microservices, monitoring became increasingly difficult. No single service could answer a critical question: is the business running? Akshay Shah and Michael Hamrah share the challenges Uber faced when monitoring business outcomes instead of engineering metrics and why building an anomaly detection system to solve those problems is easier than you might expect.
Katherine Daniels explains how to take the same principles that allow us to build effective DevOps environments within engineering departments and use them to create organizations and communities that encourage sustainable participation from a wider range of people, building bridges between people, teams, and communities so that everyone can thrive and produce the best work possible.
Mike McGarr and Andrew Spyker explain how Netflix has been evolving its existing cloud-native, Internet-scale service to add support for containers while integrating with a broad existing VM-based cloud infrastructure. Mike and Andrew explore how containers are improving developer productivity and deepening Netflix’s control over resource management given its already proven cloud architecture.
Karl Isenberg demonstrates how to build and roll out containerized applications in an automated fashion and outlines best practices around Docker image builds, taking footprint and security aspects into consideration.
Ken Gardner explores the latest innovations in performance analytics and explains how data science can be used in surprising ways to visualize and prioritize improvements that lead to peak performance.
Tim Kadlec and Patrick Meenan explain how the construction of websites and applications impacts performance as well as how to quickly debug and resolve performance issues. Tim and Patrick dive into how browsers work, how web pages are delivered, backend and frontend issues, optimizations, and techniques to get the best performance and provide hands-on experience for working on web performance.
Ashish Kuthiala presents research-based findings on the factors that play the most important roles in accelerating DevOps adoption and results and offers a case study on how Hewlett Packard Enterprise encouraged a mindset of collaboration and continuous improvement among its globally distributed teams to achieve faster innovation cycles across both the IT and business teams.
Microservices are becoming increasingly popular. Reshmi Krishna discusses how companies like Uber and Twitter analyze their microservices latency in real time and demonstrates how to integrate popular distributed tracing tools like Zipkin into an existing application with just a few lines of code.
At a certain point of complexity, systems are nearly impossible to understand. So how do you stay reliable when you can't keep the whole system in your head? Tom Croucher discusses the approaches that Uber takes to ensuring its systems stay reliable by exploring real outages and the lessons they teach us.
Dropbox has grown enormously since launching in 2008, surpassing 500 million sign-ups and 500 petabytes of user data. Its Databases team of SREs manage thousands of MySQL servers, which requires a deep understanding of MySQL, automation, Linux, Python, hardware, performance tuning, monitoring, and reliability. Tammy Butow offers an overview of how Dropbox database infrastructure has scaled.
Sunil Shah and Roger Ignazio introduce DC/OS, a recently open sourced distributed operating system based on Apache Mesos, and demonstrate how to integrate it with Jenkins, allowing you to set up a continuous delivery pipeline that takes an application composed of microservices from code repository to Docker Hub to a staging or production server with seamless automation.
In recent years at Ancestry, a focus on new features and new markets has come at the expense of perceived performance. Jed Wood describes the progress Ancestry made in 2016, significantly improving page load times, overhauling how it measures performance, and starting a cultural shift to make speedy pages a requirement shared by designers, product managers, developers, and marketers.
On June 4, 2015, GoDaddy deployed its fastest India home page. The next day, user purchases shot up several percentage points. Soon after, GoDaddy rolled out the solution to all of its sales sites. Jim Pierson explores the technical, operational, and organizational strategies that helped to achieve this, along with the specific tools and changes that worked best (and which didn’t).
HTTP/2 has been ratified for months, and browsers have already committed to supporting it. Everything we hear tells us that the new version of HTTP will provide significant performance benefits while requiring little to no change to our applications. But deploying HTTP/2 may not be as easy as it seems. Hooman Beheshti examines HTTP/2's core features and how they relate to real-world conditions.
Monitoring data from tools like OpenTSDB is typically used for dashboards and alerts, but applying techniques used in the data science, financial, and scientific computing fields to real-time monitoring data can drive deeper understanding about infrastructure. Rob Claire introduces the monitoring tools Pinterest uses and offers real-world examples of problem solving with data monitoring.
Infrastructure as code leverages Agile engineering tools and practices to provision and manage highly reliable infrastructure at speed. Building on the concepts from his book, Infrastructure as Code, Kief Morris explains how a team can implement a change-management pipeline to create a fast, reliable process for testing and hosting infrastructure for their microservices-based system.
Engineers who become managers are experts at the technical aspects of their job, but they are often unprepared for the human and political challenges they face. Marcus Blankenship teaches engineering leaders a framework for building strong relationships with their teams, creating a driven culture, and communicating upward and outward to benefit their teams.
Advertising has been foundational to the economics of the Web. But consumers have consistently expressed annoyance with advertisements that, ostensibly, get in way of the content they seek. Tony Ralph explains why the increasing prevalence of ad blocking and the related debate that has ensued could be the key to inciting progress.
The cloud holds the promise of bottomless capacity, available instantly. Recently, Capital One has been shifting a significant portion of its workload to the public cloud. Kevin McLaughlin explores what capacity management looks like in the cloud, which old concepts still apply, which should be retired, and what new metrics become important in the process.
Infosec failures are plentiful. Every day, we patch symptoms but fail to address the reasons for these behaviors. Jan Schaumann explains that it's time we come to terms with the fact we can't solve the (infosec) world's problems until we understand how people interact with the systems we build and how they make their choices.
One of the most exciting (and difficult) decisions for a software company to make is to rebuild its infrastructure. But once teams are spun up and start working, then what? Yvette Pasqua, Wayne Folkes, and Alessandro Bologna offer a practical perspective on how to successfully plan, design, build, and ship infrastructure, sharing lessons from the front lines of Meetup's recent redesign.
Matthias Rampke offers a brief history of SoundCloud's internal and external load balancing and explains how SoundCloud mixes and matches open source technologies to cover all its load balancing needs without the need for expensive hardware.
Alois Reitbauer describes Dynatrace's endeavor to build a conversational interface on top of artificial intelligence that makes performance data accessible by answering natural language questions and simplifying dialogs to expand the use of operational data beyond experts.
Single-page app frameworks present performance and monitoring challenges: they are processing intensive, and monitoring tools do not accurately track UX, masking users' pains from performance teams. Boris Livshutz and Manuel Alvarez explore the importance of selecting the right framework, the power of isomorphic JavaScript, and how to use the User Timing specification.
Seth Vargo walks you through modern infrastructure paradigms and techniques such as immutable infrastructure, service discovery, modern security, and application deployments using Terraform and Consul so you can spend more time doing the important things in life.
Building serverless applications foregoes most worries about the underlying infrastructure and its reliability. Yet, applications still need to be managed, debugged, and monitored. In a way, serverless is like building applications without an operating system. Eric Windisch explores the challenges of managing these serverless applications and pokes at the answers too.
To people responsible for the health of web properties, monitoring, measurement, and analysis of real-user experiences is key to delivering a consistent and delightful experience. Philip Tellis and Nic Jansma explore various tools and methods available to measure real-user performance in the browser.
Drawing on her experience at Uber, Susan Fowler explains how to smash a monolithic API into many hundreds of containerized microservices for fun and profit—and how to create a posture of resiliency and reliability around an infrastructure that grows and changes daily with incredible velocity.
Netflix Hystrix is a well-known OSS framework for Circuit Breaker in Java. It is lesser known for its application in operational monitoring and debugging production issues, which is one of the biggest usages of Hystrix inside Netflix. Billy Yuen explains how to apply Hystrix to existing applications with undocumented dependencies and implement DevOps monitoring with Hystrix Metrics.
More than ever, performance testing is central to ensuring a great user experience—the driving factor of customer satisfaction and business growth. Join Gaspare Marino for an interactive demonstration of load and performance testing basics.
Datadog had 400 VMs in AWS, was ingesting millions of metrics per second, and was having pain around service discovery and quick configuration changes. Darron Froese discusses how Datadog integrated Consul into its environment, mistakes made and lessons learned, and some tips for successful implementation in your own environment.
Florian Weingarten gives a brief history of how Shopify evolved from a single-database, single-data-center application to a multidatabase, multi-data-center setup and explains how Shopify performs fully scripted data-center failovers without considerable downtime.
Automation has reached a point where the CI/CD workflow from commit to deploy is controlled in some way by bots. Multitenant CI/CD platforms often have permissions to access a wide range of systems and services and hence can be an attractive target for attackers. Binu Ramakrishnan highlights current security risks and CI/CD threat modeling and offers novel solutions to mitigate those risks.
Security techniques have generally focused on protecting users by blocking requests going to the origin, but security is also a concern at the browser. Sonia Burney and Sabrina Burney explore how security can be enforced at the browser level through a combination of optimization techniques and security enhancements, which overall provide an optimal end-user experience.
Things tend to move quickly in a startup environment. When you're crushing code like candy and pulling requests like taffy, security can often fall by the wayside. Kelly Lum discusses her experiences working for both large corporations and startups, what works for one versus the other, and how to keep up with a break-neck pace while still producing hacker-resilient code.
In a serverless world, your ops team is your first, last, and only line of defense against some of the worst scum in the universe. Rachel Chalmers explains what that really means for security, networking, support, and most importantly, culture.
The DevOps community is becoming increasingly aware of incident response. Rare major events get lots of attention, but IT insiders encounter incidents much more often—sometimes on a daily basis. Richard Cook and David Woods offer an overview of a SNAFUcatchers Consortium project to study the interwoven activities of incident response and characterize the resources that make incident response work.
For more than 30 years, the DNS has been one of the fundamental protocols of the Internet. Businesses frequently bundle their DNS with other purchases or deploy it in a very vanilla fashion. Phil Stanhope explores the “hidden” power of managed DNS for building and running high-performance, distributed web applications.
Pete LePage explores the future of the mobile Web as it relates to performance improvements, offline support, and access to new device capabilities.
Rob Burke describes GameStop's migration of 12 ecommerce websites with millions of users from dedicated hardware to the cloud in three months. Because they were just "lifting and shifting," nothing should have changed—but it did. Now GameStop is facing some difficult challenges. Rob shares the lessons learned and examines the challenges that remain.
What is required to operate microservices at scale? Beyond containers, schedulers, and frameworks, what is actually required to turn hundreds of services, tens of thousands of machines, and millions of requests per second into a unified, performant application? Oliver Gould explores the evolution of Twitter's stack and the surprising glue that held it together: layer 5, the session layer.
We rely on our customer's browser to process large JavaScript applications, communicate in real time, and execute code from third parties, but the increasing diversity of customer devices and network connections means that ensuring a fast experience for all customers is nearly impossible. Simon Hearne explains why the CPU will soon be the biggest challenge for web performance, if it isn't already.
Engineers who become managers are experts at the technical aspects of their job, but they are often unprepared for the human and political challenges they face. Marcus Blankenship teaches engineering leaders a framework for building strong relationships with their teams, creating a driven culture, and communicating upward and outward to benefit their teams.
Seth Vargo walks you through modern infrastructure paradigms and techniques such as immutable infrastructure, service discovery, modern security, and application deployments using Terraform and Consul so you can spend more time doing the important things in life.
Andrew Shafer shares Pivotal's mission to bring the spirit and practice of Velocity to the enterprise. Pivotal helps the largest organizations in the world modernize their tools, processes, and mindsets. The problems aren’t all solved, and the story isn’t over. What role will you play?
We are witnessing an explosion in the sheer mass and velocity of data. But this data is most useful if the actual builders and operators—the people with all the context—can understand it and react to it quickly. Ozan Turgut discusses how to use visualization and analytics to turn data into leverage for decision making.
The mission of the United States Digital Service is straightforward but not easy: apply best practices in technology and design to improve the usability and reliability of the government’s most important digital services. That’s exactly what administrator Mikey Dickerson and his team have been doing for the past two years, since establishing this startup at the White House.
Google partnered with SOASTA to train a machine-learning model on a large sample of real-world performance, conversion, and bounce data. Patrick Meenan and Tammy Everts offer an overview of the resulting model, able to predict the impact of performance work and other site metrics on conversion and bounce rates.
As businesses push for more speed to market, more cost efficiencies, and more scale, IT ops teams face increasing pressure to monitor more and more things and analyze more and more data. Mehdi Daoudi examines the factors contributing to the demand for deeper analysis of increasingly large and diverse datasets and questions whether the industry’s current arms race is truly driving business results.
It has been 17 years since Ben Galbraith and Dion Almaer talked about Web versus apps at Velocity—and a lot has changed. Join them as they provide new perspectives based on the evolving landscape and consider how new mediums such as voice factor in, all with a touch of their new "Googley" perspective.