
Schedule: Operations sessions

This track covers building resilience into applications and infrastructure, operations escalation and outage handling patterns, BYOD, networking, security, metrics and monitoring, hybrid cloud implementations, and more.

Track Host

John Allspaw has worked in systems operations for over fourteen years in biotech, government, and online media. He started out tuning parallel clusters running vehicle crash simulations for the U.S. government, and then moved on to the Internet in 1997. He built the backing infrastructures at Salon, InfoWorld, Friendster, and Flickr. He is now VP of Tech Operations at Etsy, and is the author of The Art of Capacity Planning, published by O'Reilly.

 
Operations
Mission City Ballroom B4
Theo Schlossnagle (OmniTI/Circonus)
Average rating: 4.24 (25 ratings)
The more complicated our stacks become, the more difficult it becomes to diagnose slowness. In this session, we'll walk through hands-on techniques to measure and diagnose slowness in complex web applications. We'll dive into an architecture through all of its unsavory layers and emerge covered in muck...and faster.
Operations
Mission City Ballroom B4
Dan Slimmon (Exosite)
Average rating: 4.24 (17 ratings)
Borrowing the medical concepts of specificity and sensitivity, Dan will show how deceptive this tradeoff can be. He'll also make the case that putting in the extra effort to minimize both types of falsehoods is necessary and healthy. When the alarm goes off, you shouldn't have to spend precious minutes sniffing for smoke.
Operations
Mission City Ballroom B4
J. Paul Reed (Release Engineering Approaches)
Average rating: 4.10 (29 ratings)
As operational failure becomes more acceptable within our industry, the necessity for holding constructive, actionable postmortems increases. This tutorial will take attendees through postmortem techniques, pitfalls, and tools that they'll be able to take back to their own organizations to learn to run productive postmortems.
Operations
Mission City Ballroom B4
Ryan Frantz (Etsy), Laurie Denness (Etsy)
Average rating: 4.70 (27 ratings)
Being on call is stressful. You feel like you can't plan anything around your on-call rotation because you never know what to expect. It seems there's more chaff than wheat in those alerts, and phantom alerts disturb your sleep. Etsy’s Operations team set out to quantify the on-call experience, identify its pain points, and use those data to make being on call more bearable.
Operations
Mission City Ballroom B4
Adam Jacob (Chef)
Average rating: 4.92 (51 ratings)
What does it mean to be great at Operations? This talk aims to answer that question and provide actionable guidance on 12 of the core facets of our complicated profession. If you're new to Operations, this talk will set you off on the right foot; if you're a longtime veteran, it's never a bad idea to revisit the fundamentals.
Operations
Mission City Ballroom B4
Toufic Boubez (Metafor Software)
Average rating: 4.00 (20 ratings)
You’ve set up thousands of metrics collecting millions of data points. Now what? Analyzing this mountain of data is not easy, so you keep an eye on just a small fraction of it, or you run some simple analytics that don’t get you much. But there are some basic statistical methods that anyone can implement, and they can provide surprisingly valuable insights. In this talk, Toufic will show you some.
Operations
Mission City Ballroom B4
Vibhav Garg (Twitter), Arun Kejariwal (Machine Zone)
Average rating: 2.71 (17 ratings)
High performance and availability are highly dependent on the capacity allocated to a service. The ability to forecast when a service is expected to be under capacity is key to maintaining an efficient and highly performing infrastructure. We present a way to statistically forecast the number of days a service can run before its performance is expected to degrade based on pre-determined criteria.
Operations
Mission City Ballroom B4
Bryan Cantrill (Joyent)
Average rating: 4.89 (18 ratings)
Many expect the Internet of Things to finally take shape this year. But given the volume of data headed our way, the demands on infrastructure will greatly outpace its current capabilities. In this session, Joyent’s SVP of Engineering, Bryan Cantrill, discusses how current technologies must adapt before performance capabilities can keep pace and deliver on the promise of the Internet of Things.
Operations
Mission City Ballroom B4
Mike Krieger (Instagram)
Average rating: 4.07 (15 ratings)
Integrating Instagram into Facebook's Infrastructure
Operations
Mission City Ballroom B4
Mark Burgess (Cfengine)
Average rating: 4.06 (16 ratings)
There are many tools for software building and what passes for process orchestration today, but two things are missing: a modern model-based approach, and the simplicity of the trusty "make" command, with handling of distributed dependencies. Mark Burgess shows how a promise-oriented approach, using CFEngine, can deliver both of these properties and more.
Operations
Mission City Ballroom B4
Sarah Novotny (NGINX)
Average rating: 4.67 (21 ratings)
You know NGINX is great as a web server or a proxy, but there's so much more.
Operations
Mission City Ballroom B4
Chris Baker (Dyn)
Average rating: 3.33 (6 ratings)
A treatment of the human element involved in operating, monitoring, and understanding complex systems. This talk takes an example-driven look at how individual or group bias can impact time to identify, time to mitigate, and time to resolve issues in a production system.
Operations
Grand Ballroom CD
Tutorial Please note: to attend, your registration must include Tutorials on Tuesday.
James Wickett (Signal Sciences), Gareth Rushgrove (Puppet Labs)
Average rating: 3.75 (59 ratings)
We are releasing our applications at an ever-increasing rate, leaving little time for formal security testing. Doing a penetration test once every 6 months and releasing 6 times a day is a recipe for disaster. But the automation that makes releasing at speed possible provides the perfect platform for building a comprehensive suite of automated security tests. This workshop will show you how.
Operations
Grand Ballroom CD
Tutorial Please note: to attend, your registration must include Tutorials on Tuesday.
James Turnbull (Empatico)
Average rating: 3.91 (70 ratings)
You've heard the hype about Docker and container virtualization; now see it in action. This tutorial will introduce you to Docker and take you through installing it, running it, and integrating it into your development and operational workflow.
Operations
Grand Ballroom CD
Tutorial Please note: to attend, your registration must include Tutorials on Tuesday.
Vladimir Vuksan (Fastly)
Average rating: 2.31 (26 ratings)
Ganglia is widely used monitoring software, run by companies of all sizes, including Pinterest and Etsy. In this tutorial, Vladimir will show some of the common uses of Ganglia and how it can help you detect issues, aid in corrective action, and understand your infrastructure.
Operations
Grand Ballroom CD
Tutorial Please note: to attend, your registration must include Tutorials on Tuesday.
Dale Hamel (Shopify)
Average rating: 2.67 (42 ratings)
Keeping track of a fleet of servers is time-consuming, and poor techniques, such as spreadsheets, lead to headaches as your infrastructure grows. A single Source of Truth is invaluable in capacity planning and for maintenance. We'll discuss how we used various open source tools to maintain our own source of truth, and how doing so enabled us to automate and streamline our infrastructure management.
SOLD OUT
Culture & Organizational Change | Operations
Grand Ballroom F
Tutorial Please note: to attend, your registration must include Tutorials on Tuesday.
Adele Shakal (Metacloud, Inc.)
Average rating: 3.90 (21 ratings)
Keeping IT participants engaged in a drill simulation can be very challenging, especially within the broader contexts of emergency operations, continuity planning/resiliency, disaster recovery, and IT architecture. Accept this challenge and become a practical gamemaster, worthy of designing and executing drills on likely emergency scenarios and realistic function failures for your organization.