Build Systems that Drive Business
Sep 30–Oct 1, 2018: Training
Oct 1–3, 2018: Tutorials & Conference
New York, NY
 
Beekman/Sutton North
11:35am Networks, echolocation, and fish GIFs Victoria Nguyen (Fastly)
1:30pm SLO burn Jamie Wilkinson (Google)
3:50pm How to break up with your vendor Amy Nguyen (Stripe), Cory Watson (Stripe)
4:45pm From silos to a single pane of glass at USA TODAY NETWORK Bridget Lane (Gannett | USA Today), Kris Vincent (Gannett | USA Today)
Sutton South/Regent Parlor
1:30pm The container operator's manual Alice Goldfuss (GitHub)
2:25pm Linux, BPF, and containers Jessica Frazelle (Microsoft)
Nassau
1:30pm Managing multiple sources of truth in distributed applications Adam Wolfe Gordon (DigitalOcean)
Murray Hill
11:35am Building successful site reliability engineering in large enterprises Liz Fong-Jones (Honeycomb), Dave Rensin (Google)
3:50pm Small-scale engineering effie mouzeli (Wikimedia Foundation)
4:45pm The ops in serverless Jennifer Davis (Microsoft)
Gramercy
11:35am Lessons learned migrating HealthCare.gov to Terraform Christian Monaghan (HealthCare.gov | Nava PBC)
3:50pm Bulk image processing using Kubernetes Mike Newswanger (Elastic)
Sutton Center
3:50pm Easy CI/CD done right (sponsored by Verizon) Deepank Sharma (Verizon Wireless)
4:45pm
Grand Ballroom
9:00am Tuesday opening welcome Nikki McDonald (O’Reilly Media), James Turnbull (Glitch), Ines Sombra (Fastly)
9:05am Continuous Disintegration Anil Dash (Fog Creek Software)
9:35am The programmer's mind Jessica McKellar (Pilot)
10:20am Practical performance theory Kavya Joshi (Samsara)
10:40am Closing remarks Nikki McDonald (O’Reilly Media), James Turnbull (Glitch), Ines Sombra (Fastly)
10:45am Morning Break (Sponsored by Oracle Dyn) | Room: Sponsor Pavilion
3:05pm Afternoon Break (Sponsored by Oracle Dyn) | Room: Sponsor Pavilion
5:25pm Sponsor Pavilion Reception | Room: Sponsor Pavilion
12:15pm Lunch and Tuesday Topic Tables | Room: America's Hall 1 & 2
8:00am Morning Coffee | Room: 2rd Floor Promenade by Registration
8:15am Tuesday Speed Networking | Room: 2rd Floor Promenade by Registration
11:35am-12:15pm (40m) Monitoring, Observability, and Performance
Networks, echolocation, and fish GIFs
Victoria Nguyen (Fastly)
Victoria Nguyen explains how Fastly overhauled the monitoring and data collection of its globally distributed network without its caches noticing.
1:30pm-2:10pm (40m) Monitoring, Observability, and Performance Systems Monitoring & Orchestration
SLO burn
Jamie Wilkinson (Google)
Jamie Wilkinson offers a brief overview of SLOs, shares a practical guide to implementing sustainable SLO-based alerting for systems of any size, and outlines the tooling required to supplement the system in the absence of cause-based alerting.
2:25pm-3:05pm (40m) Monitoring, Observability, and Performance Resilient, Performant & Secure Distributed Systems
Using distributed trace data to solve performance and operational challenges
Naoman Abbas (Pinterest)
Naoman Abbas offers an overview of tools Pinterest built to process trace data and the use cases they’ve enabled and shares some real-world examples. Join in to learn how to apply these techniques to your own challenges.
3:50pm-4:30pm (40m) Monitoring, Observability, and Performance Resilient, Performant & Secure Distributed Systems
How to break up with your vendor
Amy Nguyen (Stripe), Cory Watson (Stripe)
You're unsatisfied with one of your monitoring providers. You've considered finding a new solution, but the thought of migrating your data off their platform sounds extremely painful. Amy Nguyen and Cory Watson explain how to make a deadline for an infrastructure-critical software migration while ensuring that everyone's requirements are met and no data has been lost.
4:45pm-5:25pm (40m) Monitoring, Observability, and Performance Systems Monitoring & Orchestration
From silos to a single pane of glass at USA TODAY NETWORK
Bridget Lane (Gannett | USA Today), Kris Vincent (Gannett | USA Today)
Three years ago, technical teams at USA TODAY NETWORK were completely siloed, making improvements and troubleshooting difficult and often blind to the rest of the technical organization. Bridget Lane and Kris Vincent explain how drastically the teams' tool belts, thought processes, and goals have changed as the company moved from silos to a single pane of glass.
11:35am-12:15pm (40m) Distributed Systems, Microservices and Containers Systems Architecture & Infrastructure
Frankenstein's microservices: How to avoid the monster
Michael Hamrah (Namely)
Many companies adopt microservices to break down monoliths, but they soon uncover a hidden cost: How do you manage all these new interconnected things popping up? Michael Hamrah explains how to avoid creating Frankenstein's monster by understanding elements of a microservice platform. . .so you can sleep at night.
1:30pm-2:10pm (40m) Microservices and Containers Systems Architecture & Infrastructure
The container operator's manual
Alice Goldfuss (GitHub)
Containers can be a great infrastructure solution, but no one should drive them without a manual. Alice Goldfuss discusses some of the advantages and disadvantages of running containers in production at scale.
2:25pm-3:05pm (40m)
Linux, BPF, and containers
Jessica Frazelle (Microsoft)
Jessica Frazelle explores some cool bits of Linux, including BPF and container technologies, and details new ways to trace various things in the kernel and how to even use these traces to hot patch kernels in the case of zero day vulnerabilities. Come for the jokes about Linux; stay for the live demos.
3:50pm-4:30pm (40m)
Microreleases: How to safely rollout complex changes at scale
Jeffrey Valeo (Grubhub)
Jeffrey Valeo explains how to safely rollout complex changes at scale.
4:45pm-5:25pm (40m) Microservices and Containers, Monitoring, Observability, and Performance Systems Monitoring & Orchestration
Debugging microservices apps via a sevice mesh, OpenTracing, and Squash
Idit Levine (solo.io)
Idit Levine demonstrates common debugging techniques and offers an overview of Squash, a new tool and methodology that brings the power of modern popular debuggers to developers of microservices apps that run on container orchestration platforms.
11:35am-12:15pm (40m) Distributed Data Resilient, Performant & Secure Distributed Systems
Smooth scaling: Slack’s journey toward a new database
Ameet Kotian (Slack)
Slack’s rapid growth over the last few years outpaced the original database’s scaling capacity, which negatively impacted the company's customers and engineers. Ameet Kotian explains how a small team of engineers embarked on a journey for the right database solution, which eventually led them to Vitess, an open source cluster database.
1:30pm-2:10pm (40m) Distributed Systems Distributed State
Managing multiple sources of truth in distributed applications
Adam Wolfe Gordon (DigitalOcean)
When building distributed applications, it's highly desirable to maintain a single source of truth, such as a database, for all application state. Unfortunately, for some applications, multiple sources of truth are unavoidable. Adam Wolfe Gordon shares strategies, learned from real-world experience, for managing multiple sources of truth without sacrificing consistency and usability.
2:25pm-3:05pm (40m) Distributed Data Resilient, Performant & Secure Distributed Systems
Trade-offs in resiliency: Managing the burden of data recoverability
Kristina Bennett (Google)
Kristina Bennett shares best practices for practical data recoverability and shines a light onto some of the pitfalls awaiting the unwary, based on lessons learned from five years of data integrity tooling and consulting across Google.
3:50pm-4:30pm (40m) Distributed Data, Distributed Systems Resilient, Performant & Secure Distributed Systems
Kafka Streams in practice: What works and what doesn’t (yet)
Bart De Vylder (CoScale)
Bart De Vylder shares his experience migrating an existing codebase and production environment to Kafka Streams, a relatively new and promising streaming library. Join in to see what aspects worked remarkably well and the challenges he ran into along the way.
4:45pm-5:25pm (40m) Distributed Data Resilient, Performant & Secure Distributed Systems
Sell cron, buy Airflow: Modern data pipelines in finance
James Meickle (Quantopian)
Quantopian integrates financial data from vendors around the globe. As the scope of its operations outgrew cron, the company turned to Apache Airflow, a distributed scheduler and task executor. James Meickle explains how in less than six months, Quantopian was able to rearchitect brittle crontabs into resilient, recoverable pipelines defined in code to which anyone could contribute.
11:35am-12:15pm (40m) DevOps and SRE Systems Monitoring & Orchestration
Building successful site reliability engineering in large enterprises
Liz Fong-Jones (Honeycomb), Dave Rensin (Google)
Implementing site reliability (SRE) engineering doesn't have to be intimidating, and it isn't only for cloud-native organizations. Liz Fong-Jones and Dave Rensin share eight key lessons Google's customer reliability engineering team learned helping large enterprises adopt SRE as an operations engineering model.
1:30pm-2:10pm (40m) DevOps and SRE Systems Architecture & Infrastructure
How NTSB air disaster analysis can help you in an emergency
Matt Rogish (ReactiveOps)
Matt Rogish explains how NTSB investigations of air disasters have dramatically improved flight safety and applies lessons learned in disaster recovery and analysis, teamwork, task saturation, and systems design to modern software application and infrastructure architecture at scale to achieve higher availability, reduced errors, and more scalable systems.
2:25pm-3:05pm (40m) DevOps and SRE Resilient, Performant & Secure Distributed Systems
Disaster resilience the Waffle House way, from flattops to feature flags and more
Heidi Waterhouse (LaunchDarkly)
Waffle House's hurricane disaster plan has everything you could want from an IT disaster plan, including contact trees, failover states, and runbooks on partial operation. Heidi Waterhouse shares lessons about state drawn from the world outside computers and explains how to quantify them using a finite state machine and implement them automatically while you are in a less-than-perfect condition.
3:50pm-4:30pm (40m)
Small-scale engineering
effie mouzeli (Wikimedia Foundation)
Effie Mouzeli explains why small-scale engineering is just as challenging as large-scale engineering and offers ideas on how to survive technical debt, poor communication, and other everyday challenges.
4:45pm-5:25pm (40m) DevOps and SRE
The ops in serverless
Jennifer Davis (Microsoft)
Rather than a future of NoOps, serverless has increased the need for specialized operations engineering. Jennifer Davis explores the role of operations in serverless, covering testing, monitoring, and debugging functions.
11:35am-12:15pm (40m) How We Built It
Lessons learned migrating HealthCare.gov to Terraform
Christian Monaghan (HealthCare.gov | Nava PBC)
Christian Monaghan explains how he and his team successfully migrated HealthCare.gov, America's largest government website, to the cloud infrastructure provisioning tool Terraform, shares lessons learned along the way, and details how you can effectively use Terraform for your next project.
1:30pm-2:10pm (40m) Systems Engineering and Architecture Systems Architecture & Infrastructure
"Not invented here" syndrome and dark debt: The PagerDuty story
Aish Raj Dahal (PagerDuty)
Finding the right balance between writing custom in-house software and using an off-the-shelf solution is difficult. Aish Raj Dahal sheds light on the age old build versus buy problem and "not invented here syndrome" by explaining how PagerDuty built a distributed task scheduler and later moved off it to use an off-the-shelf open source solution.
2:25pm-3:05pm (40m) How We Built It Systems Architecture & Infrastructure
60,000 tests in six minutes: Create a reliable pipeline, eliminate flaky tests, and deploy safely but quickly
Sam Guckenheimer (Microsoft)
Good test coverage is essential for catching issues before a pull request has been merged, but they have to be the right kind of tests and must be reliable. Drawing on his experience at Microsoft, Sam Guckenheimer details what type of tests to do in your DevOps pipeline, when you should do them, and why.
3:50pm-4:30pm (40m) Distributed Systems, How We Built It Systems Architecture & Infrastructure
Bulk image processing using Kubernetes
Mike Newswanger (Elastic)
Mike Newswanger explains how he used Kubernetes and Google Cloud to burst and extend the capacity of a physical infrastructure for optimizing almost 10 million images in less than two weeks.
4:45pm-5:25pm (40m) How We Built It Systems Architecture & Infrastructure
Archaic to orchestrated: Ticketmaster's hybrid DevOps transformation
Heather Osborn (Ticketmaster)
Heather Osborn explains how Ticketmaster moved from a siloed on-premises environment to a DevOps hybrid cloud. If a company whose technology and human infrastructure have grown up organically around a custom-written VAX operating system can make the move to public cloud-native applications and begin a rapid march to a hybrid cloud solution, so can you.
11:35am-12:15pm (40m) Sponsored
Bot or human? Applying machine learning to combating the bot epidemic (sponsored by Oracle Dyn)
Laurent Gil (Oracle Dyn)
Bots now make up over 50% of website traffic and have become the primary source of malicious application attacks, from DDoS to sophisticated intrusions. Laurent Gil lays out what you need to know about bot traffic, different types of bots, and real-world applications of ML and AI to identify and defeat malicious bots.
1:30pm-2:10pm (40m) Sponsored
Performance anomaly detection at scale (sponsored by Salesforce)
Tuli Nivas (Salesforce)
Automated anomaly detection in production using simple data science techniques enables you to more quickly identify an issue and reduce the time it takes to get customers out of an outage. Tuli Nivas shows how to apply simple statistics to change how performance data is viewed and how to easily and effectively identify issues in production.
2:25pm-3:05pm (40m) Sponsored
API security: What you absolutely need to know now (sponsored by Oracle Dyn)
Laurent Gil (Oracle Dyn)
API-based integration is fundamental to business strategy and continued success, but the explosion of APIs is creating incremental security risks that must be addressed. Laurent Gil explains why API security is quickly becoming a key cross-cutting concern for everyone from DevOps to the CISO.
3:50pm-4:30pm (40m)
Easy CI/CD done right (sponsored by Verizon)
Deepank Sharma (Verizon Wireless)
Deepank Sharma offers an end-to-end look at continuous integration and continuous delivery (CI/CD), using Pipeline as Code and templates, quality gates, Docker, and Kubernetes while following best DevOps practices.
4:45pm-5:25pm (40m)
Session
9:00am-9:05am (5m)
Tuesday opening welcome
Nikki McDonald (O’Reilly Media), James Turnbull (Glitch), Ines Sombra (Fastly)
Cochairs Nikki McDonald, James Turnbull, and Ines Sombra welcome you to the first day of keynotes.
9:05am-9:25am (20m)
Continuous Disintegration
Anil Dash (Fog Creek Software)
As our industry faces its biggest reckoning ever with the social, ethical and cultural impacts of technology, what can we learn if we reflect on the assumptions we build into our systems? How could our processes and tools be designed to undo the biggest bugs and biases of today’s tech?
9:25am-9:35am (10m) Sponsored
Securing the edge: Understanding and managing security events (sponsored by Oracle Dyn)
Laurent Gil (Oracle Dyn)
Laurent Gil shares the latest cybersecurity research findings based on real-world security operations along with innovative approaches to managing and mitigating security events at the cloud edge.
9:35am-9:50am (15m)
The programmer's mind
Jessica McKellar (Pilot)
The programmer's mind is inherently an activist's mind. Jessica McKellar draws parallels between the free and open source software movement and the work to end mass incarceration.
9:50am-9:55am (5m) Sponsored
Test, measure, iterate: Balancing “good enough” and “perfect” in the critical path (sponsored by NS1)
Kris Beevers (NS1)
In critical path services such as DNS, stability is imperative above all else. Kris Beevers examines the trade-offs between risk and velocity faced by any high-growth, critical path technology business.
9:55am-10:15am (20m)
ML on code: Machine learning will change programming
Francesc Campoy (Dgraph)
Machine learning has revolutionized many fields, from cancer detection to self-driving cars. And let's not forget about connected toilets that allow Alexa to flush at your command. Francesc Campoy Flores explores some of the techniques used and the most relevant research, focusing on use cases where machine learning can help developers be more efficient.
10:15am-10:20am (5m) Sponsored
How do DevOps and SRE relate? Hint: They're best friends. (sponsored by Google Cloud)
Dave Rensin (Google)
SRE has exploded in the industry over the last two years, with the publication of two best-selling books from Google. Not surprisingly, there have been questions about how SRE and DevOps relate. Do they compete? Do they reinforce each other? The short answer is that they make each other better. Join Dave Rensin to hear why.
10:20am-10:40am (20m)
Practical performance theory
Kavya Joshi (Samsara)
Performance theory offers a rigorous and practical approach to performance tuning and capacity planning. Kavya Joshi dives into elegant results like Little’s law and the Universal Scalability Law. You'll also discover how performance theory is used in real systems at companies like Facebook and learn how to leverage it to prepare your systems for flux and scale.
10:40am-10:45am (5m)
Closing remarks
Nikki McDonald (O’Reilly Media), James Turnbull (Glitch), Ines Sombra (Fastly)
Cochairs Nikki McDonald, James Turnbull, and Ines Sombra close the first day of keynotes.
10:45am-11:35am (50m)
Break: Morning Break (Sponsored by Oracle Dyn)
3:05pm-3:50pm (45m)
Break: Afternoon Break (Sponsored by Oracle Dyn)
5:25pm-7:00pm (1h 35m)
Sponsor Pavilion Reception
Join us for the Sponsor Pavilion Reception on Tuesday, October 2, following the afternoon sessions.
12:15pm-1:30pm (1h 15m)
Lunch and Tuesday Topic Tables
Join other attendees during lunch at Velocity to share ideas, talk about the issues of the day, and maybe solve a few. Not sure which topic to pick? Don’t worry—it's not a long-term commitment. Try two or three and settle on a different topic tomorrow.
7:00pm-9:00pm (2h)
DevOps After Dark (cosponsored by NS1 and O'Reilly)
Join us for the social highlight of Velocity New York at one of New York's finest restaurants, Tao Uptown, located at 42 E 58th St. Enjoy great music, food, and cocktails while networking and making new connections.
8:00am-9:00am (1h)
Break: Morning Coffee
8:15am-8:45am (30m)
Tuesday Speed Networking
Jumpstart your networking at Velocity at Speed Networking before the keynotes begin. Bring your business cards and prepare a minute of patter about yourself, your projects, and your interests.