San Jose • New York • London

Build Systems that Drive Business

Sep 30–Oct 1, 2018: Training
Oct 1–3, 2018: Tutorials & Conference

New York, NY

Schedule List View Grid View

Topics

Beekman/Sutton North

11:35am Networks, echolocation, and fish GIFs Victoria Nguyen (Fastly)

1:30pm SLO burn Jamie Wilkinson (Google)

2:25pm Using distributed trace data to solve performance and operational challenges Naoman Abbas (Pinterest)

3:50pm How to break up with your vendor Amy Nguyen (Stripe), Cory Watson (Stripe)

4:45pm From silos to a single pane of glass at USA TODAY NETWORK Bridget Lane (Gannett | USA Today), Kris Vincent (Gannett | USA Today)

Sutton South/Regent Parlor

11:35am Frankenstein's microservices: How to avoid the monster Michael Hamrah (Namely)

1:30pm The container operator's manual Alice Goldfuss (GitHub)

2:25pm Linux, BPF, and containers Jessica Frazelle (Microsoft)

3:50pm Microreleases: How to safely rollout complex changes at scale Jeffrey Valeo (Grubhub)

4:45pm Debugging microservices apps via a sevice mesh, OpenTracing, and Squash Idit Levine (solo.io)

Nassau

11:35am Smooth scaling: Slack’s journey toward a new database Ameet Kotian (Slack)

1:30pm Managing multiple sources of truth in distributed applications Adam Wolfe Gordon (DigitalOcean)

2:25pm Trade-offs in resiliency: Managing the burden of data recoverability Kristina Bennett (Google)

3:50pm Kafka Streams in practice: What works and what doesn’t (yet) Bart De Vylder (CoScale)

4:45pm Sell cron, buy Airflow: Modern data pipelines in finance James Meickle (Quantopian)

Murray Hill

11:35am Building successful site reliability engineering in large enterprises Liz Fong-Jones (Honeycomb), Dave Rensin (Google)

1:30pm How NTSB air disaster analysis can help you in an emergency Matt Rogish (ReactiveOps)

2:25pm Disaster resilience the Waffle House way, from flattops to feature flags and more Heidi Waterhouse (LaunchDarkly)

3:50pm Small-scale engineering effie mouzeli (Wikimedia Foundation)

4:45pm The ops in serverless Jennifer Davis (Microsoft)

Gramercy

11:35am Lessons learned migrating HealthCare.gov to Terraform Christian Monaghan (HealthCare.gov | Nava PBC)

1:30pm "Not invented here" syndrome and dark debt: The PagerDuty story Aish Raj Dahal (PagerDuty)

2:25pm 60,000 tests in six minutes: Create a reliable pipeline, eliminate flaky tests, and deploy safely but quickly Sam Guckenheimer (Microsoft)

3:50pm Bulk image processing using Kubernetes Mike Newswanger (Elastic)

4:45pm Archaic to orchestrated: Ticketmaster's hybrid DevOps transformation Heather Osborn (Ticketmaster)

Sutton Center

11:35am Bot or human? Applying machine learning to combating the bot epidemic (sponsored by Oracle Dyn) Laurent Gil (Oracle Dyn)

1:30pm Performance anomaly detection at scale (sponsored by Salesforce) Tuli Nivas (Salesforce)

2:25pm API security: What you absolutely need to know now (sponsored by Oracle Dyn) Laurent Gil (Oracle Dyn)

3:50pm Easy CI/CD done right (sponsored by Verizon) Deepank Sharma (Verizon Wireless)

4:45pm

Grand Ballroom
9:00am Tuesday opening welcome Nikki McDonald (O’Reilly Media), James Turnbull (Glitch), Ines Sombra (Fastly)

9:05am Continuous Disintegration Anil Dash (Fog Creek Software)

9:25am Securing the edge: Understanding and managing security events (sponsored by Oracle Dyn) Laurent Gil (Oracle Dyn)

9:35am The programmer's mind Jessica McKellar (Pilot)

9:50am Test, measure, iterate: Balancing “good enough” and “perfect” in the critical path (sponsored by NS1) Kris Beevers (NS1)

9:55am ML on code: Machine learning will change programming Francesc Campoy (Dgraph)

10:15am How do DevOps and SRE relate? Hint: They're best friends. (sponsored by Google Cloud) Dave Rensin (Google)

10:20am Practical performance theory Kavya Joshi (Samsara)

10:40am Closing remarks Nikki McDonald (O’Reilly Media), James Turnbull (Glitch), Ines Sombra (Fastly)

10:45am Morning Break (Sponsored by Oracle Dyn) | Room: Sponsor Pavilion

3:05pm Afternoon Break (Sponsored by Oracle Dyn) | Room: Sponsor Pavilion

5:25pm Sponsor Pavilion Reception | Room: Sponsor Pavilion

12:15pm Lunch and Tuesday Topic Tables | Room: America's Hall 1 & 2

7:00pm DevOps After Dark (cosponsored by NS1 and O'Reilly) | Room: Tao Uptown

8:00am Morning Coffee | Room: 2rd Floor Promenade by Registration

8:15am Tuesday Speed Networking | Room: 2rd Floor Promenade by Registration

11:35am-12:15pm (40m) Monitoring, Observability, and Performance

Networks, echolocation, and fish GIFs

Victoria Nguyen (Fastly)

Victoria Nguyen explains how Fastly overhauled the monitoring and data collection of its globally distributed network without its caches noticing.

1:30pm-2:10pm (40m) Monitoring, Observability, and Performance Systems Monitoring & Orchestration

SLO burn

Jamie Wilkinson (Google)

Jamie Wilkinson offers a brief overview of SLOs, shares a practical guide to implementing sustainable SLO-based alerting for systems of any size, and outlines the tooling required to supplement the system in the absence of cause-based alerting.

2:25pm-3:05pm (40m) Monitoring, Observability, and Performance Resilient, Performant & Secure Distributed Systems

Using distributed trace data to solve performance and operational challenges

Naoman Abbas (Pinterest)

Naoman Abbas offers an overview of tools Pinterest built to process trace data and the use cases they’ve enabled and shares some real-world examples. Join in to learn how to apply these techniques to your own challenges.

3:50pm-4:30pm (40m) Monitoring, Observability, and Performance Resilient, Performant & Secure Distributed Systems

How to break up with your vendor

Amy Nguyen (Stripe), Cory Watson (Stripe)

You're unsatisfied with one of your monitoring providers. You've considered finding a new solution, but the thought of migrating your data off their platform sounds extremely painful. Amy Nguyen and Cory Watson explain how to make a deadline for an infrastructure-critical software migration while ensuring that everyone's requirements are met and no data has been lost.

4:45pm-5:25pm (40m) Monitoring, Observability, and Performance Systems Monitoring & Orchestration

From silos to a single pane of glass at USA TODAY NETWORK

Bridget Lane (Gannett | USA Today), Kris Vincent (Gannett | USA Today)

Three years ago, technical teams at USA TODAY NETWORK were completely siloed, making improvements and troubleshooting difficult and often blind to the rest of the technical organization. Bridget Lane and Kris Vincent explain how drastically the teams' tool belts, thought processes, and goals have changed as the company moved from silos to a single pane of glass.

11:35am-12:15pm (40m) Distributed Systems, Microservices and Containers Systems Architecture & Infrastructure

Frankenstein's microservices: How to avoid the monster

Michael Hamrah (Namely)

Many companies adopt microservices to break down monoliths, but they soon uncover a hidden cost: How do you manage all these new interconnected things popping up? Michael Hamrah explains how to avoid creating Frankenstein's monster by understanding elements of a microservice platform. . .so you can sleep at night.

1:30pm-2:10pm (40m) Microservices and Containers Systems Architecture & Infrastructure

The container operator's manual

Alice Goldfuss (GitHub)

Containers can be a great infrastructure solution, but no one should drive them without a manual. Alice Goldfuss discusses some of the advantages and disadvantages of running containers in production at scale.

2:25pm-3:05pm (40m)

Linux, BPF, and containers

Jessica Frazelle (Microsoft)

Jessica Frazelle explores some cool bits of Linux, including BPF and container technologies, and details new ways to trace various things in the kernel and how to even use these traces to hot patch kernels in the case of zero day vulnerabilities. Come for the jokes about Linux; stay for the live demos.

3:50pm-4:30pm (40m)

Microreleases: How to safely rollout complex changes at scale

Jeffrey Valeo (Grubhub)

Jeffrey Valeo explains how to safely rollout complex changes at scale.

4:45pm-5:25pm (40m) Microservices and Containers, Monitoring, Observability, and Performance Systems Monitoring & Orchestration

Debugging microservices apps via a sevice mesh, OpenTracing, and Squash

Idit Levine (solo.io)

Idit Levine demonstrates common debugging techniques and offers an overview of Squash, a new tool and methodology that brings the power of modern popular debuggers to developers of microservices apps that run on container orchestration platforms.

11:35am-12:15pm (40m) Distributed Data Resilient, Performant & Secure Distributed Systems

Smooth scaling: Slack’s journey toward a new database

Ameet Kotian (Slack)

Slack’s rapid growth over the last few years outpaced the original database’s scaling capacity, which negatively impacted the company's customers and engineers. Ameet Kotian explains how a small team of engineers embarked on a journey for the right database solution, which eventually led them to Vitess, an open source cluster database.

1:30pm-2:10pm (40m) Distributed Systems Distributed State

Managing multiple sources of truth in distributed applications

Adam Wolfe Gordon (DigitalOcean)

When building distributed applications, it's highly desirable to maintain a single source of truth, such as a database, for all application state. Unfortunately, for some applications, multiple sources of truth are unavoidable. Adam Wolfe Gordon shares strategies, learned from real-world experience, for managing multiple sources of truth without sacrificing consistency and usability.

2:25pm-3:05pm (40m) Distributed Data Resilient, Performant & Secure Distributed Systems

Trade-offs in resiliency: Managing the burden of data recoverability

Kristina Bennett (Google)

Kristina Bennett shares best practices for practical data recoverability and shines a light onto some of the pitfalls awaiting the unwary, based on lessons learned from five years of data integrity tooling and consulting across Google.

3:50pm-4:30pm (40m) Distributed Data, Distributed Systems Resilient, Performant & Secure Distributed Systems

Kafka Streams in practice: What works and what doesn’t (yet)

Bart De Vylder (CoScale)

Bart De Vylder shares his experience migrating an existing codebase and production environment to Kafka Streams, a relatively new and promising streaming library. Join in to see what aspects worked remarkably well and the challenges he ran into along the way.

4:45pm-5:25pm (40m) Distributed Data Resilient, Performant & Secure Distributed Systems

Sell cron, buy Airflow: Modern data pipelines in finance

James Meickle (Quantopian)

Quantopian integrates financial data from vendors around the globe. As the scope of its operations outgrew cron, the company turned to Apache Airflow, a distributed scheduler and task executor. James Meickle explains how in less than six months, Quantopian was able to rearchitect brittle crontabs into resilient, recoverable pipelines defined in code to which anyone could contribute.

11:35am-12:15pm (40m) DevOps and SRE Systems Monitoring & Orchestration

Building successful site reliability engineering in large enterprises

Liz Fong-Jones (Honeycomb), Dave Rensin (Google)

Implementing site reliability (SRE) engineering doesn't have to be intimidating, and it isn't only for cloud-native organizations. Liz Fong-Jones and Dave Rensin share eight key lessons Google's customer reliability engineering team learned helping large enterprises adopt SRE as an operations engineering model.

1:30pm-2:10pm (40m) DevOps and SRE Systems Architecture & Infrastructure

How NTSB air disaster analysis can help you in an emergency

Matt Rogish (ReactiveOps)

Matt Rogish explains how NTSB investigations of air disasters have dramatically improved flight safety and applies lessons learned in disaster recovery and analysis, teamwork, task saturation, and systems design to modern software application and infrastructure architecture at scale to achieve higher availability, reduced errors, and more scalable systems.

2:25pm-3:05pm (40m) DevOps and SRE Resilient, Performant & Secure Distributed Systems

Disaster resilience the Waffle House way, from flattops to feature flags and more

Heidi Waterhouse (LaunchDarkly)

Waffle House's hurricane disaster plan has everything you could want from an IT disaster plan, including contact trees, failover states, and runbooks on partial operation. Heidi Waterhouse shares lessons about state drawn from the world outside computers and explains how to quantify them using a finite state machine and implement them automatically while you are in a less-than-perfect condition.

3:50pm-4:30pm (40m)

Small-scale engineering

effie mouzeli (Wikimedia Foundation)

Effie Mouzeli explains why small-scale engineering is just as challenging as large-scale engineering and offers ideas on how to survive technical debt, poor communication, and other everyday challenges.

4:45pm-5:25pm (40m) DevOps and SRE

The ops in serverless

Jennifer Davis (Microsoft)

Rather than a future of NoOps, serverless has increased the need for specialized operations engineering. Jennifer Davis explores the role of operations in serverless, covering testing, monitoring, and debugging functions.

11:35am-12:15pm (40m) How We Built It

Lessons learned migrating HealthCare.gov to Terraform

Christian Monaghan (HealthCare.gov | Nava PBC)

Christian Monaghan explains how he and his team successfully migrated HealthCare.gov, America's largest government website, to the cloud infrastructure provisioning tool Terraform, shares lessons learned along the way, and details how you can effectively use Terraform for your next project.

1:30pm-2:10pm (40m) Systems Engineering and Architecture Systems Architecture & Infrastructure

"Not invented here" syndrome and dark debt: The PagerDuty story

Aish Raj Dahal (PagerDuty)

Finding the right balance between writing custom in-house software and using an off-the-shelf solution is difficult. Aish Raj Dahal sheds light on the age old build versus buy problem and "not invented here syndrome" by explaining how PagerDuty built a distributed task scheduler and later moved off it to use an off-the-shelf open source solution.

2:25pm-3:05pm (40m) How We Built It Systems Architecture & Infrastructure

60,000 tests in six minutes: Create a reliable pipeline, eliminate flaky tests, and deploy safely but quickly

Sam Guckenheimer (Microsoft)

Good test coverage is essential for catching issues before a pull request has been merged, but they have to be the right kind of tests and must be reliable. Drawing on his experience at Microsoft, Sam Guckenheimer details what type of tests to do in your DevOps pipeline, when you should do them, and why.

3:50pm-4:30pm (40m) Distributed Systems, How We Built It Systems Architecture & Infrastructure

Bulk image processing using Kubernetes

Mike Newswanger (Elastic)

Mike Newswanger explains how he used Kubernetes and Google Cloud to burst and extend the capacity of a physical infrastructure for optimizing almost 10 million images in less than two weeks.

4:45pm-5:25pm (40m) How We Built It Systems Architecture & Infrastructure

Archaic to orchestrated: Ticketmaster's hybrid DevOps transformation

Heather Osborn (Ticketmaster)

Heather Osborn explains how Ticketmaster moved from a siloed on-premises environment to a DevOps hybrid cloud. If a company whose technology and human infrastructure have grown up organically around a custom-written VAX operating system can make the move to public cloud-native applications and begin a rapid march to a hybrid cloud solution, so can you.

11:35am-12:15pm (40m) Sponsored

Bot or human? Applying machine learning to combating the bot epidemic (sponsored by Oracle Dyn)

Laurent Gil (Oracle Dyn)

Bots now make up over 50% of website traffic and have become the primary source of malicious application attacks, from DDoS to sophisticated intrusions. Laurent Gil lays out what you need to know about bot traffic, different types of bots, and real-world applications of ML and AI to identify and defeat malicious bots.

1:30pm-2:10pm (40m) Sponsored

Performance anomaly detection at scale (sponsored by Salesforce)

Tuli Nivas (Salesforce)

Automated anomaly detection in production using simple data science techniques enables you to more quickly identify an issue and reduce the time it takes to get customers out of an outage. Tuli Nivas shows how to apply simple statistics to change how performance data is viewed and how to easily and effectively identify issues in production.

2:25pm-3:05pm (40m) Sponsored

API security: What you absolutely need to know now (sponsored by Oracle Dyn)

Laurent Gil (Oracle Dyn)

API-based integration is fundamental to business strategy and continued success, but the explosion of APIs is creating incremental security risks that must be addressed. Laurent Gil explains why API security is quickly becoming a key cross-cutting concern for everyone from DevOps to the CISO.

3:50pm-4:30pm (40m)

Easy CI/CD done right (sponsored by Verizon)

Deepank Sharma (Verizon Wireless)

Deepank Sharma offers an end-to-end look at continuous integration and continuous delivery (CI/CD), using Pipeline as Code and templates, quality gates, Docker, and Kubernetes while following best DevOps practices.

4:45pm-5:25pm (40m)

Session

9:00am-9:05am (5m)

Tuesday opening welcome

Nikki McDonald (O’Reilly Media), James Turnbull (Glitch), Ines Sombra (Fastly)

Cochairs Nikki McDonald, James Turnbull, and Ines Sombra welcome you to the first day of keynotes.

9:05am-9:25am (20m)

Continuous Disintegration

Anil Dash (Fog Creek Software)

As our industry faces its biggest reckoning ever with the social, ethical and cultural impacts of technology, what can we learn if we reflect on the assumptions we build into our systems? How could our processes and tools be designed to undo the biggest bugs and biases of today’s tech?

9:25am-9:35am (10m) Sponsored

Securing the edge: Understanding and managing security events (sponsored by Oracle Dyn)

Laurent Gil (Oracle Dyn)

Laurent Gil shares the latest cybersecurity research findings based on real-world security operations along with innovative approaches to managing and mitigating security events at the cloud edge.

9:35am-9:50am (15m)

The programmer's mind

Jessica McKellar (Pilot)

The programmer's mind is inherently an activist's mind. Jessica McKellar draws parallels between the free and open source software movement and the work to end mass incarceration.

9:50am-9:55am (5m) Sponsored

Test, measure, iterate: Balancing “good enough” and “perfect” in the critical path (sponsored by NS1)

Kris Beevers (NS1)

In critical path services such as DNS, stability is imperative above all else. Kris Beevers examines the trade-offs between risk and velocity faced by any high-growth, critical path technology business.

9:55am-10:15am (20m)

ML on code: Machine learning will change programming

Francesc Campoy (Dgraph)

Machine learning has revolutionized many fields, from cancer detection to self-driving cars. And let's not forget about connected toilets that allow Alexa to flush at your command. Francesc Campoy Flores explores some of the techniques used and the most relevant research, focusing on use cases where machine learning can help developers be more efficient.

10:15am-10:20am (5m) Sponsored

How do DevOps and SRE relate? Hint: They're best friends. (sponsored by Google Cloud)

Dave Rensin (Google)

SRE has exploded in the industry over the last two years, with the publication of two best-selling books from Google. Not surprisingly, there have been questions about how SRE and DevOps relate. Do they compete? Do they reinforce each other? The short answer is that they make each other better. Join Dave Rensin to hear why.

10:20am-10:40am (20m)

Practical performance theory

Kavya Joshi (Samsara)

Performance theory offers a rigorous and practical approach to performance tuning and capacity planning. Kavya Joshi dives into elegant results like Little’s law and the Universal Scalability Law. You'll also discover how performance theory is used in real systems at companies like Facebook and learn how to leverage it to prepare your systems for flux and scale.

10:40am-10:45am (5m)

Closing remarks

Nikki McDonald (O’Reilly Media), James Turnbull (Glitch), Ines Sombra (Fastly)

Cochairs Nikki McDonald, James Turnbull, and Ines Sombra close the first day of keynotes.

10:45am-11:35am (50m)

Break: Morning Break (Sponsored by Oracle Dyn)

3:05pm-3:50pm (45m)

Break: Afternoon Break (Sponsored by Oracle Dyn)

5:25pm-7:00pm (1h 35m)

Sponsor Pavilion Reception

Join us for the Sponsor Pavilion Reception on Tuesday, October 2, following the afternoon sessions.

12:15pm-1:30pm (1h 15m)

Lunch and Tuesday Topic Tables

Join other attendees during lunch at Velocity to share ideas, talk about the issues of the day, and maybe solve a few. Not sure which topic to pick? Don’t worry—it's not a long-term commitment. Try two or three and settle on a different topic tomorrow.

7:00pm-9:00pm (2h)

DevOps After Dark (cosponsored by NS1 and O'Reilly)

Join us for the social highlight of Velocity New York at one of New York's finest restaurants, Tao Uptown, located at 42 E 58th St. Enjoy great music, food, and cocktails while networking and making new connections.

8:00am-9:00am (1h)

Break: Morning Coffee

8:15am-8:45am (30m)

Tuesday Speed Networking

Jumpstart your networking at Velocity at Speed Networking before the keynotes begin. Bring your business cards and prepare a minute of patter about yourself, your projects, and your interests.

Diamond Sponsor

Platinum Sponsors

Gold Sponsors

Silver Sponsors

Innovators

Supporters

Sponsorship Opportunities

For exhibition and sponsorship opportunities, email velocity@oreilly.com

Partner Opportunities

For information on trade opportunities with O'Reilly conferences, email partners@oreilly.com

Contact Us

View a complete list of Velocity contacts

©2018, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com

Schedule List ViewGrid View

Topics

Sponsorship Opportunities

Partner Opportunities

Contact Us

Schedule List View Grid View