San Jose • New York • London

Build Systems that Drive Business

June 11–12, 2018: Training
June 12–14, 2018: Tutorials & Conference

San Jose, CA

Speaker slides & video

Presentation slides will be made available after the session has concluded and the speaker has given us the files. Check back if you don't see the file you're looking for—it might be available later! (However, please note some speakers choose not to share their presentations.)

If you are looking for slides and video from 2017, visit the Velocity 2017 site.

A retrospective on retrospectives: How to be a nonexpert expert in system resilience

Jessica DeVita (Microsoft)

View slides

Jessica DeVita tells the story of how a team at Microsoft challenged themselves to retrospect their retrospectives and shares what they learned about applying human factors ideas to software development.

Artificial intelligence versus actionable intelligence (sponsored by PagerDuty)

David Hayes (PagerDuty)

Download slides (PDF)

Artificial intelligence has been almost here for 50 years, but we don't need to wait for it to escape the laboratory. Adding a manageable dose of actionable intelligence to your operations management workflow can save you time and aggravation. PagerDuty will talk about how AI's limitations and how it can decrease your noise and suggest possible courses of action.

Declarative application configuration: Mixing the old with the new

Bryan Liles (Heptio)

Download slides (PPTX)

Declarative application management enables developers and operators to simplify their configurations while deploying into increasingly complex environments. Bryan Liles explains how to evaluate and integrate these new practices into existing continuous integration pipelines.

Deploy security controls for serverless apps with infrastructure-as-code tools

Luis Colon (Amazon Web Services)

Download slides (PDF)

Many fundamental security practices and controls apply to serverless applications, including implementing proper monitoring and logging of all requests and events. Luis Eduardo Colon explores recommendations published by the Center for Internet Security (CIS), explains how to automate the deployment of some of these controls, and outlines considerations relevant to serverless functions.

Design for security

Serena Chen (BNZ Digital)

Download slides (PDF)

What insights do we gain if we apply user experience design to information security? Serena Chen shares four strategies that apply design thinking to security problems, pinpointing which practices work and which are detrimental. Serena then walks you through some common flows and dissects how design decisions affect your personal security.

EdgeControl: CDN tools to appease your inner control freak (sponsored by Verizon Digital Media Services)

Dave Andrews (Verizon Digital Media Services)

Download slides (PDF)

Change is inevitable, but the aftereffects can be both good and bad. Having the right tools is one way to meet this challenge. Dave Andrews explains how to wield the power of a global 50 Tbps application delivery network, featuring 125+ points of presence, to ensure maximum availability during and after a change.

End-to-end observability for fun and profit

Ben Hartshorne (Honeycomb), Christine Yen (Honeycomb)

Download slides (PDF)

Ben Hartshorne and Christine Yen explore what it means for a system to be “up” by discussing end-to-end (e2e) checks (what makes a good one and what techniques are valuable when thinking about them). Along the way, you'll learn how to write and evolve an e2e check against a common API.

For managers: How to keep up your technical skills without annoying your team(s)

K Vignos (Twitter)

Download slides (PDF)

Engineering teams want technically competent managers, but they also often want managers to keep their hands off their code. So how can managers keep their technical skills relevant in order to add the most value? Kathleen Vignos shares creative strategies for developing and maintaining technical skills—some through the act of managing itself.

From dandelion to tree: Scaling Slack

Bing Wei (Slack)

Download slides (PDF)

In 2016, Slack faced a problem: the load on its backend servers had increased by 1,000x. Bing Wei explains how rearchitecting the system with lazy loading, a publish/subscribe model, and an edge cache service overcame the problem with zero downtime, improved latency, and led to gains in reliability and availability.

Gaining efficiency with time series in ELK

Christian Saide (NS1)

Download slides (PDF)

Christian Saide explains how NS1 was able to reduce infrastructure, maintenance, and operational costs while simultaneously increasing throughput and visibility of key metrics by leveraging Elasticsearch as a time series database.

How to monitor your database

Baron Schwartz (VividCortex)

View slides

Baron Schwartz demonstrates how to monitor a database by understanding the difference between workload and resource monitoring—and the golden signals for each.

How we built a global search engine for genetic data

Miro Cupak (DNAstack)

Download slides (PDF)

The Beacon Network is the largest search and discovery engine of human genomic data in the world. Miro Cupak details the architecture and technologies behind the system with focus on the technical decisions that allow it to scale and disrupt the perception of genetic data.

Improving performance with Tesser

Kyle Kingsbury (Jepsen)

View slides

Kyle Kingsbury offers an overview of Tesser, a Clojure library for writing commutative, parallel folds that can be chained and composed into complex single-pass reductions that are dramatically faster on multicore systems and can be transparently distributed over Hadoop.

Introduction to continuous compliance and remediation

Nathen Harvey (Chef)

Download slides (PDF)

Join Nathen Harvey to learn how to easily integrate automated tests that check for adherence to policy into any stage of your deployment pipeline, using InSpec for compliance and Chef for remediation.

JavaScript, security, and the case for feature simplicity

Natalie Silvanovich (Google)

Watch the keynote

Download slides (PDF)

JavaScript engines are frequently targeted by malicious attackers, and dozens of vulnerabilities are reported in them every year. Most of these occur due to errors made while implementing well-specified features. Natalie Silvanovich discusses the link between feature complexity, developer error, and security vulnerabilities and the importance of considering implementation difficulty in design.

Jepsen 9: The center cannot hold

Kyle Kingsbury (Jepsen)

View slides

Kyle Kingsbury explores anomalies in three distributed systems—Tendermint, Hazelcast, and Aerospike—and shares general strategies for correctness testing using Jepsen, a distributed system testing harness that applies property-based testing to databases to verify their correctness claims during common failure modes: network partitions, process crashes, and clock skew.

Jumpstarting your DevSecOps pipeline with IAST and RASP (sponsored by Contrast Security)

Jeff Williams (Contrast Security)

Download slides (1-PDF)

View slides

Jeff Williams explains how to layer security tools on a CI/CD pipeline without disrupting it and demonstrates a fast, effective, scalable DevSecOps pipeline using free tools.

Kubernetes security best practices

Ian Lewis (Google)

View slides

Ian Lewis shares the easiest and best ways to improve the security of your Kubernetes clusters

Lessons learned while evolving Box’s database infrastructure

Tamar Bercovici (Box)

Watch the keynote

Download slides (PDF)

When Tamar Bercovici joined Box, the entire platform was running on a single MySQL DB host fronted by a simple pool of memcached servers. Tamar details how the team has evolved the Box database stack to handle an ever-growing query load and dataset. It now comprises hundreds of servers serving millions of queries per second over hundreds of billions of data records.

Leveraging multiplatform DNS for web application resiliency (sponsored by Oracle + Dyn)

Matt Torrisi (Oracle + Dyn)

Download slides (PDF)

Matt Torrisi demonstrates how to build domain traffic easily by enabling multiplatform DNS, covers the important criteria in assessing DNS network compatibility, and walks you through using DNS as a traffic-steering platform.

Lightweight mobile DevOps on GCP (sponsored by Google Cloud)

John LaBarge (Google)

Download slides (PDF)

John LaBarge details how to perform lightweight mobile DevOps on GCP, including building Android applications with Container Builder, doing functional testing with Firebase Device Lab, and distributing tested artifacts through Crashlytics Beta.

Load testing reinvented for DevOps (sponsored by Tricentis)

Tim Koopmans (Tricentis)

View slides

Tim Koopmans explains how load testing is being reinvented for DevOps, covering where traditional load testing approaches fall short for Agile and DevOps, what’s needed to rapidly expose performance issues before they impact users, and new approaches to making load testing faster, simpler, and more realistic.

More than a series of tubes: Networking in Kubernetes

Jeff Poole (Vivint Smart Home)

Download slides (PDF)

Networking with Docker and Kubernetes is a lot more complex than with traditional servers and virtual machines. Jeff Poole offers an overview of the concepts involved and explains what tuning may be required to use Kubernetes successfully.

Multicloud continuous delivery with Spinnaker

Tomas Lin (Netflix), Emily Burns (Netflix)

View slides

Tomas Lin and Emily Burns walk you through building continuous delivery pipelines for deploying and promoting code across cloud virtual machines and containers using Netflix's Spinnaker continuous delivery platform.

Netra Q&A: Scaling resource-intensive APIs (sponsored by Oracle + Dyn)

Kyle York (Oracle + Dyn), Richard Lee (Netra)

Download slides (PDF)

Kyle York and Richard Lee explore Netra’s high-performance computing environment, focusing on how the company's AI and deep learning models process tens of millions of images and videos each day in a time- and cost-effective manner. Along the way, they explain what worked, what didn't, and why you need an Agile, hybrid infrastructure if you want to build an AI business at the scale of social.

Observability of team health: Deciphering and reacting to organizational feedback (sponsored by NS1)

Renee Orser (NS1)

Watch the keynote

Download slides (PDF)

Engineering managers build the strongest teams by listening to their engineers, continuously calibrating their own alerts, and driving change management based on the feedback sourced from within their engineering organization. Renee Orser explains how to monitor the human networks within your engineering teams using models similar to your distributed technology systems.

Pat Helland and me: How to build stateful distributed applications that can scale almost infinitely

Sean Allen (Wallaroo Labs)

Download slides (PDF)

In 2007, Pat Helland published "Life Beyond Distributed Transactions: An Apostate’s Opinion," in which he conducts a thought experiment on how to design a distributed database that can scale almost infinitely. While the paper explicitly addresses distributed database design, Sean Allen shows that the ideas are far more widely applicable, particularly in scaling stateful applications.

Performance debugging: Finding bottlenecks in distributed systems

Christian Grabowski (NS1)

Download slides (PDF)

Performance debugging is a crucial part of ensuring code is production ready, particularly as a company and its products grow. However, bottlenecks that hold these services back can be hard to identify. Christian Grabowski shares his experience debugging bottlenecks in distributed systems, at both a macro (metrics, distributed tracing) and a micro (user space and kernel space profiling) level.

Rebuilding the airplane in flight. . .safely

Shannon Weyrick (NS1)

Download slides (PDF)

Rewriting the key software component of your platform from scratch is always intimidating, especially when you guarantee 100% uptime, your platform is in the critical application delivery path, and your environment is highly distributed. Shannon Weyrick discusses NS1's recent DNS server rewrite and the steps the company took to roll it out across its globally distributed network with no downtime.

Reliability from the ground up: Designing for five nines

Astrid Atkinson (Google)

Download slides (PDF)

Astrid Atkinson discusses techniques for building systems that are resilient by design.

Running stateful applications in Kubernetes: Is it worth the risk?

Kris Nova (Heptio)

Watch the keynote

Kris Nova explores the current state of running stateful applications in Kubernetes, the tooling gaps you'll want to watch out for, and the four metrics that will help you determine if it's worth the risk.

Scaling yourself during hypergrowth

Julia Grace (Slack)

Download slides (PDF)

When Julia Grace joined Slack two-and-a-half years ago, the company had fewer than 100 engineers. It's now at more than 350, and her own team grew from 10 to 50 people in 18 months. Julia shares tips and stories from the leadership front lines as she learned how to rapidly scale herself and her leadership team during a period when her job was substantially changing every six months.

Secrets and surprises of high performance: What the data says

Nicole Forsgren (GitHub)

Download slides (PDF)

Nicole Forsgren shares results and stories from four years of research to uncover the secrets and surprises of what really makes high-performing technology-driven teams and organizations.

Steering the Edgecast CDN with Heteractis

Marcel Flores (Verizon Digital Media Services)

Download slides (PDF)

Marcel Flores explores the design and implementation of Heteractis, the traffic management system Verizon Digital Media Services uses to turn network telemetry data into automated decisions in an automated fashion.

The distributed authorization system: A Netflix case study

Manish Mehta (Netflix), Torin Sandall (Open Policy Agent Project)

Download slides (PDF)

Manish Mehta and Torin Sandall lead a deep dive into how Netflix enforces authorization policies (“who can do what”) at scale in its microservices ecosystem in a public cloud without introducing unreasonable latency in the request path.

The internet versus your sites: Taking action against internet volatility (sponsored by Oracle + Dyn)

Kyle York (Oracle + Dyn)

Watch the keynote

Download slides (PDF)

When the internet is not bombarding your DNS with bogus requests, it’s trying to execute malicious SQL queries and crawling your site with bots (some good, some bad). Join Kyle York to learn how to take action.

The secret to building and delivering amazing apps at scale (sponsored by Akamai)

Javier Garza (Akamai Technologies)

Watch the keynote

Download slides (PDF)

We are more mobile now than ever. Although we use our mobile devices to optimize our time and do more anytime, anywhere, our apps are still too slow and cannot cope with our fast-paced lifestyle. Javier Garza details the ingredients you need to build and deliver an amazing app your users will love.

There can be only one (environment): Production

Paul McCallick (Nordstrom)

Download slides (PDF)

Paul McCallick discusses how and why Nordstrom has moved to an only-production viewpoint, saving countless engineering cycles and putting effort where it matters.

Tooling in the age of serverless computing

Donna Malayeri (Pulumi)

Download slides (PDF)

Tooling is necessary for serverless and service-full applications. Donna Malayeri shares a decision framework for choosing infrastructure deployment tools, based on whether you need flexibility and control or simplicity and ease-of-use. You'll learn how to evaluate several popular cloud automation tools, including AWS SAM, Terraform, Chalice, Serverless Framework, and more.

Using AI to solve performance problems (sponsored by Salesforce)

Jasmin Nakic (Salesforce ), Jackie Chu (Salesforce)

Download slides (PDF)

Jasmin Nakic and Jackie Chu share techniques to identify performance challenges by analyzing production data from Salesforce and other sources and explore the AI models to predict trends, detect anomalies, and troubleshoot performance problems.

Why Microsoft does DevOps (sponsored by Microsoft)

Martin Woodward (Microsoft)

Watch the keynote

Download slides (PDF)

Martin Woodward leads a whistle-stop tour of Microsoft's seven-year DevOps journey, explaining why the company embarked on this transformation and what benefits it has already seen.

Diamond Sponsor

Elite Sponsors

Platinum Sponsors

Gold Sponsors

Silver Sponsors

Innovators

Exhibitors

Sponsorship Opportunities

For exhibition and sponsorship opportunities, email velocity@oreilly.com

Partner Opportunities

For information on trade opportunities with O'Reilly conferences, email partners@oreilly.com

Contact Us

View a complete list of Velocity contacts

©2018, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com