Training: June 20–21, 2016
Tutorials: June 21, 2016
Keynotes & Sessions: June 22–23, 2016
Santa Clara, CA

Velocity 2016 Speakers

New speakers are added regularly. Please check back to see the latest updates to the agenda.

Filter

Search Speakers

John Allspaw has worked in software systems engineering and operations for twenty years in many different environments: biotech, government, online media, social networking, and e-commerce. He started out tuning parallel clusters running vehicle crash simulations for the U.S. government, and then moved on to the Internet in 1997. He built the backing infrastructure at Salon.com, InfoWorld.com, Friendster, and Flickr, and Etsy. He served as SVP of Engineering and then Chief Technology Officer at Etsy, and holds an MSc in Human Factors and Systems Safety from Lund University.

Presentations

Thursday opening welcome Keynote

Velocity program chairs John Allspaw, Steve Souders, and Mary Treseler open the second day of keynotes.

Wednesday opening welcome Keynote

Velocity program chairs Steve Souders, John Allspaw, and Mary Treseler open the first day of keynotes.

Lee Atchison is the principal cloud architect and advocate at New Relic, where he designs and leads the building of the New Relic infrastructure products and helps New Relic architect a solid service-based system architecture that scales as they have grown from a simple SaaS startup to a high-traffic public enterprise. Lee has a specific expertise in building highly available systems and has 28 years of industry experience. He learned cloud-based, scalable systems during his seven years as a senior manager at Amazon, where he led the creation of the company’s first software download store, created AWS Elastic Beanstalk, and led the team that managed the migration of Amazon’s retail platform from a monolith to a service-based architecture. Lee is the author of the book Architecting for Scale, published in 2016 by O’Reilly Media.

Presentations

Cloudy with a chance of scaling: Keeping your highly scaled application highly available using the cloud Session

As our applications grow, keeping them operational is challenging. High growth means more data, more computation, and more opportunities for problems. The cloud offers the ability to improve scalability while maintaining availability. Lee Atchison explains the “keep two mistakes high” principal and how to use the cloud to keep applications healthy and growing while keeping costs inline.

Office Hours with Lee Atchison Office Hours

Join Lee at Table 3 in the O'Reilly booth during the Thursday afternoon break to discuss improving availability and scalability in your applications, migrating to the cloud, and using the dynamic cloud.

Adam Auerbach is the technology senior director for advanced testing and release services for Capital One Financial Corporation, a diversified bank with 65 million customer accounts worldwide and more than 900 branch locations. Adam is responsible for Capital One’s enterprise performance and automated testing departments as well as enterprise release management, service virtualization, and test data management teams. Since joining Capital One, he has provided leadership for the agile transformation of their quality assurance group and led the enterprise adoption of DevOps and ATDD. Before joining Capital One, Adam was with Chase and other financial and insurance companies in various leadership positions focusing on quality and agile practices.

Presentations

Part of the pipeline: Why continuous testing is essential Session

Adam Auerbach and Tapabrata Pal discuss Captial One's transformation to continuous testing, covering core principles, tools, and best practices as well as common roadblocks and some recommendations on how best to remove them from the environment.

Chris Baker is an Internet cartographer, data analyst, and wanderlust researcher at Dyn, where he is responsible for an array of data analysis and research projects ranging from business intelligence to Internet measurements and communication analysis. Previously, Chris worked at Fidelity Investments as a senior data analyst. He graduated from Worcester Polytechnic Institute with a master’s degree in system dynamics and a bachelor’s degree in management of information systems and philosophy.

Presentations

Who is . . .? A question, not the Unix command Session

Chris Baker discusses Internet cartography and its implications for risk and security, focusing on building a mental model for how we know where on the network something is, what it is, and why that is important.

Tobias Baldauf is a web performance evangelist and consultant at Akamai, where he creates innovative web performance tools and new image optimization algorithms. He also speaks at conferences. He’s a proud dad, tries to be a mindful vegetarian, and loves making music. Find out more at Who.tobias.is.

Presentations

Your hero images need you: Save the day with HTTP/2 image loading Tutorial

Tobias Baldauf explains how to use HTTP/2's superpowers to optimize image delivery, thereby increasing the perceived performance of your page, reducing load times, and driving conversions.

Harkeerat Bedi is a research scientist at Verizon Digital Media Services. Previously, Harkeerat was a research and teaching assistant at the University of Memphis, where he conducted extensive research work on active queue management in routers, securing cloud infrastructure against coresident DoS attacks, search engine in Perl, semantic-based clustering of web forms for the Deep Web, and game-inspired defense architecture. He has coauthored 16 research articles that have appeared in publications such as Computer Communications. Harkeerat holds a bachelor’s degree in computer science and engineering from Anna University and a master’s degree in computer science from the University of Tennessee at Chattanooga. He has a doctorate in computer science from the University of Memphis.

Presentations

Jump-starting back-office connections Keynote

Large-scale cloud networks are constantly driven by the need for improved performance in communication between data centers. Such back-office communication makes up a large fraction of traffic in many cloud environments. Harkeerat Bedi offers an overview of a tool that improves the efficiency of data-center-to-data-center communication by learning the congestion level of links in between.

Kristopher Beevers is founder and CEO of NS1, the next-gen DNS and traffic management company. Previously, Kristopher led platform development at Voxel.net (acquired by Internap), where he built cloud and bare metal platforms, content delivery networks, and other distributed infrastructure products. Kristopher holds BS, MS, and PhD degrees in computer science from RPI.

Presentations

Future-proofing application delivery at Yelp: Building and tuning traffic management for large web-scale applications Session

At Yelp, application delivery means servicing a global audience from a highly dynamic infrastructure and meeting stringent performance and reliability requirements. Kris Beevers and Sarguru Nathan detail how they implemented automated, dynamic, and intelligent traffic management and DNS routing to ensure Yelp’s global traffic is serviced by the best data center.

Leveraging multi-CDN at Riot Games Session

Kristopher Beevers and Ray Panahon offer an overview of using a multi-CDN environment with DNS and traffic management to build, tune, and deliver an exceptional user experience.

Artur Bergman is the founder and CEO of Fastly, the future of content delivery. Previously, he served as CTO at Wikia, managed LiveJournal’s engineering team, and was an operations architect at Six Apart. In past lives, he was a committer to Varnish, built high-volume financial trading systems, reimplemented Perl 5’s threading system, and created djabberd.

Presentations

DDoS emotions Keynote

When a DDoS attack occurs, organizations respond using existing mental models of operations, not taking into account the emotional effects of the malicious nature of the attack. Artur Bergman explores how people react and why a DDoS attack is different than other challenges a company may face, proving that it isn't your fault and you're not alone.

Office Hours with Artur Bergman Office Hours

Join Artur at Table 2 in the O'Reilly booth during the Wednesday morning break to discuss content delivery, performance, and DDoS attacks.

Gianluca Borello is an engineering manager at Sysdig, where he wears many hats. Gianluca is a core developer of sysdig, an open source troubleshooting tool for Linux and containers, and spends his days dealing with backend development, performance analysis, and cloud infrastructure management. Prior to Sysdig, he was an engineer at Riverbed and CACE Technologies, the company behind Wireshark. He’s passionate about Linux, open source technologies, and distributed systems at scale. Gianluca has presented sysdig to the DevOps community at many events, including Docker meetups, DevOps Summit, KubenCon 2015, Lisa 2015, Sharkfest 2014, and the Facebook Engineering Tech Talk. He holds an MS in computer engineering from Politecnico di Torino, Italy.

Presentations

Monitoring microservices: Docker, Mesos, and Kubernetes visibility at scale Tutorial

Gianluca Borello explores the state of the art for visibility, monitoring, and troubleshooting for microservices and containers—including live demonstrations of popular tools and methods and the pros and cons of each—with special emphasis on sysdig, an open source system visibility tool.

Matt Brender is a developer advocate for the Software Defined Infrastructure (SDI) team as part of the Datacenter Group at Intel, which focuses on OSS contribution around orchestration, schedules, and automation with a growing number of projects written in Go. Matt describes himself as a technologist forever changed by Twitter. He uses his degree in computer science and love for APIs to code solutions to communication gaps throughout teams. Matt is recognized as a vExpert in the VMware community. In his free time, he likes to podcast as one of the Geek Whisperers or blog at Neckbeard Influence.

Presentations

My cloud makes pretty pictures Keynote

Data is beautiful when made visible. Matthew Brender and Raj Dutt offer a demo of Snap, an open telemetry framework designed to gather an increasingly diverse amount of measurements from the cloud, and illustrate how to visualize data in Grafana with unprecedented ease.

Buddy Brewer is the senior VP of products at SOASTA, where he leads strategic initiatives for all SOASTA products, including DOC, CloudTest, TouchTest, Data Science Work Bench, and mPulse, a real-user measurement tool that tells you how long people are waiting on your site and how it is impacting their behavior. Buddy has been helping some of the largest sites on the Web solve their performance challenges since 2001. Prior to SOASTA, Buddy cofounded LogNormal, the first company to put measuring real users first in the crowded space of performance monitoring tools. LogNormal was acquired by SOASTA in 2012.

Presentations

Performance is about more than time series charts Keynote

When most people think of performance monitoring tools, they think of things like time series charts, page load times, DNS resolution times, and backend service response times. Buddy Brewer explores alternative ways of visualizing performance data and explains how changing your perspective can sometimes lead you to surprising discoveries.

The top 10 performance defects that are costing you millions Session

There are literally hundreds of issues that could be slowing down your pages. How do you figure out (and fix) the ones that are hurting your business most? Buddy Brewer and Billy Hoffman outline a step-by-step process that helps you answer this question.

Nicolas Brousse is senior director of operations engineering at TubeMogul. The company’s sixth employee and first operations hire, Nicolas has grown TubeMogul’s infrastructure over the past seven years from several machines to over two thousand servers that handle billions of requests per day for clients like Allstate, Chrysler, Heineken, and Hotels.com. Adept at adapting quickly to ongoing business needs and constraints, Nicolas leads a global team of site reliability engineers and database architects that monitor TubeMogul’s infrastructure 24/7 and adhere to DevOps methodology. Nicolas is a frequent speaker at top US technology conferences and regularly gives advice to other operations engineers. Prior to relocating to the US to join TubeMogul, Nicolas worked in technology for over 15 years, managing heavy traffic and large user databases for companies like MultiMania, Lycos, and Kewego. Nicolas lives in Richmond, CA, and is an avid fisherman and aspiring cowboy.

Presentations

Look back, look now, see forward Session

Phil Stanhope and Nicolas Brousse explore why enterprises need to arm themselves with the right tools to detect and remediate problems and protect their Internet-connected assets.

Donovan Brown is a senior DevOps program manager on the US Developer Division team in Microsoft, focusing on developer tools including Visual Studio Team Services and Team Foundation Server. Before joining Microsoft, Donovan spent seven years as a process consultant and a Certified ScrumMaster. Donovan has traveled the globe helping companies in the US, Canada, India, Germany, and the UK develop solutions using agile practices, Visual Studio, and Team Foundation Server in industries including communications, healthcare, energy, and financial services. He is also an avid programmer, often finding ways to integrate software into his other hobbies and activities.

Presentations

Mobile-first, cloud-first continuous delivery: From responsive Web to iOS and Android native with Docker services and SQL Azure in the cloud in a modern DevOps pipeline Session

Donovan Brown explains how a release pipeline can manage containerized services across Windows and Linux and uses Azure, Xamarin, HockeyApp, .NET Core 5 on Linux, SQL Azure, and Visual Studio Team Services to demonstrate a DevOps experience for a multichannel mobile and cloud app. See the amazing changes brought by Microsoft's challenger mindset.

Sabrina Burney is currently a solutions architect at Akamai Technologies with a background in programming. Sabrina’s current focus involves security and protecting web infrastructures. More recently, she has gotten involved with improving the frontend end-user experience.

Presentations

Security and performance: Breaking the conundrum. . .again Session

Security techniques have generally focused on protecting users by blocking requests going to the origin, but security is also a concern at the browser. Sonia Burney and Sabrina Burney explore how security can be enforced at the browser level through a combination of optimization techniques and security enhancements, which overall provide an optimal end-user experience.

Sonia Burney is a solutions architect at Akamai Technologies with a background in frontend web development. Sonia’s main area of expertise is web performance with a strong focus on frontend optimizations that help enhance the user experience.

Presentations

Security and performance: Breaking the conundrum. . .again Session

Security techniques have generally focused on protecting users by blocking requests going to the origin, but security is also a concern at the browser. Sonia Burney and Sabrina Burney explore how security can be enforced at the browser level through a combination of optimization techniques and security enhancements, which overall provide an optimal end-user experience.

Bryan Cantrill is the CTO at Joyent, where he oversees worldwide development of the SmartOS and SmartDataCenter platforms and the Node.js platform. Prior to joining Joyent, Bryan served as a distinguished engineer at Sun Microsystems, where he spent over a decade working on system software, from the guts of the kernel to client code on the browser. In particular, he codesigned and implemented DTrace, a facility for dynamic instrumentation of production systems that won the Wall Street Journal’s Technology Innovation Award in 2006 and the USENIX Software Tools User Group Award in 2008. Bryan also cofounded the Fishworks group at Sun, where he designed and implemented the DTrace-based analytics facility for the Sun Storage 7000 series of appliances. Bryan received an ScB magna cum laude with honors in computer science from Brown University.

Presentations

The container hype cycle: Where we are and what’s next Session

2015 was a huge year for containers. A multitude of vendors rushed into the scene, leaving businesses scratching their heads about how to adopt this new technology. Will all this activity be a barrier to adoption as organizations wait out the hype? Bryan Cantrill explains why the “wait it out’ mentality won’t last long, why interoperability is key, and what’s next.

Ian Carrico works to make the Web a little faster and helps others do the same.

Presentations

The wild west of media performance: A Vox story Session

A little over a year ago, Vox Media created a dedicated performance team, which immediately set out to make all Vox Media sites as fast as possible—and has since made significant progress. Ian Carrico and Jason Ormand discuss what the team has done, how it did it, and what it's still working on.

Michelle Carrizosa is a data scientist at SOASTA, where she creates interactive data visualizations using Julia together with the JavaScript libraries C3, D3, and D3Plus and develops Julia tutorials and automatically generated documentation of SOASTA’s functions. Michelle graduated with a BS in general mathematics from California Polytechnic University, San Luis Obispo, where she researched the Gale-Shapley stable marriage problem and studied statistical programming with R. In her spare time, she plays tennis, practices yoga, snowboards, and fishes in the Sierra Nevada mountains.

Presentations

Prioritizing site performance improvements Session

Michelle Carrizosa and Iris Lieuw demonstrate how to prioritize improvements across your ecommerce site by identifying which of your pages are the most important to optimize and then looking at resource timing data to determine what affects those pages the most.

Adam Cath manages development and operations of Synthetic Monitoring at AppDynamics. Synthetic is a multicloud (ugh), multi-OS (ugh), globally distributed (ugh) web performance monitoring product. Adam also helped build AppDynamics Real-User Monitoring and previously worked at Adobe on performance analysis tools for game developers. Adam is passionate about visualizing what happens inside complex systems to enable creative problem solving.

Presentations

Outages, ownership, and craftsmanship: A DevOps story Session

DevOps is trendier than a cronut on a hoverboard, but what actually happens when you put developers and operators on the same team? Adam Cath shares the problems that led his product team to go from centralized Ops to distributed DevOps, the specific changes they made, some trade-offs they made and challenges they faced, and the surprising cultural shifts that resulted.

Richard Cook is a research scientist in the Department of Integrated Systems Engineering at the Ohio State University in Columbus, Ohio, and emeritus professor of healthcare systems safety at Sweden’s KTH. A physician, researcher, and educator, Richard is an internationally recognized expert on safety, accidents, and human performance at the sharp end of complex, adaptive systems. His most often cited publication is “Going Solid: A Model of System Dynamics and Consequences for Patient Safety.”

Presentations

Poised to deploy: The C-suite and adaptive capacity Keynote

The C-suite (continuous delivery, continuous integration, continuous delivery, and their enablers like agile, scrum, and so on) is an investment in future adaptive capacity. Richard Cook explains the value of adaptive capacity—being able to respond to new challenges and grasp new opportunities—and explores its far-reaching consequences.

R. Tyler Croy is a longtime Jenkins contributor, having tended the Jenkins project’s own infrastructure for almost as long as it has existed. Originally guiding the project from artisanal, hand-crafted physical machines into the world of Puppet and reproducible infrastructure. Tyler has most recently been focused on improving the reliability, testability, and immutability of the Jenkins project infrastructure with Puppet, ServerSpec, and Docker.

Presentations

Continuous delivery of infrastructure with Jenkins Session

R. Tyler Croy offers an overview of continuous delivery concepts and puts them into practice by building a continuous delivery pipeline with Jenkins to test, stage, and deploy infrastructure code to production. Tyler explains how reducing the effort, error rate, and time it takes to deploy a configuration to change to production means less time fighting fires and more time doing what you want.

Katherine Daniels is a senior operations engineer at Etsy who got her start in programming with TI-80 calculators back when GeoCities was still cool. These days, she has opinions on things like the cloud and DevOps. Before escaping to the world of operations, she spent a few years doing R&D and systems engineering in the corporate world. Katherine lives in Brooklyn with a perfectly reasonable number of cats and in her spare time can often be found rock climbing or handcrafting knitted server koozies for the data center. She is also coauthor of O’Reilly Media’s Effective DevOps.

Presentations

Leveling up monitoring: A decade of automating and scaling Nagios Session

Does monitoring have to suck? Etsy has been using Nagios for over a decade and over that time has created a set of tools that allow multiple teams to deploy, manage, and scale it. Katherine Daniels and Laurie Denness offer guidelines on how to scale monitoring and alerting setups, ideas for workflows around monitoring, and methods of reducing friction and alert fatigue for on-call engineers.

Jennifer Davis is a software engineer on the Community Engineering team at Chef, developing cookbooks to simplify building and managing infrastructure. Jennifer speaks about DevOps, tech culture, and monitoring and gives tutorials on a variety of technical topics. When she’s not working, she enjoys learning to make things and spending quality time with her dog.

Presentations

Crafting reusable infrastructure resources (Day 1) Training

Jennifer Davis walks attendees through crafting a community cookbook from scratch using both Chef and community resources. In the process, you'll learn some key patterns of DevOps practices.

Crafting reusable infrastructure resources (Day 2) Training

Jennifer Davis walks attendees through crafting a community cookbook from scratch using both Chef and community resources. In the process, you'll learn some key patterns of DevOps practices.

Laurie Denness is a British Technical Team Lead, currently working at Bloomberg LP, previously staff operations engineer at Etsy. He can often be seen looking at or creating graphs, stalking his coworkers to collect on-call sleep-habit data, or using multiple CDNs for fun and profit. Laurie likes graphs, Hadoop clusters, monitoring, and sarcasm. He once wore a onesie while giving a talk about how on-calls affect your employees and ways to fix it at 2014’s Velocity CA. Earlier in his career, he enjoyed streaming mp3s to millions of people at Last.fm.

Presentations

Leveling up monitoring: A decade of automating and scaling Nagios Session

Does monitoring have to suck? Etsy has been using Nagios for over a decade and over that time has created a set of tools that allow multiple teams to deploy, manage, and scale it. Katherine Daniels and Laurie Denness offer guidelines on how to scale monitoring and alerting setups, ideas for workflows around monitoring, and methods of reducing friction and alert fatigue for on-call engineers.

Martin Duke is one of four developers on F5’s TCP Optimization team. Martin’s work on transport protocols stretches back 14 years and includes authorship of one TCP RFC (RFC 7414). He regularly writes articles about TCP optimization on F5’s DevCentral website. Previously, Martin did R&D on the network and media access layers for wireless environments. He has degrees from MIT and the University of Washington and resides in Seattle with his family.

Presentations

TCP optimization: What you need to know for a faster application Session

TCP optimization leads to faster applications. Robert Haynes and Martin Duke offer essential knowledge and actionable advice that non-networking IT professionals can use to make their applications more responsive and improve their users' experience.

Raj Dutt is the CEO and cofounder of raintank. Formerly, he was the founder and CEO of Voxel, a cloud and hosting company, which was purchased by Internap in 2012, where he spent two more years as SVP of technology. He likes to fly airplanes and is currently in the middle of getting his motorglider license.

Presentations

My cloud makes pretty pictures Keynote

Data is beautiful when made visible. Matthew Brender and Raj Dutt offer a demo of Snap, an open telemetry framework designed to gather an increasingly diverse amount of measurements from the cloud, and illustrate how to visualize data in Grafana with unprecedented ease.

Mike Dvorkin is a recognized thought leader in policy-driven automation and operations. Mike’s work focuses on policy abstraction models and intent-based nonlinear automation methods applied to all aspects of infrastructure, applications, and operations. He is a founder of the Noiro project at Cisco focusing on bringing policy abstraction to open source compute and networking stacks. Prior to Noiro, Mike was a cofounder and chief scientist of Insieme Networks, where he was responsible for conceptualization of the policy layer that is now part of Cisco ACI/APIC. Before Insieme, as part of the early engineering team at Nuova Systems, he was the architect behind the management and control plane of UCS server platform, the first intent-defined server platform. Prior to Nuova, Mike held a number senior technical positions at infrastructure startups. Mike holds a degree in computer science from the University of Illinois at Urbana-Champaign.

Presentations

Tackling consumption chaos in microservice architectures Keynote

Service consumption chaos is currently one of the biggest challenges in microservice environments. Mike Dvorkin discusses how tackling the complexity of exploding service dependencies and their control through a consistent consumption abstraction can significantly simplify how teams develop, test, deploy, and control applications.

Jeremy Edberg, the CEO and Founder of MinOps, which makes using the cloud stupid easy. He is an angel investor and advisor for various incubators and startups. Previously, Jeremy was the founding reliability engineer for Netflix. Before that, he ran Ops for Reddit, which at the time had more than five billion pageviews a month. Jeremy’s expertise is in distributed computing, availability, rapid scaling, and cloud computing. He also edited the highly acclaimed AWS for Dummies.

Presentations

Maintaining your entirely serverless production infrastructure Session

Granularity has gotten smaller over the years, going from machines to instances to containers, but the smallest unit of granularity is the function. With AWS Lambda, Parse, App Engine, etc., the rise of the serverless production environment is upon us. Jeremy Edberg explains how CloudNative manages the conglomeration of functions built across the organization.

Office Hours with Jeremy Edberg Office Hours

Join Jeremy at Table 1 in the O'Reilly booth during the Wednesday afternoon break to discuss serverless architectures, why containers are last year's technology, and any other DevOps or reliability topic you can think of.

Tammy Everts is chief experience officer at SpeedCurve, where she helps companies understand how visitors use their websites. She has spent the past two decades studying how people use the web. Since 2009, she’s focused on the intersection between web performance, user experience, and business metrics. Tammy’s book, Time Is Money: The Business Value of Web Performance (O’Reilly), is a distillation of much of this research (but there’s always more to be learned). She cocurates (with Tim Kadlec) WPO Stats, a collection of performance case studies.

Presentations

Using machine learning to determine drivers of bounce and conversion Session

Google partnered with SOASTA to train a machine-learning model on a large sample of real-world performance, conversion, and bounce data. Patrick Meenan and Tammy Everts offer an overview of the resulting model—able to predict the impact of performance work and other site metrics on conversion and bounce rates.

Diego Ferreiro is an architect at Salesforce working on UI, performance, and instrumentation. Diego spends his days worrying about understanding web performance, 60 fps animations, and making the UI faster.

Presentations

Instrumenting, measuring, and monitoring the Salesforce Lightning Experience Session

Salesforce executes 4 billion+ transactions daily while keeping the average transaction speed under 300 ms and page speed under 2 seconds. As Salesforce's cloud services grow in complexity and scale, measuring and monitoring their performance becomes increasingly hard. Ashray Mathur and Diego Ferreiro Val discuss how Salesforce continues to improve and the lessons it has learned along the way.

Kevina Finn-Braun’s focus throughout her 18 years in the Internet industry has been operational excellence and risk management. Kevina is currently director of site reliability service management at Salesforce, where she leads the team focused on operational process improvements in the areas of incident, problem, and change management. In her previous role as director of business continuity at Yahoo, she led the team focused on risk management and service continuity best practices.

Presentations

Office Hours with Kevina Finn-Braun and J. Paul Reed Office Hours

Join Kevina and Paul at Table 2 in the O'Reilly booth during the Thursday morning break to discuss deploying a "blameless" culture and extending the Dreyfus model.

The blameless cloud: Experiments in actionable retrospectives at Salesforce Session

Are "blameless" postmortems real? Sure, small companies here and there talk about it, but what about large enterprises? J. Paul Reed and Kevina Finn-Braun discuss the hurdles, lessons, and surprises of rolling out actionable retrospectives in a large, complex organization, including what works, what doesn't, and techniques that, well. . .the jury is still out on.

Matthew Flaming has been designing complex web and SaaS servers for over 15 years, including content portals, CMS backends, and IoT runtime platforms. He is currently the VP of engineering for the Site Engineering and Data Services group at New Relic.

Presentations

Building a system that never stops: New Relic at scale Keynote

New Relic receives over half a trillion data points from customers that need to be processed, stored, and made available ASAP. Matthew Flaming explains why moving from a single application performance product to querying more than a billion events a second to serve multiple products has meant radically reinventing and re-architecting the whole platform multiple times.

Nicole Forsgren is the CEO and chief scientist at DevOps Research and Assessment (DORA). Nicole is an IT impacts expert who is best known for her work with tech professionals and as the lead investigator on the largest DevOps studies to date. She is a consultant, expert, and researcher in knowledge management, IT adoption and impacts, and DevOps. In a previous life, she was a professor, sysadmin, and hardware performance analyst. Nicole has been awarded public and private research grants (funders include NASA and the NSF), and her work has been featured in various media outlets, peer-reviewed journals, and conferences. She holds a PhD in management information systems and a master’s degree in accounting.

Presentations

DevOps: Continuous delivery, security, and lean management make your IT and business better Session

Research from the last few years has shown that DevOps can bring both throughput and stability to IT, driving value and profit for businesses. Nicole Forsgren, Jez Humble, and Gene Kim present the latest data on DevOps and explore the roles of security and compliance, continuous delivery, and management in driving world-class quality, efficiency, and value in innovative organizations.

Javier Garza is a multilingual technology evangelist who loves taking things apart, understanding how they work, and finding the best and most practical way to improve them. Javier works as a senior enterprise architect at Akamai Technologies, where he specializes in improving the performance and availability of websites and applications.

Presentations

Real-world experience with HTTP/2 Session

Michael Gooding and Javier Garza share their experiences using HTTP/2 over the last 12 months, exploring case studies that demonstrate how performance can be improved while also addressing backward compatibility, using RUM data to review performance-related observations of customers after making the switch, and hands-on demos of HTTP/2 with server push and HTTP/2 + QUIC.

Alice Goldfuss is a site reliability engineer at New Relic, where she spends her days wading through containers, comforting servers, and performing dark sacrifices to the network tier. You can find her on Twitter ranting about feminism or trying to kidnap every cat she meets.

Presentations

Rock stars, builders, and janitors: You're doing it wrong Session

Engineering teams depend on a mix of rock stars, builders, and janitors to function. However, too often people are pigeonholed into these roles, and they (and the company) suffer for it. Alice Goldfuss argues that people should rotate roles so that they appreciate the benefits of each.

Michael Gooding is a web performance evangelist for Akamai Technologies, where he helps customers identify performance bottlenecks and solves often complex problems. With 10 years’ experience in the IT industry, Michael started professional life as a developer but moved into consultancy when performance issues started to become the norm and not the exception. Previously, he worked for the NCC Group (formerly Site Confidence), helping customers from all industries improve the performance of their sites. Michael is constantly researching latest optimizations trends and current techniques and sits on the organizing committee of the London Web Performance Group. Michael has found himself becoming more and more impatient as he gets older, which fuels his desire for a faster web experience.

Presentations

Real-world experience with HTTP/2 Session

Michael Gooding and Javier Garza share their experiences using HTTP/2 over the last 12 months, exploring case studies that demonstrate how performance can be improved while also addressing backward compatibility, using RUM data to review performance-related observations of customers after making the switch, and hands-on demos of HTTP/2 with server push and HTTP/2 + QUIC.

Ilya Grigorik is a web performance engineer at Google, cochair of the W3C Web Performance Working Group, and author of High Performance Browser Networking (O’Reilly). In short, he’s an Internet plumber.

Presentations

Building Fast & Resilient Web Applications Session

To deliver a great—fast, responsive, and consistent—user experience, we must build resilient applications that adapt to these changes. In this talk we'll cover the architecture best practices and the critical tools and API's available in the browser that will enable you to deliver a great (fast and resilient) user experience.

Timothy Gross is a product manager for Joyent, providers of the Triton Elastic Container Service. Previously, Tim ran ops at DramaFever, where he and his scrappy team ran Docker in production to serve a few million fans their daily dose of dramas, documentaries, and gross-out horror movies. In another life, Tim was an architect (buildings, not software). He took the leap into programming and operations after he discovered he could automate away almost everything boring in his life.

Presentations

Stateful applications on autopilot Tutorial

Microservice architectures manage the complexity of the development process, and application containers help manage the dependencies and deployment of those microservices. But deploying and connecting services together is a challenge because it forces developers to design for operationalization. Timothy Gross explores autopiloting applications as a powerful design pattern to solve this problem.

Jason Hand – DevOps Evangelist at VictorOps, organizer of DevOpsDays – Rockies, author of “ChatOps for Dummies”, and co-host of the “Community Pulse” podcast about building community in tech. Jason has spent the last 18 months presenting and building content on a number of DevOps topics such as Blameless Post-mortems, ChatOps, and modern Incident Management. A frequent speaker at DevOps-related events and conferences around the country, Jason enjoys talking to audiences large and small on a variety of technical and non-technical subjects.
Photo: https://flic.kr/p/qFPZNd

Presentations

The Negative Downstream Consequences of Convergent Problem-Solving Session

A 2015 study uncovered that a well-defined problem-solving mindset can carry over to diminish performance on a subsequent ill-defined, creative task. This presentation is designed to explore this new knowledge and understand how it may relate to our efforts during Post Incident Analysis of Complex Systems.

Michael Hausenblas is a developer advocate for Go, OpenShift, and Kubernetes at Red Hat, where he helps app ops engineers build and operate distributed services. Michael shares his experience with distributed systems and large-scale data processing through demos, blog posts, and public speaking engagements and contributes to open source software such as OpenShift and Kubernetes. Previously, Michael was a developer advocate at Mesosphere, chief data engineer at MapR Technologies, and a research fellow at the National University of Ireland, Galway, where he researched large-scale data integration and the internet of things and gained experience in advocacy and standardization (World Wide Web Consortium, IETF). In his free time, Michael contributes to open source software (mainly using Go), blogs, and hangs out on Twitter too much.

Presentations

Building and operating containerized applications at scale (Day 1) Training

The advantages of containerized applications are increasingly recognized. Michael Hausenblas provides a gentle introduction to building and operating containerized applications at scale. The first day focuses on the basics of building app using containers; the second day expands this knowledge, focusing on the operations (monitoring, upgrades, etc.) of these apps.

Building and operating containerized applications at scale (Day 2) Training

The advantages of containerized applications are increasingly recognized. Michael Hausenblas provides a gentle introduction to building and operating containerized applications at scale. The first day focuses on the basics of building app using containers; the second day expands this knowledge, focusing on the operations (monitoring, upgrades, etc.) of these apps.

Office Hours with Michael Hausenblas Office Hours

Join Michael at Table 1 in the O'Reilly booth during the Wednesday morning break to discuss container operations, microservices, and DC/OS and Apache Mesos.

Chris Hawley is deputy program manager on contract managing the International Counterproliferation Program (ICP) of the Defense Threat Reduction Agency (DTRA), the US Department of Defense’s official combat support agency for countering the entire spectrum of chemical, biological, radiological, nuclear, and high-yield explosive threats globally.

Presentations

Incident management for DevOps Tutorial

The Incident Management System (IMS) has been battle tested by the American Fire Service for over 40 years across fires, rescues, hazardous materials incidents, and every other type of emergency. Rob Schnepp, Chris Hawley, and Ron Vidal explain how they adapted IMS for IT and offer an early look at content from Incident Management for IT Operations, their upcoming book from O’Reilly Media.

David Hayes is a full-time time data nerd and the director of platform strategy at PagerDuty, where he is scaling the most reliable way of waking up the IT world. Dave can be comfortably blamed for anything you hate about PagerDuty’s product, but he’d rather talk about integrating your product with PagerDuty, PagerDuty’s APIs, his JavaScript wrapper, or rock climbing and Mario Kart.

Presentations

Why DevOps is more than just automation Keynote

DevOps brings proven benefits by automating the deployment pipeline. David Hayes explores the benefits DevOps brings beyond automation, explaining the importance of shared operational responsibility on organizational culture, why reliability means both systems and people, and how aggregating alerts helps to maintain situational awareness.

Robert Haynes takes IT products and turns them into business solutions. First, Robert tests that things actually work. Then, he creates a story to help humans talk to other humans about how great the solution is. Occasionally, there are diagrams a preschooler would be proud of.

Presentations

TCP optimization: What you need to know for a faster application Session

TCP optimization leads to faster applications. Robert Haynes and Martin Duke offer essential knowledge and actionable advice that non-networking IT professionals can use to make their applications more responsive and improve their users' experience.

Jon C. Hodgson is an APM subject-matter expert for Riverbed Technology. For over a decade, Jon has helped hundreds of organizations around the world optimize the reliability and performance of their mission-critical applications. With a background in systems administration, networking, programming, and application architecture, Jon’s multidisciplinary approach to troubleshooting has enabled him to analyze and solve a wide variety of problems ranging from typical code/SQL inefficiencies to more complex issues related to resource saturation, network effects, and low-level OS/virtualization bottlenecks. When he’s not obsessing about how to make things perform faster, Jon enjoys digging things up with his tractor at his home in Missouri.

Presentations

Is your performance analysis approach as cutting edge as your application architecture? Session

To analyze and improve the performance of modern applications, you must abandon outdated approaches and toolsets which are rooted to the physical topology of servers and JVMs. Jon Hodgson discusses a new paradigm to reveal unexpected relationships and hotspots obscured by the elasticity of containers and microservices so that you can find and fix issues with the most overarching business impact.

Billy Hoffman is the director of product at Rigor, where he focuses on delivering best-in-class performance software. From hacking into sites as a penetration tester to optimizing performance for the Fortune 500, Billy has over a decade of deep technical experience into the inner workings of complex web applications. Billy was the lead researcher at the web security software company SPI Dynamics, where he developed and automated new techniques to exploit web vulnerabilities and spoke extensively at conferences such as Black Hat and RSA about his findings. Following SPI’s acquisition by Hewlett-Packard, Billy managed HP’s web security research group. Billy founded the web performance company Zoompf, where he developed the industry’s first site-wide performance analysis product that tells you the root cause of your performance issues. He has spoken at industry conferences such as Velocity and was a contributing author for O’Reilly’s Web Performance Daybook. Billy holds a BS in computer science from the Georgia Institute of Technology. Outside of work, Billy enjoys refinishing rooms in his house and spending time with his wife, Jill, and their two daughters.

Presentations

Policing ads and 3PC at scale on media sites Session

Ads are an especially tough problem for media sites due to the complexities of ad networks. Billy Hoffman spent 8 months defining a set of performance compliance guidelines at the Hearst media company for ads and 3PC encompassing everything from latency and size SLAs to policing malicious actors to IAB compliance. Billy reviews the guidelines and releases open source code to audit any media site.

The top 10 performance defects that are costing you millions Session

There are literally hundreds of issues that could be slowing down your pages. How do you figure out (and fix) the ones that are hurting your business most? Buddy Brewer and Billy Hoffman outline a step-by-step process that helps you answer this question.

Jez Humble is coauthor of the Jolt award-winning Continuous Delivery, published in Martin Fowler’s Signature series (Addison Wesley, 2010), and Lean Enterprise, in Eric Ries’s Lean series (O’Reilly, 2015). Jez has consulted for many Global 500 companies, helping them achieve technical excellence in software product development and deploy a culture of experimentation and learning. He works at 18F and teaches at UC Berkeley.

Presentations

DevOps: Continuous delivery, security, and lean management make your IT and business better Session

Research from the last few years has shown that DevOps can bring both throughput and stability to IT, driving value and profit for businesses. Nicole Forsgren, Jez Humble, and Gene Kim present the latest data on DevOps and explore the roles of security and compliance, continuous delivery, and management in driving world-class quality, efficiency, and value in innovative organizations.

Dean Hume is an author, Google Developer Expert, and all-around web performance geek. He regularly writes articles based on software development on his blog at Deanhume.com.

Presentations

Unleashing the power within: A hands-on guide to browser developer tools Tutorial

As any web developer knows, the developer tools built into modern browsers are packed with loads of features. The question is, do you really understand how or when to use them? These tools are capable of so much more than just debugging and inspecting elements in the DOM. Dean Hume teaches you exactly how to use the tools to become a better developer, one web page at a time.

Karl Isenberg is a distributed systems architect at Mesosphere working on DC/OS (the Datacenter Operating System). Prior to Mesosphere, Karl worked on CloudFoundry and BOSH at Pivotal. Karl’s current side projects include Probe (a service-ready check), Inject (a Golang dependency injection library), and Mesos Compose Docker-in-Docker. Karl is, as of this writing, the only person to have been a committer on CloudFoundry, Kubernetes, and DC/OS, so he is uniquely qualified to address the container platform market, cloud-native frameworks, lifecycle management strategies, and deployment tools in general. Karl’s publications include Obfuscation, an irregularly updated tech blog, and a more active stream of technology-related tweets.

Presentations

Container orchestration wars Session

The orchestration space is fast moving and full of competing products, platforms, and frameworks. How do you choose the right one for your requirements? Karl Isenberg explores the features of several container orchestrators, breaking down the feature sets and characteristics into categories and scoring multiple solutions—including Kubernetes, Marathon, and Docker Swarm—against each other.

C.J. Jameson is a software engineer with Pivotal Labs. He has worked internally across Pivotal’s Cloud Foundry and big data platforms.

Presentations

Better bash: Unit and integration testing Session

C.J. Jameson explains how to test all the things, even your bash scripts. Using the BATS framework, it's cheap to write both high- and low-level tests to drive out more modular and readable bash scripts.

Nic Jansma is a software developer building high-performance websites, apps, and open source tools at SOASTA, where he works on mPulse and boomerang. He is also a Microsoft MVP for IE.

Presentations

Measuring continuity Session

Whenever we speak of measuring web performance, we always refer to measuring static events, like page load or time to first tweet. A performant user experience is much more than that. Philip Tellis and Nic Jansma explore methods of measuring web performance as it relates to continuous interactions between the user and a page.

Tim Kadlec is the head of developer relations at Snyk, a company focused on making open source code more secure. He is the author of Implementing Responsive Design: Building Sites for an Anywhere, Everywhere Web (New Riders) and was a contributing author for High Performance Images (O’Reilly), Smashing Book #4: New Perspectives on Web Design, and Web Performance Daybook Volume 2 (O’Reilly). He writes sporadically at Timkadlec.com.

Presentations

Debugging frontend performance (Day 1) Training

Tim Kadlec and Patrick Meenan explain how the construction of websites and applications impacts performance as well as how to quickly debug and resolve performance issues. Tim and Patrick dive into how browsers work, how web pages are delivered, backend and frontend issues, optimizations, and techniques to get the best performance and provide hands-on experience for working on web performance.

Debugging frontend performance (Day 2) Training

Tim Kadlec and Patrick Meenan explain how the construction of websites and applications impacts performance as well as how to quickly debug and resolve performance issues. Tim and Patrick dive into how browsers work, how web pages are delivered, backend and frontend issues, optimizations, and techniques to get the best performance and provide hands-on experience for working on web performance.

Gene Kim is a multiple award-winning CTO, researcher, and author. Gene was founder and CTO of Tripwire for 13 years and has researched high-performing IT organizations for almost as long. He has written four books, including The Visible Ops Handbook, The Phoenix Project: A Novel About IT, DevOps, and Helping Your Business Win, and the forthcoming DevOps Cookbook.

Presentations

DevOps: Continuous delivery, security, and lean management make your IT and business better Session

Research from the last few years has shown that DevOps can bring both throughput and stability to IT, driving value and profit for businesses. Nicole Forsgren, Jez Humble, and Gene Kim present the latest data on DevOps and explore the roles of security and compliance, continuous delivery, and management in driving world-class quality, efficiency, and value in innovative organizations.

Courtney Kissler is vice president of retail technology for the Starbucks Technology team, where she is responsible for the teams leading the development and enhancements of point of sale (POS) systems and other associated retail technologies. Previously, Courtney spent 14 years at Nordstrom, where she most recently held the role of vice president for ecommerce and store technologies. In that role, she had accountability for program management, delivery, and support for all customer-facing technologies including in-store, web, and mobile touch points. During her tenure at Nordstrom, Courtney held a variety of leadership positions in both the engineering and infrastructure disciplines and was at the leading edge in bringing DevOps capability to the Nordstrom technology organization. Courtney started her career at Nordstrom as a security engineer and prior to this gained experience with startups CyberSafe and WorldStream Communications.

Presentations

Employee burnout and its often overlooked impact on productivity Session

Employee burnout is an overlooked anchor dragging down productivity and employee engagement in our industry. Courtney Kissler highlights some tactics and metrics to help leaders proactively address this issue.

Bridget Kromhout is a Principal Cloud Developer Advocate at Microsoft. Her CS degree emphasis was in theory, but she now deals with the concrete (if ‘cloud’ can be considered tangible). After 15 years as an operations engineer, she traded being on call for being on a plane. A frequent speaker and program committee member for tech conferences, she leads the devopsdays organization globally and the devops community at home in Minneapolis. She podcasts with Arrested DevOps, blogs at bridgetkromhout.com, and is active in a Twitterverse near you.

Presentations

Containers will not fix your broken culture, and other hard truths Keynote

Bridget Kromhout explains why containers will not fix your broken culture. Microservices won’t prevent your two-pizza teams from needing to have conversations with one another over that pizza. No amount of industrial-strength job scheduling makes your organization immune to Conway’s law.

Dan Kuebrich is CTO of AppNeta, where he is responsible for guiding the company’s long-term technology and product vision as well as directly managing forward-looking research. Prior to his role as CTO, Dan served as product manager for AppNeta, focused on TraceView, which he cofounded as Tracelytics (acquired by AppNeta in 2012). Previously, Dan was an engineer at AmieStreet/Songza.com and Sun Microsystems. Dan holds a bachelor of science in computer science from Brown University.

Presentations

Monitoring to the nth tier: The state of distributed tracing in 2016 Session

Distributed applications have advantages in terms of scale and reliability, but they also complicate or invalidate a lot of guarantees and tooling enjoyed in a monolithic world. Dan Kuebrich introduces distributed tracing as a visibility and performance tool, walking attendees through the implementation of a tracing system and surveying prior art in distributed tracing from academia to industry.

Office Hours with Dan Kuebrich Office Hours

Join Dan at Table 1 in the O'Reilly booth during the Thursday afternoon break to discuss server-side application performance, distributed tracing, and the OpenTracing project.

Karan Kumar is a software engineer at Instart Logic. Previously, Karan developed and tested big data platforms at Couchbase and Aster Data. He holds an MS in information networking from Carnegie Mellon and a BS in computer engineering from NITK, Surathkal, India. When not speeding up the Web, he likes to spend time playing cricket (as a fast bowler) and riding bikes with his partner and rambunctious 15-month-old daughter.

Presentations

Measuring ad blocker impact on site performance Session

Users' ad blockers are impacting your site's perceived performance, but measuring the impact of ad blockers on actual and perceived performance can be difficult. Karan Kumar offers an overview of new testing he has created that measures the overall impact ad blockers have on the quality of user experience and performance across a number of sites.

Office Hours with Karan Kumar Office Hours

Join Karan at Table 3 in the O'Reilly booth during the Thursday morning break to discuss ad blockers and how they work and multivariate web performance testing and tools.

Aneel Lakhani drinks a lot of coffee. He’s been doing technology full time since high school—from startups to consulting to teaching to big tech companies to analyst-ing and back to startups. Aneel does marketing at analytics-based monitoring startup SignalFx and was recently a research director at Gartner covering cloud operations.

Presentations

OODA, maps, and antifragility: A case study in pragmatism Session

OODA isn’t just for DevOps. Mapping isn’t just for strategists. Antifragility isn’t just for Netflix. Once we get beyond cargo-culting ideas, we can begin to understand how they arose and in what context. But how do we apply them? Aneel Lakhani explores the practical application of these ideas through examples from his daily work.

Diego Lapiduz is leading an effort to change how the government deploys software at 18F. He, and the cloud.gov team, built a platform to allow faster and easier government compliance without sacrificing developer experience. Earlier in his career, Diego worked on projects ranging from ecommerce to social networks. You can find Diego rambling about software architecture, DevOps, and other unrelated topics.

Presentations

Building an open platform for government innovation Keynote

There's a software culture revolution inside the government. User-centric design, lean, and agile are first citizens, but the increased velocity in development and testing requires a change in the way the government does deployment, security, and compliance. Diego Lapiduz shows how the cloud.gov team is building tools to achieve faster deployments and continuous compliance in a secure environment.

Ben Lavender works on development tools at GitHub. He lives in New Orleans, where he enjoys helping run a popular local meetup, Tuesday Hack Night

Presentations

ChatOps in 2016 Session

By now, you've probably have heard of ChatOps (especially if you're in operations). GitHub has been using ChatOps for more than five years and continues to scale these practices. Ben Lavender explains the guidelines that GitHub has created to work with ChatOps and the lessons learned in the process.

Elizabeth Lawler is CEO and cofounder of Conjur, Inc., provider of security orchestration software. Elizabeth has over 20 years of experience working in highly regulated and sensitive data environments. Prior to founding Conjur, Elizabeth was chief data officer of Generation Health and held a leadership position in research at the Department of Veterans Affairs. She has been a programmer herself and is constantly working to make software development and IT systems easier to manage for people working in regulated industries. Follow her on Twitter at @ElizabethLawler.

Presentations

DevOps, security, and compliance: Working in unison Session

DevOps teams help high-performing organizations deploy code 30x more often with 50% fewer failures, boosting productivity, profitability, and market share. However, many DevOps teams lack information security best practices. Elizabeth Lawler demonstrates how to bring DevOps, security, and compliance teams together, bridging this gap and forging a secure infrastructure for your company.

Bruce Lawson evangelizes Open Web standards as Deputy CTO for Opera Software. Bruce was a member of the Web Standards Project and cowrote Introducing HTML5. Previously, he was frontend lead for the Solicitors Regulation Authority in the UK, a tarot card reader in Istanbul, a volunteer pharmacist in Calcutta, tutor to a Thai princess, and a movie actor in Mumbai. You can follow him on Twitter.

Presentations

Making bad ads sad. Rad! Keynote

Ads are annoying and intrusive and can compromise privacy, but, worst of all, they're disastrous for website performance. Bruce Lawson outlines the performance gains Opera has made by deploying a native ad blocker in its flagship browsers, explains how Opera did it, and explores how the whole advertising ecosystem can (hopefully) improve.

Philippe Le Hegaret heads the W3C Interaction Domain, which produces
frontend web technologies including HTML5, CSS3, SVG, WOFF, and web APIs. Philippe is currently a participant in the Web Performance Working Group and organized the W3C workshop on video on the Web in 2007. Until July 2008, Philippe led the W3C Architecture Domain, which produced W3C core technologies in the area of XML, web services, and internationalization. He is a former chair of the Document Object Model (DOM) Working Group and coedited some of the DOM specifications.

Presentations

The state of the W3C Web Performance Working Group Keynote

Todd Reifsteck and Philippe Le Hegaret discuss the work the W3C Web Performance Working Group is doing, as well as performance-related efforts by other groups, so that you can be up to date with the latest developments and what's coming next. They also explain how easy it is to get involved, provide feedback, and influence the direction that these standards will take.

Pete LePage is a developer advocate at Google working on the Chrome Web Store team and the Open Web platform. Pete has been designing websites since his early days in high school, evolving from overlapping blink, marquee, and font tags on GeoCities to properly styled CSS and managed hosted websites. Pete works with the web developer community to build awesome new web applications using Open Web technologies like HTML5 and CSS3. He can often be found traveling the world with his friends, seeing both exotic and ordinary sites, or in a black and white dark room printing something he shot with a film camera. He has studied and taught at the prestigious Photographic Center Northwest in Seattle, where he completed his thesis in fine art photography.

Presentations

The fundamentals of building progressive web apps Tutorial

Pete LePage explores the fundamentals of progressive web apps, covering how to architect a single-page web app using the App Shell model, how to identify the different service worker caching strategies and choose the most appropriate one for a use case, and how to implement an installable web app using manifests, metatags, and other techniques.

Iris Lieuw is a data scientist at Akamai. In her daily work, Iris develops for the Data Science Workbench, which helps clients analyze their site’s perf data and create meaningful visualizations they can extract insights from. In her free time, she likes to rock climb, cook, knit, and calligraph.

Presentations

Prioritizing site performance improvements Session

Michelle Carrizosa and Iris Lieuw demonstrate how to prioritize improvements across your ecommerce site by identifying which of your pages are the most important to optimize and then looking at resource timing data to determine what affects those pages the most.

Bryan Liles is a principal engineer on the cloud engineering team at Capital One. When not helping a huge back move to the public cloud, he gets to speak at conferences on topics ranging from machine learning to building the next generation of developers. In his free time, Bryan races cars in straight lines and around turns and builds robots and devices.

Presentations

Under the covers: Capturing application errors, logs, and metrics Tutorial

Writing an application is only half the battle. Once an application goes to production, insight is required to determine how it's performing. Bryan Liles walks attendees through scenarios that will allow them to view application metrics and logs and capture errors for further analysis.

Justin Lintz is the director of web operations at Spring, where he is in charge of Spring’s growing AWS infrastructure. Previously he worked in operations roles at Chartbeat, Bitly, and TheStreet. Justin earned his degree in computer science with applied mathematics at University at Albany, SUNY. Justin resides in New York with his Amazon Echo, Alexa.

Presentations

Dealing with anxiety in operations Session

Justin Lintz defines some of the stresses operations people face, outlines methods for mitigating them, and discusses his personal experience of having an anxiety disorder while working in an operations role, raising awareness about the anxiety issues many people face but are afraid to talk about.

Stephen Ludin is a chief architect for Akamai’s Web Experience group. He currently heads the company’s Foundry team—a small group dedicated to innovating on the edge of technology. Stephen’s primary focus has been on projects related to the core proxy technology that is responsible for routing, accelerating, and securing Akamai’s traffic. Stephen received his degree from the University of California at San Diego in computer music, where his coding abilities were used in writing C programs to create experimental music. We are all grateful for his decision to use his technical and management skill in the more aesthetically pleasing field of making the Web a faster and safer realm for commerce and communications.

He currently serves on the board of the Internet Security Research Group, the parent organization for Let’s Encrypt.

Presentations

Measuring what matters Keynote

What good are detailed web page timings if you are not measuring the right things? Stephen Ludin offers an overview of the User Timing API, exploring adoption rates, current levels of support, and a path toward universal adoption and usage.

Ritesh Maheshwari is a performance engineer at LinkedIn, working on making LinkedIn fast using his medley of skills in data and performance analysis, network optimizations, and automation. Before LinkedIn, Ritesh was a performance engineer at Akamai, doing something similar. Ritesh holds a PhD in computer science from Stony Brook University, where he first became passionate about performance while working on computer networks. He is also an alumni of IIT Kharagpur.

Presentations

Robust anomaly detection for real user monitoring data Session

For the past year, LinkedIn has been running and iteratively improving Luminol, its anomaly detection system for real user monitoring data. Ritesh Maheshwari and Yang Yang offer an overview of Luminol, focusing on how to build a low-cost end-to-end system that can leverage any algorithm, and explain lessons learned and best practices that will be useful to any engineering or operations team.

Charity Majors is the cofounder and CTO of Honeycomb, a new startup focused on mining machine data. Previously, Charity ran infrastructure at Parse and was an engineering manager at Facebook. She also worked with the RocksDB team to build and deploy the world’s first Mongo + Rocks in production. Charity likes single malt scotch.

Presentations

A young lady's illustrated primer to technical decision making Keynote

Charity Majors discusses making better choices with software. Whether you're selecting a new polyglot persistence layer, launching a startup from scratch, or modernizing a mature environment, there have never been more opportunities for chaos. Charity explains when you should use boring technology, when to take a flyer on the bleeding edge, and best practices for making solid technical decisions.

Maslow's hierarchy of needs for databases Session

Charity Majors outlines DevOps/DBA best practices from the earliest seed stages (survival, selecting the right storage layer, etc.) to what you should expect from a mature, self-actualized database tier. Along the way, Charity explores how to ensure that your database is a first-class citizen of your engineering and operational processes and how your observability requirements may evolve.

Pete Mastin has 25 years of experience in business and product strategy and software development. Pete has expert knowledge of content delivery networks (CDN), IP services, and Internet and cloud technologies, and he has overseen the implementation of highly scalable, multihomed, global SaaS systems. Pete is a frequent speaker at conferences such as NAB (National Association of Broadcasters), Streaming Media, the CDN/Cloud World Conference Hong Kong, the Content Delivery Summit, Digital Hollywood, and Interop, among others.

Presentations

Understanding the performance penalty: HTTP vs. HTTPS Session

Pete Mastin and Sam Richardson discuss the performance penalty levied by SSL. As more sites move to 100% HTTPS, this topic deserves more visibility. Pete and Sam use real-world measurements (RUM) to analyze HTTP versus HTTPS in different world regions and across wireline, rural wireline, and wireless ISPs and explores strategies to offset the performance hit.

Ashray Mathur is a Senior Engineering Manager at Salesforce working on Salesforce Lightning Experience. Ashray has been a key enabler, multiplier and developer advocate for optimizing performance of Salesforce applications – working with many engineers across Salesforce to make the Lightning Experience faster and driving adoption of performance best practices.

Presentations

Instrumenting, measuring, and monitoring the Salesforce Lightning Experience Session

Salesforce executes 4 billion+ transactions daily while keeping the average transaction speed under 300 ms and page speed under 2 seconds. As Salesforce's cloud services grow in complexity and scale, measuring and monitoring their performance becomes increasingly hard. Ashray Mathur and Diego Ferreiro Val discuss how Salesforce continues to improve and the lessons it has learned along the way.

Ben Maurer is the tech lead of the Web Foundation team at Facebook, which is responsible for the overall performance and reliability of Facebook’s user-facing products. Ben joined Facebook as a member of the infrastructure team. Before Facebook, he cofounded reCAPTCHA with Luis von Ahn. Recently, Ben worked with the US Digital Service to improve the use of technology within the federal government.

Presentations

Facebook web performance: Using data from the field Session

You've measured the performance of your site using developer tools on your computer. But what do real users see when they go to your site? Ben Maurer discusses how Facebook pieces together data from multiple sources—load balancers, CDNs, HHVM, resource timing, navigation timing, etc.—and uses it to make a fast site.

Alois Mayr is a technology lead for cloud and virtualization in the Dynatrace Innovation Lab. Alois is passionate about performance, scaling things, and software quality. Before joining Dynatrace, Alois was a researcher for an organization focused on software quality measurements and evaluation. He is also a speaker at conferences and performance blogger.

Presentations

Running microservice environments is no free lunch Session

Migrating toward microservices tends to result in a 20x larger environment than monolithic counterparts. While the bright side of microservices and their enabling container platforms is high availability and scalability, what about the dark side—the side that nobody talks about in their presentations. Alois Mayr and Alexander Ramos uncover the truth so you don’t have to learn it the hard way.

Caitie McCaffrey is a backend brat and distributed systems diva at Twitter. Previously, Caitie spent the majority of her career at 343 Industries, Microsoft Game Studios, and HBO building the large-scale services and systems that power the entertainment industry. Caitie has a degree in computer science from Cornell University and has worked on several video games including Gears of War 2, Gears of War 3, Halo 4, and Halo 5. She maintains a blog at CaitieM.com and frequently discusses technology on Twitter.

Presentations

So we hear you like papers Keynote

Surprisingly enough, academic papers can be interesting and very relevant to the work we do in industry as practitioners. Ines Sombra and Caitie McCaffrey demonstrate how academic papers can radically change your perspective and introduce you to new ideas, offering a tour of papers that have reshaped the way they think about building large-scale distributed systems.

Patrick Meenan has been working on web performance in one form or another for the last 15 years and is currently working at Google to make Chrome and the web faster. Patrick created the popular open source WebPagetest web performance measurement tool, runs the free instance of it at WebPagetest.org, and can frequently be found in the forums helping site owners understand and improve their website performance.

Presentations

Debugging frontend performance (Day 1) Training

Tim Kadlec and Patrick Meenan explain how the construction of websites and applications impacts performance as well as how to quickly debug and resolve performance issues. Tim and Patrick dive into how browsers work, how web pages are delivered, backend and frontend issues, optimizations, and techniques to get the best performance and provide hands-on experience for working on web performance.

Debugging frontend performance (Day 2) Training

Tim Kadlec and Patrick Meenan explain how the construction of websites and applications impacts performance as well as how to quickly debug and resolve performance issues. Tim and Patrick dive into how browsers work, how web pages are delivered, backend and frontend issues, optimizations, and techniques to get the best performance and provide hands-on experience for working on web performance.

Scaling frontend performance Session

Patrick Meenan outlines techniques for serving rich experiences to users on fast connections while still offering a fast experience for users on slow connections and addresses some eye-opening issues and solutions for serving content on mobile connections.

Using machine learning to determine drivers of bounce and conversion Session

Google partnered with SOASTA to train a machine-learning model on a large sample of real-world performance, conversion, and bounce data. Patrick Meenan and Tammy Everts offer an overview of the resulting model—able to predict the impact of performance work and other site metrics on conversion and bounce rates.

Donny Nadolny is a Scala developer at PagerDuty working on improving the reliability of its backend systems. Donny spends a large amount of time investigating problems experienced with distributed systems like Cassandra and ZooKeeper.

Presentations

Debugging distributed systems Session

Distributed systems are hard. They are complicated, hard to understand, and very challenging to manage. But they are critical to modern software, and when they have problems, we need to fix them. Donny Nadolny looks at what it takes to debug a problem in a distributed system like ZooKeeper, walking attendees through the process of finding and fixing one cause of many of these failures.

Emily Nakashima is a full stack JS engineer who loves design, web perf, and metrics. Emily is a co-organizer of the AndConf code retreat and unconference and a volunteer for RailsBridge.

Presentations

RUM beyond performance Session

We talk about RUM a lot at Velocity, but we usually focus on performance metrics. What else we can learn by monitoring the browser? Emily Nakashima explores some of the other aspects of RUM monitoring, from exceptions and security issues to accessibility and design problems, and outlines a journey from "zero to visibility" on the new React dashboard, pointing out all the pitfalls along the way.

Sangeeta Narayanan leads the Edge Developer Experience team at Netflix, which focuses on creating solutions that increase development velocity and provide operational insight into system health and behavior. Sangeeta has held various roles in her career in fields such as test engineering, sales engineering, and engineering management. Throughout all those experiences, the common theme has been her passion for simplifying the process of developing and operating software.

Presentations

Operational visibility on a global scale Session

Netflix is now a global service. Ensuring that subscribers enjoy the best possible streaming experience at all times requires operational insights at various levels. Sangeeta Narayanan discusses the strategies, infrastructure, and tools that give Netflix the requisite insights for operating the Netflix Edge services at scale for a global audience.

Sarguru Nathan is a site reliability engineer who works on large-scale infrastructure automation for Yelp. He’s an avid open source contributor and lives in London.

Presentations

Future-proofing application delivery at Yelp: Building and tuning traffic management for large web-scale applications Session

At Yelp, application delivery means servicing a global audience from a highly dynamic infrastructure and meeting stringent performance and reliability requirements. Kris Beevers and Sarguru Nathan detail how they implemented automated, dynamic, and intelligent traffic management and DNS routing to ensure Yelp’s global traffic is serviced by the best data center.

Sam Newman is an independent consultant specializing in helping people ship software fast. Sam has worked extensively with the cloud, continuous delivery, and microservices and is especially preoccupied with understanding how to more easily deploy working software into production. For the last few years, he has been exploring the capabilities of microservice architectures. He has worked with a variety of companies in multiple domains around the world, often with one foot in the developer world and another in the IT operations space. Previously, he spent over a decade at ThoughtWorks and then another year with a startup. Sam speaks frequently at conferences. He is the author of Building Microservices (O’Reilly). If you would like to get in touch, please email him.

Presentations

AppSec and microservices Tutorial

Microservices offer many options for securing your systems. Done right, microservices can increase the security of your vital data and processes. Done wrong, and they can increase the surface area of attack. Sam Newman explores the importance of defense in depth, discussing the many different ways in which you can secure your fine-grained, distributed architectures.

Elizabeth (Betsy) Nichols is chief data scientist at Netuitive, Inc., where she is responsible for leading the company’s vision and technologies for data science, analytics, modeling, and algorithms. Over her career, Betsy has applied mathematics and computer technologies to create systems for war gaming, space craft mission optimization, industrial process control, supply chain logistics, digital advertising, IT security/risk models, application performance management, and DevOps in academia, the federal government, private enterprise, and several startups. She has cofounded three companies, all of which delivered analytics to commercial and government enterprises.

The first company that Betsy cofounded was Digital Analysis Corporation (acquired by Legent in 1994). A year later, Legent was acquired by Computer Associates (CA). As a result, Betsy became one of two principal architects for Unicenter TNG reporting to CA’s CTO. The second company that Betsy cofounded was Clear Point Metrics (acquired by nCircle in 2011), which offered the first platform for IT security metrics. PlexLogic, Betsy’s most recent venture, offered analytics consulting services.

As one of the founding members of the Java JMX Expert Group, Betsy contributed to the several early releases of the Java Management Extension (JMX). She has also been active in numerous industry efforts to define and standardize analytics for measuring IT security, risk, and compliance, contributed to three books on security metrics, coauthored five books on topics in computer hardware, software, and digital communications, and is lead inventor on patents that define algorithms and methods for heterogeneous, scalable data integration for analytics. Betsy has served on technical advisory boards for several companies including BladeLogic (acquired by BMC). Betsy holds an AB from Vassar College and a PhD in mathematics from Duke University.

Presentations

Effectively adding analytics to your monitoring Session

Effective monitoring for today’s agile environments is both science and art. (Analytics can provide the “science” while experts and business context can provide the “art.”) There is no perfect solution, but a framework for integrating these varied information sources as collaborators can drive continuous improvement. Elizabeth Nichols highlights (anonymized) examples from real environments.

Office Hours with Elizabeth Nichols Office Hours

Join Elizabeth at Table 2 in the O'Reilly booth during the Thursday afternoon break to discuss her experiences—both good and bad—applying analytics to monitoring, possible offline analytics that would be useful, and specific models for cost optimization.

Over his 19-year career, Dave Nielsen has been a web developer, systems architect, technical trainer, developer evangelist, and startup entrepreneur. Today, Dave is a Technical Program Manager at Intel for the BigDL Deep Learning Framework as well as the cofounder of CloudCamp, a series of unconferences about cloud computing. Dave resides in Mountain View with his wife Erika, to whom he proposed in his coauthored book PayPal Hacks.

Presentations

Deploying Redis using Kubernetes vs. Swarm vs. Amazon ECS vs. Cloud Foundry Tutorial

Redis is a critical part of scalable web architecture, whether for consumer-facing apps or as an operational database for microservices, and Redis is the most common NoSQL database deployed in containers. Dave Nielsen walks attendees through Redis and compares the steps to deploy Redis in various container frameworks, including Kubernetes, Docker Swarm, Amazon ECS, and Cloud Foundry.

At Flynn, Alex Nobert is working on the next-generation PaaS of your dreams. Previously, he built and led Ops at Shopify, Vox Media, and Minted. For some reason, he enjoys wearing a tie. Alex has a side gig as a personal assistant to Pupperz and Trudy, the two dogs with whom he shares a house.

Presentations

So you want to be an Ops manager Session

When you ask Ops engineers where they want to go in their careers, the only answer you get after "I don't know" is "management." But what does that entail, and how do you get there? Alex Nobert discusses his career transition from engineer to manager to director, describing the day-to-day work, expectations, priorities, and goals of each so that you can learn from his mistakes.

Jason Ormand is a performance engineer at Vox Media and the creator of Justice.js.

Presentations

The wild west of media performance: A Vox story Session

A little over a year ago, Vox Media created a dedicated performance team, which immediately set out to make all Vox Media sites as fast as possible—and has since made significant progress. Ian Carrico and Jason Ormand discuss what the team has done, how it did it, and what it's still working on.

Tapabrata Pal has 20 years of IT experience in various technology roles (developer, operations engineer, and architect) in the retail, healthcare, and finance industries. Over the last three years, Tapabrata has served as director of Capital One’s Enterprise Architecture group and led the company’s DevOpsSec initiatives. He is currently Senior Director and Senior Engineering Fellow focused on DevOps, and Continuous Delivery at large scale in a regulated environment. Previously, Tapabrata spent some time in academics doing doctoral and postdoctoral research in the field of solid state physics.

Presentations

Part of the pipeline: Why continuous testing is essential Session

Adam Auerbach and Tapabrata Pal discuss Captial One's transformation to continuous testing, covering core principles, tools, and best practices as well as common roadblocks and some recommendations on how best to remove them from the environment.

Ray Panahon currently handles technical operations at Riot Games for multiple initiatives. He has led the build out and expansion in Latin America and E-sports, as well as introducing scalable global multi-CDN architectures for player-facing products.

Presentations

Leveraging multi-CDN at Riot Games Session

Kristopher Beevers and Ray Panahon offer an overview of using a multi-CDN environment with DNS and traffic management to build, tune, and deliver an exceptional user experience.

Mike Panchenko joined New Relic along with the rest of the Opsmatic, a company he started after spending time building, breaking, and evolving various systems at Urban Airship, SimpleGeo, and Flickr. When not wrasslin’ with computers, Mike is likely surfing, exploring new places with his special lady friend, enjoying the company of friends, watching sports, reading, or playing guitar poorly, but his real passion is growing beards.

Presentations

Monitoring containers at New Relic, by New Relic Session

New Relic went all-in on on Docker early and has continued to stay on the forefront of the container ecosystem, both as a user of the technology and as a monitoring vendor. Mike Panchenko explores how New Relic's teams utilize Docker in a variety of ways using a mix of home-grown and external OSS frameworks.

Kassandra Perch is a developer, crafter, gamer, and cat mom living in Austin, TX. Kassandra spends her days as a developer evangelist at Auth0 and her nights building NodeBots, crafting in her lab, or playing video games. Kassandra is an unrepentant JS addict on servers, in browsers, and on hardware.

Presentations

Sandboxing user code using Node.js, CoreOS, and Docker Session

Kassandra Perch demonstrates how to use Node.js sandboxes to run custom user code safely and securely using technologies like CoreOS and Docker.

Dieter Plaetinck is a principal engineer building out 100% open source and open SaaS monitoring at raintank.

Presentations

Next-generation alerting and fault detection Session

Alerting on your stack is the key to happy customers and a healthy business. Dieter Plaetinck explains what's wrong with the oft-touted complicated alerting methods and explores how to get the in-depth coverage and address complicated alerting needs using simple techniques, with a focus on the workflow using an alerting IDE.

Andrey Pokhilko is a renowned thought leader and innovator within the Apache JMeter performance testing community. Andrey has worked extensively to create and develop tools that enhance the Apache JMeter protocol coverage and analytical reporting capabilities. He is the founder of JMeter-Plugins.org, which he led to its undisputed status as the premier source of JMeter plugins in the world. Loadosophia was similarly born from Andrey’s desire to complete JMeter’s offerings. He has world-class experience with large-scale performance testing, having led the load testing team for five years at search engine giant Yandex.

Presentations

Take your load testing up a notch with open source tools Session

Andrey Pokhilko offers an overview of Taurus, a simple-to-learn open source command-line tool that enables you to utilize tried and tested tools for generating load.

Aditya Punjani is a Software Engineer (Web) at Facebook, where he works on building high-performance experiences. Previously Aditya was the Tech Lead of Flipkart Lite, the world’s first major Progressive Web App. Aditya likes to push the Web forward and is passionate about JavaScript, network and rendering performance, developer tooling, browser internals, and user experience.

Presentations

Building high-performance progressive web apps Session

Progressive web apps are a new class of web apps that provide an immersive full-screen experience right from the home screen on mobile phones, just like native apps. But how do they compete with the performance characteristics of native apps? Aditya Punjani explains how Flipkart built a high-performance PWA that runs smoothly on low-end devices and works seamlessly even on 2G or offline.

Office Hours with Aditya Punjani Office Hours

Join Aditya at Table 3 in the O'Reilly booth during the Wednesday afternoon break to discuss performance best practices for mobile web apps, using service workers at scale, and progressive web apps and the future of the mobile Web.

Alexander Ramos has been designing and implementing distributed systems for more than 10 years, with a focus on Java backend applications and Linux administration. Alexander is currently responsible for designing and running the infrastructure of B2W Digital, Latin America’s biggest ecommerce company.

Presentations

Running microservice environments is no free lunch Session

Migrating toward microservices tends to result in a 20x larger environment than monolithic counterparts. While the bright side of microservices and their enabling container platforms is high availability and scalability, what about the dark side—the side that nobody talks about in their presentations. Alois Mayr and Alexander Ramos uncover the truth so you don’t have to learn it the hard way.

J. Paul Reed is the founder of Release Engineering Approaches, a consultancy incorporating a host of tools and techniques to help organizations “Simply Ship. Every time.” Paul has worked across a number of industries, from financial services to cloud-based infrastructure, with teams from 2 to 2,000 on everything from tooling, operational analysis and improvement, and team culture transformation to business value optimization. He is also the chief delivery officer and a visiting scientist at Praxisflow.

Presentations

Office Hours with Kevina Finn-Braun and J. Paul Reed Office Hours

Join Kevina and Paul at Table 2 in the O'Reilly booth during the Thursday morning break to discuss deploying a "blameless" culture and extending the Dreyfus model.

The blameless cloud: Experiments in actionable retrospectives at Salesforce Session

Are "blameless" postmortems real? Sure, small companies here and there talk about it, but what about large enterprises? J. Paul Reed and Kevina Finn-Braun discuss the hurdles, lessons, and surprises of rolling out actionable retrospectives in a large, complex organization, including what works, what doesn't, and techniques that, well. . .the jury is still out on.

Todd Reifsteck is a senior program manager specializing in performance and fundamentals for Microsoft Edge. Todd previously integrated notifications into Azure Mobile Services, helped scale the Azure Notification Hub and Service Bus Relay, and worked on fundamentals for the .Net Framework.

Presentations

The state of the W3C Web Performance Working Group Keynote

Todd Reifsteck and Philippe Le Hegaret discuss the work the W3C Web Performance Working Group is doing, as well as performance-related efforts by other groups, so that you can be up to date with the latest developments and what's coming next. They also explain how easy it is to get involved, provide feedback, and influence the direction that these standards will take.

Sam Richardson has over 15 years in Internet service delivery experience. Sam is a service performance and capacity specialist at Microsoft, utilizing CDN services and running all forms of load balancing. He is currently responsible for operations and the distribution of Microsoft content via channels such as Windows Update, Windows Store, and Xbox.

Presentations

Understanding the performance penalty: HTTP vs. HTTPS Session

Pete Mastin and Sam Richardson discuss the performance penalty levied by SSL. As more sites move to 100% HTTPS, this topic deserves more visibility. Pete and Sam use real-world measurements (RUM) to analyze HTTP versus HTTPS in different world regions and across wireline, rural wireline, and wireless ISPs and explores strategies to offset the performance hit.

Gareth Rushgrove is a senior software engineer at Puppet Labs, where he builds interesting tools for people to better manage infrastructure. Previously, he worked for the UK Government Digital Service, focusing on infrastructure, operations, and information security. When not working, he can be found writing the DevOps Weekly newsletter or hacking on software in newfangled programming languages.

Presentations

The two sides of Google infrastructure for everyone else Session

The meme "Google infrastructure for everyone else" has emerged alongside Kubernetes and other recent releases by Google. Gareth Rushgrove explores whether Google's approach to infrastructure will be how all organizations run software in the future and why that's an interesting question to ask for anyone running systems today.

Eleanor Saitta is a security consultant specializing in architectural security for large-scale systems, integrating security into the development life-cycle, and cross-domain security for news organizations and NGOs targeted by nation states. Eleanor is a cofounder and developer for Trike, an open source threat-modeling methodology and tool, contributes to the Briar and Mailpile secure messaging projects, and is a frequent speaker.

Presentations

You got your design team in my security team! Keynote

What if the answer to managing security issues starts with the product and design teams? Thinking about security design can drive everything from business decisions through operations, but it means rethinking what security is and building different kinds of relationships between teams. It's a long journey, but Eleanor Saitta outlines three steps to a safer future.

Rob Schnepp is a 30-year veteran of the fire service and retired as the division chief of special operations for the Alameda County, CA, Fire Department. Rob has vast experience in emergency response and served as incident commander on numerous large-scale emergencies. Rob has written two hazardous materials response textbooks and numerous peer-reviewed fire-service-related articles on incident command. He is an instructor at the National Fire Academy and for the US Defense Threat Reduction Agency, providing hazmat/WMD training to an international audience. Rob is a principal in Blackrock 3 Partners, a firm specializing in consulting, training, and war-gaming in the areas of incident management and command.

Presentations

Incident management for DevOps Tutorial

The Incident Management System (IMS) has been battle tested by the American Fire Service for over 40 years across fires, rescues, hazardous materials incidents, and every other type of emergency. Rob Schnepp, Chris Hawley, and Ron Vidal explain how they adapted IMS for IT and offer an early look at content from Incident Management for IT Operations, their upcoming book from O’Reilly Media.

David Shackelford is an empathy-driven senior product manager at PagerDuty who loves solving challenging customer problems. With a background in education and game theory, David nerds out on the intersection of complex technical systems and even more complex human behavior.

Presentations

Lessons from the dark side: DevOps and product management Session

Product management is not often part of the DevOps conversation, but collaboration between product, development, and operations is critical to shipping great software. David Shackelford explains why performance and reliability are just part of shipping customer value, how managing complex systems applies to product management, and opportunities for both practices to learn from each other.

Dan Slimmon is in charge of Ops for the Minneapolis-based machine-to-machine engineering firm Exosite, where he’s responsible for developing the infrastructure that drives Exosite’s data platform toward ever-greater performance and reliability. Before moving to the Midwest, Dan managed the operations team at Blue State Digital, whose SaaS online marketing platform is leveraged by hundreds of political campaigns and nonprofits. Making frequent use of statistical methods and data visualization, Dan evangelizes for ubiquitous measurement in software development, infrastructure design, and really everything a business does. He builds insight by examining tech through lots of different lenses, from the social to the mathematical to the historical to the linguistic.

Presentations

Office Hours with Dan Slimmon Office Hours

Join Dan at Table 2 in the O'Reilly booth during the Wednesday afternoon break to discuss differential diagnosis, common ground, and ChatOps.

Troubleshooting without losing common ground Session

Common ground, an important concept in recent teamwork research, helps us understand why collaborative troubleshooting breaks down over time, leading to wasted effort and mistakes. Drawing on common ground as well as some ideas from medical diagnosis, Dan Slimmon demonstrates that by extending ChatOps, we can make troubleshooting much easier without losing the benefits of fluid team conversation.

Ines Sombra is director of engineering at Fastly, where she spends her time helping the web go faster. Ines holds an MS in computology with an emphasis on cheesy ’80’s rock ballads. She has a fondness for steak, fernet, and a pug named Gordo. In a previous life, she was a data engineer.

Presentations

So we hear you like papers Keynote

Surprisingly enough, academic papers can be interesting and very relevant to the work we do in industry as practitioners. Ines Sombra and Caitie McCaffrey demonstrate how academic papers can radically change your perspective and introduce you to new ideas, offering a tour of papers that have reshaped the way they think about building large-scale distributed systems.

Steve Souders is chief curver at SpeedCurve, where he works on the interplay between performance and design. Steve previously served as Google’s head performance engineer, Yahoo’s chief performance officer, and Fastly’s chief performance officer. Steve pioneered much of the work in the world of web performance. He is the author of High Performance Web Sites and Even Faster Web Sites as well as the creator of many performance tools and services including YSlow, HTTP Archive, Episodes, ControlJS, and Browserscope. Steve taught CS193H: High Performance Web Sites at Stanford and serves as cochair of Velocity, the web performance and operations conference from O’Reilly.

Presentations

Thursday opening welcome Keynote

Velocity program chairs John Allspaw, Steve Souders, and Mary Treseler open the second day of keynotes.

Wednesday opening welcome Keynote

Velocity program chairs Steve Souders, John Allspaw, and Mary Treseler open the first day of keynotes.

Phil Stanhope is vice president of technology at Dyn. Phil’s focus varies across engineering, infrastructure, architecture, analytics, operations, and emerging technology strategy and planning. Phil is a known thought leader in the industry, having served on numerous advisory boards and technology adoption programs over the past 25 years.

Presentations

Look back, look now, see forward Session

Phil Stanhope and Nicolas Brousse explore why enterprises need to arm themselves with the right tools to detect and remediate problems and protect their Internet-connected assets.

Philip Tellis is a geek who likes to make the computer do his work for him. As chief architect and RUM distiller at SOASTA, he analyzes the impact of various design decisions on web-application performance, scalability, and security. He is the lead developer of boomerang—a JavaScript-based web-performance testing tool. Philip has spoken at several conferences in the past, including FOSS.IN, FREED.IN, Ubuntulive, Linux Symposium, OpenSource Bridge, PHP Quebec, ConFoo, FOSDEM, IPC, WebDU, Velocity and JSConf. He writes for Smashing Magazine and blogs at The Other Side of the Moon. In his spare time, Philip enjoys cycling, reading, cooking, and learning spoken languages.

Presentations

Measuring continuity Session

Whenever we speak of measuring web performance, we always refer to measuring static events, like page load or time to first tweet. A performant user experience is much more than that. Philip Tellis and Nic Jansma explore methods of measuring web performance as it relates to continuous interactions between the user and a page.

Zack Tollman is lead engineer on the Wired tech team. Before joining Wired, Zack worked as technical lead at the Theme Foundry and senior web engineer at 10up. He holds a bachelor of arts degree from Lawrence University and a PhD from Washington State University. He currently lives in San Francisco with his wife.

Presentations

HTTPS is coming: Are you prepared? Session

An HTTPS-only Web is imminent. Google, Firefox, and the IETF are currently engaged in major efforts to make the Web secure by default. According to SSL Pulse, ~64% of the top 1 million websites that use HTTPS are misconfigured and insecure. Do you know how to configure HTTPS properly? Zack Tollman explores the key aspects of HTTPS to empower developers to deploy secure HTTPS sites.

M​ary Treseler is vice president of content strategy at O’Reilly Media, ​where she leads an editorial team that covers a wide range of topics from DevOps to design, and the chair of O’Reilly’s Velocity Conference. Mary has been working on technical content for 25 years, acquiring and developing content in areas such as programming, software engineering, and product design. A Boston native, Mary lives​ oceanside​ ​in Padanaram, MA.

Presentations

Thursday opening welcome Keynote

Velocity program chairs John Allspaw, Steve Souders, and Mary Treseler open the second day of keynotes.

Wednesday opening welcome Keynote

Velocity program chairs Steve Souders, John Allspaw, and Mary Treseler open the first day of keynotes.

Ozan Turgut is an engineer at SignalFx who has designed and led implementations of cutting-edge web applications. Ozan is a JavaScript expert and a web application generalist, experienced with the full stack and the many flavors it comes in. You can see some of his personal projects on GitHub.

Presentations

Turning high-velocity data into leverage for people Keynote

We are witnessing an explosion in the sheer mass and velocity of data. But this data is most useful if the actual builders and operators—the people with all the context—can understand it and react to it quickly. Ozan Turgut discusses how to use visualization and analytics to turn data into leverage for decision making.

Andre Vachon has worked on the Windows operating system and development tools in various groups at Microsoft, including WinDbg, the MS C++ compiler, Microsoft crash analysis, and Skype telemetry. Currently, Andre is part of the Azure Performance team, focused on delivering tools to help identify and improve the performance of Azure Compute and Storage services. Andre is a frequent speaker at WinHEC conferences, delivering insights into device driver development and debugging.

Presentations

Execution graphs Session

Execution graphs enable the capabilities of a debugger and profiler on top of log and tracing data emitted by a multiple services. Andre Vachon explains that although distributed tracing tools work well when a common tracing library is used across all components, execution graphs provide a more general schema to extract and save data from heterogeneous, loosely coupled distributed systems.

Tom Valine is a Silicon Valley veteran with a passion for producing technology that has a real impact on both businesses and individuals. Having had the good fortune to work at some of the most recognizable names in the industry, including IBM, NVIDIA, Transmeta, Sun Microsystems, and Atmel, Tom is currently director of infrastructure engineering for the Diagnostics, Visibility, and Analytics group at Salesforce.

Presentations

Getting started with Argus 3-hour tutorial

Developed at Salesforce, Argus is an open source time series monitoring and alerting platform designed to scale. Tom Valine walks attendees through building, configuring, deploying, and operating Argus, offering an overview of Argus features and architecture from building and running unit tests and working with metrics and events to custom dashboards, alerting, and the REST services.

Peter van Hardenberg was an early employee of Heroku and has seen the organization grow from a dozen people through acquisition and massive scale. During the past six years, Peter has held a wide variety of roles from engineer to senior management. In past lives, Peter was an oceanographer, a Shakespeare scholar, and a game developer.

Presentations

Building a user-centric Ops, support, and engineering team Session

The old wisdom about keeping engineers away from customers is bunk. Your product team may be experts on the customer perspective, but everyone can benefit from developing user empathy. Using the Heroku Postgres team as a case study, Peter van Hardenberg explains how to build a highly scaled organization with world-class operations and support and a deep appreciation for the challenges users face.

Seth Vargo is the director of technical advocacy at HashiCorp. Previously, he worked at Chef (Opscode), CustomInk, and a few Pittsburgh-based startups. He is the author of Learning Chef. Seth is passionate about reducing inequality in technology. When he is not writing, working on open source, teaching, or speaking at conferences, Seth enjoys spending time with his friends and advising nonprofits. He loves all things bacon.

Presentations

Modern infrastructure with Terraform Tutorial

Seth Vargo offers a comprehensive, engineer­-led overview of two of HashiCorp's tools: ­Terraform and Atlas.

Ron Vidal is a partner at Blackrock 3 Partners, a leading incident management firm. Ron’s technology career spans 30 years as a senior executive in critical infrastructure including fiber optic and wireless telecommunications networks, data centers, electric power networks, and oil and gas facilities for Level 3 Communications, MFS Communications, UUNet Technologies, and Kiewit. Ron led teams on $19 billion of M&A transactions and $14 billion of public market financings. Ron managed Level 3’s executive response in New York City after the 9/11 World Trade Center terrorist attack and previously served on Mayor Dinkins’s NYC Task Force on Network Reliability. Ron is a technical peer reviewer for FEMA’s Assistance to Firefighters Grant program and has been a volunteer firefighter in four states. Ron is a member of two working groups on the California Cybersecurity Task Force.

Presentations

Incident management for DevOps Tutorial

The Incident Management System (IMS) has been battle tested by the American Fire Service for over 40 years across fires, rescues, hazardous materials incidents, and every other type of emergency. Rob Schnepp, Chris Hawley, and Ron Vidal explain how they adapted IMS for IT and offer an early look at content from Incident Management for IT Operations, their upcoming book from O’Reilly Media.

Avleen Vig is a staff operations engineer at Etsy, where he spends much of his time growing the infrastructure for selling knitted gloves and cross-stitch periodic tables. Before joining Etsy, Avleen worked at several large tech companies, including EarthLink and Google, as well as a number of small successful startups.

Presentations

ELK: Moose-ively scaling your log system Tutorial

Deploying and scaling ELK is often a trial by fire. Avleen Vig presents the lessons Etsy learned over the last three years working with ELK and scaling it from tens of millions to over five billion log lines per day. Avleen covers sizing your cluster, monitoring and extracting metrics from it, and patterns for effectively scaling it up as your logs grow.

Yoav Weiss is a principal architect at Akamai, where he focuses on making the web platform faster by adding performance-related features to browsers as well as to Akamai’s CDN. Yoav has been working on mobile web performance for longer than he cares to admit. He takes image bloat on the web as a personal insult, which is why he joined the Responsive Image community group and implemented the various responsive images features in Blink and WebKit. When he’s not writing code, he’s probably slapping his bass, mowing the lawn in the French countryside, or playing board games with his family.

Presentations

Office Hours with Yoav Weiss Office Hours

Join Yoav at Table 1 in the O'Reilly booth during the Thursday morning break to discuss third-party performance issues and how to solve them, preload and the various use cases it solves, and responsive images.

Taking back control over third-party content Session

Our love-hate relationship with third parties has taken a turn for the worse. While they often pay the bills, HTTP/2 means they’re more of a performance burden, ad blockers mean users have had enough, and projects like Google AMP mean that embedders feel the same. Yoav Weiss explores how to gain back control of your site, discussing mitigation tactics as well as a long-term plan to restore sanity.

Working in Internet infrastructure, web app security, and design taught Casey West to be a paranoid, UX-oriented, problem-solving Internet plumber. His earliest contributions to Perl live on to this day on your Mac. Casey’s speaking and writing topics range from open source communities and culture to technical architecture and automation tips and tricks. Casey wears the mantle of principal technologist focused on Pivotal’s Cloud Foundry platform and lives in Pittsburgh, where he’s raising three sarcastic children.

Presentations

Minimum viable platform Session

Platforms: either you have one, or you’re building one. Drawing on observations made over the years, Casey West explores in detail six high-level characteristics common to operationally mature production environments and demonstrates these capabilities in an interactive, live exposition.

Estelle Weyl started her professional life in architecture and then managed teen health programs. In 2000, Estelle took the natural step of becoming a web standardista. She is the Open Web and performance evangelist for Instart Logic and has consulted for Kodak Gallery, SurveyMonkey, Samsung, Yahoo, Visa, and Apple, among others. Estelle shares esoteric tidbits learned while programming and detailed grids of CSS3 and HTML5 browser support in her blog. She is a coauthor of Mobile HTML5, CSS3: The Definitive Guide, and HTML5 and CSS3 for the Real World. While not coding, Estelle works in construction, dehippifying her 1960s throwback abode.

Presentations

Accessibility as performance Session

Semantic markup helps ensure accessibility while reducing the need for frameworks. Estelle Weyl explains how you can reduce your CSS and JS by up to 95% and obliterate your queue of accessibility bugs by writing semantic HTML and leveraging CSS selectors and the cascade. By developing with web standards, you can create accessible, performant websites.

Dustin Whittle is a developer evangelist at AppDynamics, where he focuses on helping organizations manage application performance. Before joining AppDynamics, Dustin was CTO at Kwarter, a consultant at SensioLabs, and developer evangelist at Yahoo. He has experience building and leading engineering teams and working with developers and partners to to scale up. When Dustin isn’t working, he enjoys flying, sailing, diving, golfing, and traveling around the world. Find out more at Dustinwhittle.com.

Presentations

Modern performance best practices Tutorial

Dustin Whittle offers a practical introduction to modern performance best practices for web apps, diving into the latest tools and best practices for launching an ideal end-user experience. Find out how you can leverage Chrome Developer Tools, Google PageSpeed, and WebPagetest to get started improving your applications.

Office Hours with Dustin Whittle Office Hours

Join Dustin at Table 1 in the O'Reilly booth during the Wednesday lunch break to discuss modern performance best practices, optimizing frontend performance, and monitoring the end-user experience.

Jamie Wilkinson works as a site reliability engineer in Google’s storage infrastructure group, primarily on automation and monitoring, to improve the lives of his teammates when on call.

Presentations

A practical guide to monitoring and alerting with time series at scale Session

Monitoring only sucks when the cost of maintenance scales proportionally with the size of the system being monitored. Recently, tools have emerged that assist with scaling out monitoring configurations sublinearly with the size of the system. Jamie Wilkinson explores time series-based alerting and offers practical examples that can be employed in your environment today.

Volker Will leads technical evangelism for DevOps in the Microsoft DX/TED organization with a focus on technical strategy and enablement for the global Microsoft DX field evangelism organization. Volker has over 25 years of experience working with developers and IT operations professionals in various roles. In his current role, he combines his experience and knowledge of developer and operations audiences to get companies started on improving their DevOps strategies and help customers discover the positive impact that DevOps practices have on organizational culture, the development life-cycle, and the bottom line.

Presentations

“Microsoft is irrelevant to me. I use a Mac": Is this you? Keynote

Microsoft has evolved. Among many other things, it is investing and transforming to provide the best mobile app development experience on the planet. DevOps is the backdrop for Microsoft's transformation. Volker Will shares Microsoft's experiences to help you on your own journey and evolve your point of view.

Gabe Wishnie is a principal software engineering manager at Microsoft, where he focuses on cloud infrastructure systems—specifically on monitoring. Currently, Gabe is the lead for the MDM system within Microsoft’s Windows Azure organization.

Presentations

Monitoring the Microsoft cloud Session

Gabe Wishnie explains how Microsoft monitors the cloud services it provides at high scale with low latency through a multidimensional metric (MDM) system. Gabe offers an introduction to the architecture Microsoft uses, lessons learned along the way, and the areas in which it is still investing.

Yang Yang is a statistician interested in understanding data and digging signals via statistical modeling. Yang’s work involves developing metrics and methods to track the long-term success of web services and detect key drivers of user engagement and verify them rigorously through statistical analysis. Yang is also keen on developing novel statistical methods to solve real data problems. During her PhD at University of Michigan, Yang proposed new methodologies to analyze panel data with incomplete observations by stochastic optimization techniques and MCMC sampling.

Presentations

Robust anomaly detection for real user monitoring data Session

For the past year, LinkedIn has been running and iteratively improving Luminol, its anomaly detection system for real user monitoring data. Ritesh Maheshwari and Yang Yang offer an overview of Luminol, focusing on how to build a low-cost end-to-end system that can leverage any algorithm, and explain lessons learned and best practices that will be useful to any engineering or operations team.