Build & maintain complex distributed systems
17–18 October 2017: Training
18–20 October 2017: Tutorials & Conference
London, UK

Speakers

Using a full stack, end to end view of applications and services, expert practitioners will share insights for building the fastest and most durable systems.

New speakers are added regularly. Please check back to see the latest updates to the agenda.

Filter

Search Speakers

Alexander Akbashev is a senior DevOps engineer at HERE Technologies. Alexander’s background is in QA and performance engineering. He maintains several plugins for Jenkins, including the S3 plugin, Build Rotator, FluentD, and Groovy Event Listener, and has fixed several performance and stability issues in the Jenkins ecosystem (Jenkins Core, EC2 plugin, Timestamper, Parameterized Trigger, Build Failure Analyzer, etc.).

Presentations

Scale CI from 20K to 140K builds per day Session

Alexander Akbashev explains how his company scaled a single-instance Jenkins master from 20K builds per day to 140K using Amazon AWS services (EC2, S3, Memcache, etc.). Everything done to achieve this result was open sourced and upstreamed.

Manuel Alvarez is an enterprise architect at Akamai, where he focuses on designing complex technical solutions over a variety of technologies and driving business objectives by implementing solutions aligned with the organization goals with proven ROI, such as implementing advanced caching techniques to reduce platform costs. Over his career, Manuel has worked on large projects with financial institutions, high-tech companies, and telecommunication services across multiple geographies.

Presentations

Failover early: When to failover at your CDN Session

By failing to prepare, you are preparing to fail. Your risk mitigation strategy must layer the most cost-efficient strategies to effectively mitigate or reduce the adverse effects of failure. Manuel Alvarez explores using the CDN as a failover tool, reviewing use cases and demonstrating how to decide whether to use a CDN by evaluating costs, benefits, operations, and time to mitigate.

Kolton Andrus is cofounder and CEO of Gremlin Inc. Previously, he was a chaos engineer at Netflix working on improving streaming reliability and operating edge services. He also designed and built FIT, Netflix’s failure injection service. Prior to Netflix, was at Amazon, where he improved the performance and reliability of the company’s retail website. He is an experienced call leader, managing the resolution of company-wide incidents. Kolton is passionate about building resilient systems, largely because it lets him break things for fun and profit.

Presentations

Meet the Experts with Kolton Andrus Meet the Experts

Ask Kolton all of your chaos engineering questions.

The evolution of chaos Keynote

Chaos engineering is intentionally injecting failure into a system to proactively identify and fix problems before they cause outages. It’s an emerging discipline, but its roots are decades old. Kolton Andrus explores the evolution of chaos engineering, how to begin your journey toward resilient systems, and how to make those pagers quit buzzing at 3:00am.

Thomas Barns is risk modeling and performance engineering service lead at Capacitas, where he is responsible for service definition and ensuring consistent best practice across projects. Over the past 10 years, Thomas has provided capacity and performance expertise to clients and owned the roadmap for developing Capacitas’s technical software solutions. During this time, he has seen a big shift in how software engineering is undertaken and viewed by the business and has built on this to introduce more effective and efficient performance risk management processes. This has meant shifting focus away from large-scale system testing to a full lifecycle approach, alongside research and development in automated data analysis. Thomas is currently defining and governing performance engineering processes and standards for a multimillion-pound multivendor program of work at a FTSE 100 company.

Presentations

Continuous performance engineering: Moving fast without breaking things Session

With ever-increasing demands for fast business change, how can we ensure our digital channels reflect the exacting standards of performance our customers (and business owners) expect? What does this look like in an age of DevOps and continuous delivery? Thomas Barns and John Pillar share a strategy for shifting left and automating performance analysis.

Andrew Betts is principal developer advocate at Fastly. Andrew’s area of expertise is emerging web technologies, particularly on mobile and tablet platforms. Previously, he was a PHP and JavaScript developer, web standards advocate, and founder of FT Labs, an emerging web technologies division of the Financial Times, where he and his team created the FT web app, one of the best examples of what can be achieved with HTML5.

Presentations

And now for something Vary different Session

Most people working with CDN caches know about the Vary header, but few properly understand what it really does. And with the advent of the Key header, new patterns for varying cache content will emerge. Andrew Betts shares common and advanced use cases for Vary, such as language, A/B testing, compression, and service worker support, and outlines potential changes to consider when Key arrives.

Marianna Bezler is a software engineer on Instart Logic’s security and performance engineering team. She focuses her efforts on creating a platform that makes building integrity and performance features easy and fast. Her coworkers tell her she has an uncanny ability to run and debug entire programs in her head. Previously, she was an engineering manager at Famo.us, where she corralled a group of talented, enthusiastic hackers into creating a platform for quickly creating mini-apps.

Presentations

Seeing what’s wrong just right Session

A developer hunting for a bug is like a doctor hunting for an illness. She does not need complete understanding of the body for the hunt to be successful. Jasvir Nagra and Marianna Bezler share a few painful distributed web app debugging anecdotes and an alternate approach using virtualization and visualization to get a holistic view of a program to track down elusive bugs.

Kiran Bhattaram is an engineer on the observability team at Stripe. She has previously written code for the New York Times, LinkedIn, and MIT CSAIL. In her spare time, Kiran enjoys making things, whether tinkering with circuits, sewing dresses, or woodworking.

Presentations

A tour of sketching data structures for stream processing Session

As the scale of data our systems produce continues to increase, the techniques our systems use to process it must evolve. Kiran Bhattaram explains why sketches are a good option for leveraging more sophisticated data structures.

Mihai Bojin is a software engineer at Salesforce, based in Dublin, Ireland. He’s also an open source author. When he’s not busy writing code in Java or JavaScript or investigating challenging production issues at Salesforce, Mihai is either spending time with his two-year-old son and partner or baking bread.

Presentations

Scaling up your monitoring Session

Have you ever had to monitor the health of your service (server stats, application errors, etc.)? What if you had to monitor the cloud, with its hundreds of thousands of servers? Alerts can create noise and spam your team. Mihai Bojin and Kamil Smuga explain how Salesforce approaches monitoring at scale by putting customers first.

Peter Bourgon is a distributed systems engineer moving bytes around at Fastly. Peter is also an open source contributor.

Presentations

Emergent distributed architectures: Microservices and data pipelines Session

Peter Bourgon and Sean Braithwaite offer an overview of microservices and data pipelines, explaining how both systems reflect the organizations and people that build them (in adherence to Conway’s law) and can be well understood in terms of their relationship to change and time. You'll learn the virtues and vices of each architecture and get enough context to apply them coherently.

Sam Boyer is a site reliability engineer at VividCortex. Sam fell into software engineering by accident but decided to stick around because he loves complex, ecosystem-class problems, cares deeply about building healthy communities, and is always looking for ways to bring technology’s enormous potential to bear on the world’s critical problems.

Presentations

The path to resilience Session

Resilience engineering is a holy grail of modern software engineering, granting enormous benefits but difficult to achieve and dangerous to even attempt for the unprepared. Sam Boyer explores major concepts behind resilience engineering and discusses how to move toward resilience without shooting yourself in the foot.

Sean Braithwaite is a data scientist and engineer based in Berlin. For the past eight years, he’s been using data for everything from data-driven art installations to real-time ad bidding. Most recently, Sean has been responsible for scaling SoundCloud’s data pipeline to handle billions of events per day.

Presentations

Emergent distributed architectures: Microservices and data pipelines Session

Peter Bourgon and Sean Braithwaite offer an overview of microservices and data pipelines, explaining how both systems reflect the organizations and people that build them (in adherence to Conway’s law) and can be well understood in terms of their relationship to change and time. You'll learn the virtues and vices of each architecture and get enough context to apply them coherently.

Janna Brummel is an IT chapter lead (a line manager who still does day-to-day work) on the SRE team at ING Bank in Amsterdam. Previously, Janna worked as a business manager for ING’s CIO and as a dev engineer developing software for ING’s debit and credit card backend systems.

Presentations

How to make a lion bulletproof: Setting up site reliability engineering (SRE) in a global financial organization Session

Did you read the O’Reilly book about Google SREs but doubt that SRE will work for your more traditional or more regulated company? Janna Brummel and Robin van Zijll explain how they implemented SRE in a global financial organization, providing an overview of methods and technologies and sharing lessons learned from a year of doing SRE.

Francesc Campoy Flores is a developer advocate for Go and the cloud at Google. Francesc joined the Go team in 2012, and since then he has written considerable didactic resources and traveled the world attending conferences, organizing live courses, and meeting fellow gophers. He joined Google as a backend software engineer working mostly in C++ and Python but rediscovered how fun programming can be when he found Go.

Presentations

Go performance analysis in action Tutorial

Francesc Campoy Flores walks you through the tools that make Go a great programming language, from the well known "go" tool to lesser known tools that allow you to profile, debug, and understand the performance of your programs.

Colin Charles is the chief evangelist at Percona. Previously, Colin was on the founding team of MariaDB Server, worked at MySQL, and worked actively on the Fedora and OpenOffice.org projects. Colin has been a MySQL user since 2000. He’s well known within open source communities in APAC and has spoken at many conferences.

Presentations

Best practices for MySQL high availability Tutorial

The MySQL world is full of trade-offs, and choosing a high-availability (HA) solution is no exception. However, only with high availability can you achieve distributed systems in your database layer. Colin Charles explores the MySQL high-availability landscape, offering deep dives into current technologies, recommendations, and what to look out for.

Capacity planning for your data stores Session

Databases require capacity planning. (To those coming from traditional RDBMS solutions, this can be thought of as a sizing guide.) Capacity planning prevents resource exhaustion, but it can be hard. Colin Charles explores storage capacity planning for OLTP and data warehousing uses.

Meet the Experts with Colin Charles Meet the Experts

Colin is here to talk about MySQL, MariaDB server, high availability, security, capacity planning, MongoDB, and other database-related issues.

Soo Choi is part of Devops Research and Assessment team aka, In the past, she has been a Sr Product Manager at Chef Software, Senior Director at Rackspace for OpenStack products, a Co-Founder and COO of Anso Labs, and a Program Manager at NASA for the Nebula Project. In her spare time, she enjoys chasing her two children around in Northern California coast.

Presentations

Changing Diversity Constructs, My Journey As A Women In DevOps Session

Soo shares her experiences as a woman in tech. Even though she worked for NASA and co-founded her own successful company, rampant sexism in IT and bad experiences speaking in public nearly destroyed her career. She will examine common constructs about diversity and propose ideas to bring productive change to continue to build upon the solid foundation of inclusion we have created.

Jürgen Cito is a PhD candidate at the University of Zurich, Switzerland, where his research investigates the intersection between software engineering and performance engineering. In the summer of 2015, he was a visiting research scholar working on cloud analytics at the IBM TJ Watson Research Center in New York. In the spring of 2016, he was a visiting PhD student at the Massachusetts Institute of Technology (MIT), where he worked on program analysis to conserve energy in mobile applications. Prior to starting his PhD, Jürgen was a software engineer for performance monitoring solutions at Catchpoint Systems, a technology consultant at Accenturem, and a software developer for web agency itellico internet solutions.

Presentations

Can we make developers care about operations? Session

Can we make developers care about operations? Jürgen Cito shares real-world experience of developers struggling with operations and details a journey to incorporate runtime performance aspects into the developer's daily workflow and reduce performance problems reaching production.

Matthew Clark leads the architecture for many of the BBC’s websites and apps. He’s overseen the design and operation of some of the BBC’s biggest online events, including the Olympic Games and UK elections. He lives in Manchester, England.

Presentations

Watch out! The nanoservices are comIng. Session

Welcome to the world of nanoservices: smaller than a microservice, bigger than a function, they are the perfect unit of software. Nanoservices are flexible, manageable, and scalable and a great way to do serverless computing. Matthew Clark explains how to get nanoservices right, drawing on his experience at the BBC, which now has over a thousand in production.

Anne Currie is Chief Strategist at Container Solutions. Anne has worked in tech for over 20 years as an engineer, manager, and entrepreneur, doing everything from inventing desktop products to helping people buy underwear 24 hours a day. She likes to believe she can still read assembly language AND help you buy the right socks at 1:00am.

Presentations

Distributed systems: What can go wrong will go wrong Session

Forget Conway's law. In distributed systems, Murphy’s law rules: Everything that can go wrong will go wrong. Anne Currie discusses common failure modes, how to approach diagnosing highly complex issues, and what we can learn from detectives like Sherlock Holmes, Hercule Poirot, and Miss Marple.

Frederik Deweerdt is a software engineer at Fastly, the real-time CDN, where he leads a team that writes software to handle edge traffic. Frederik has almost 20 years of experience in software development, mostly coding networking software in C. His projects have ranged from a carrier-grade SMTP/DNS/SMS server to a satellite-oriented TCP/HTTP proxy.

Presentations

HTTP/2 development: A hands-on approach Tutorial

HTTP/2 (or H2, as the cool kids call it) has been ratified for months, and browsers already support it. But do the exciting features that HTTP/2 offers meet expectations? Frederik Deweerdt explores how HTTP/2 fares in the real world, how browser behavior is changing to accommodate new server-side functionality, and how you can get the most of the new protocol everybody’s talking about.

Sara-Jane Dunn is a scientist working in the area of biological computation at Microsoft Research in Cambridge, where her research focuses on uncovering the fundamental principles of biological information processing, particularly investigating decision making in stem cells throughout development. Sara joined Microsoft Research as a postdoctoral researcher before transitioning to a permanent scientist role. In 2016, she was invited to become an affiliate researcher of the Wellcome Trust-Medical Research Council Stem Cell Institute at the University of Cambridge, where she collaborates extensively with experimentalists to better understand how to harness the power of these unique cells. She holds an MMath in mathematics and a DPhil in computational biology from the University of Oxford, where her doctoral research focused on the development of a computational model of the gut to investigate the earliest stages of carcinogenesis in colorectal cancer.

Presentations

Biological computation Keynote

Sara-Jane Dunn discusses an entirely different paradigm of computing: the information-processing carried out by cells. Focusing on examples from cutting-edge stem cell research, Sara shares formal techniques from computer science that allow us to peer into the inner workings of biology, make sense of the earliest stages of development, and even program cells for use in therapy.

Rick Fast is a distinguished software engineer at Expedia. Previously, Rick was an engineer at CDK Global/Cobalt in Portland, an industry leader in automotive retail software, and served as lead engineer at Nokia HERE (formerly Navteq), a provider of digital map data.

Presentations

Building next-gen edge architecture at Expedia Session

As Expedia refactors its backend services into a finer-grained microservice architecture, frontend applications have begun to be split into smaller applications serving a small number of pages or content on the website. Rick Fast details how Expedia is creating an extremely configurable, self-service edge architecture for routing between frontend applications and managing bot traffic.

Steven Faulkner is the director of engineering at Bustle, where he champions all things serverless. Previously, he cofounded the streaming music startup Murfie. Steve lives in Philadelphia.

Presentations

Going serverless with GraphQL Session

Bustle has transitioned its entire production platform to AWS Lambda and API gateway. But it didn't happen overnight. The change was iterative, and GraphQL played a huge part of the process. Steven Faulkner discusses the different approaches Bustle used to transition services and data off of legacy infrastructure and explains why and how the company used GraphQL as part of the process.

Nicole Forsgren is the CEO and chief scientist at DevOps Research and Assessment (DORA). Nicole is an IT impacts expert who is best known for her work with tech professionals and as the lead investigator on the largest DevOps studies to date. She is a consultant, expert, and researcher in knowledge management, IT adoption and impacts, and DevOps. In a previous life, she was a professor, sysadmin, and hardware performance analyst. Nicole has been awarded public and private research grants (funders include NASA and the NSF), and her work has been featured in various media outlets, peer-reviewed journals, and conferences. She holds a PhD in management information systems and a master’s degree in accounting.

Presentations

Meet the Experts with Nicole Forsgren Meet the Experts

Join Nicole to discuss insights from the latest State of DevOps Report, infrastructure architecture patterns, and the importance of experimentation in software development and delivery.

The key to high performance: What the data says Session

The State of DevOps Report has shown that high-performing IT teams decisively outperform low-performing peers (with greater throughput and stability), creating value that shows up on the bottom line. Nicole Forsgren and Nigel Kersten share insights into the key leadership, technical, architectural, and product capabilities that drive these outcomes.

Hannah Foxwell is a product manager at Server Density. She’s also a human ops champion, hug ops evangelist, and recovering DevOps consultant. Hannah believes that the health of your infrastructure is not just about hardware, software, automation, and uptime; it also includes the health and well-being of your team.

Presentations

Machine learning in ops: Do I need it? Session

Machine learning is the new big data. Everyone is supposed to be on board, but do we understand why? As platforms become more complex and change more frequently than ever before, it's time we stopped trying to maintain them manually. Hannah Foxwell explores the technology and real use cases for machine learning in infrastructure operations and SRE.

Uwe Friedrichsen is CTO of codecentric AG, where he focuses on resilience, scalability, and the IT of (the day after) tomorrow. Uwe has traveled the IT world for many years and is always in search of innovative ideas and concepts. Often, you can find him sharing ideas at conferences or in his many articles, blog posts, and tweets.

Presentations

Real-world consistency explained Session

Uwe Friedrichsen explores the challenges, options, and trade-offs of different consistency models in distributed system landscapes, covering the limitations of ACID transactions, eventual consistency, and current research that tries to fill the gaps between ACID and BASE transactions.

Sebastien Goasguen is senior director of cloud technologies at Bitnami, where he leads all the Kubernetes efforts. Sebastien joined Bitnami through the acquisition of his startup Skippbox. Sebastien is a 20-year open source veteran. A member of the Apache Software Foundation, he worked on Apache CloudStack and Libcloud for several years before diving into the container world. He is an avid blogger and enjoys spreading the word about new cutting-edge technologies. He also trains developers and sysadmins on all things Docker and Kubernetes. Sebastien is the author of the O’Reilly Docker Cookbook and 60 Recipes for Apache CloudStack.

Presentations

Kubernetes training 2-Day Training

Kubernetes is becoming the leading platform for building distributed applications. Sebastien Goasguen walks you through the architecture of a Kubernetes installation, covering the API objects that make up a distributed application, basic operations of Kubernetes primitives, and advanced scheduling scenarios and production concerns.

Meet the Experts with Sebastien Goasguen Meet the Experts

Sebastien is here to talk to you about the Kubernetes API, distributed application design, and the new serverless paradigm.

Laura Hackney is the cofounder and executive director of AnnieCannons, an organization working to transform survivors of human trafficking into software professionals, where she leads AnnieCannons’s curriculum development, nonprofit partnerships, and trauma-informed care and work model. Laura is also a full stack software engineer. Previously, Laura was the manager of the program on Human Rights at Stanford University, a senior research associate for Stanford’s antitrafficking project in the Mekong subregion, and a program associate for the Freeman Spogli Institute’s International Policy Implementation Lab. In addition to analyzing various forms of human trafficking around the world, she has extensively investigated the specific issue of trafficking Burmese women and children into the marriage market of China. Domestically, she works with the Human Trafficking Task Force within the San Francisco Police Department’s Special Victims Unit and supports local Bay Area antitrafficking groups. Laura holds a master’s degree from Stanford and a BA from the University of California, Berkeley.

Presentations

Informed intentions meet Tech for Good Keynote

What happens when Tech for Good and human-centered design actually support the needs of their end users? Laura Hackney explores the pitfalls and successes of the movement to bring social justice work into the technology landscape. Laura also shares insights from AnnieCannons, her nonprofit dedicated to transforming survivors of human trafficking into software professionals.

Ben Hall is the founder of Ocelot Uproar, a company focused on building products loved by users. Ben has worked as a systems administrator, tester, and software developer and launched several companies. He still finds the time to publish books and speak at conferences. Ben enjoys looking for the next challenges to solve, usually over an occasional beer. Ben recently launched Katacoda, an online learning environment for developers that helps break down the barriers to learning new technologies such as Docker and containers.

Presentations

How secure are Docker containers? Session

Docker offers many advantages, simplifying both development and production environments. But there is still uncertainty around the security of containers. Ben Hall answers the question, How secure are Docker containers?, exploring Docker's security model, its limitations, and how to handle them.

Heinrich Hartmann is the lead data scientist at Circonus, where he is driving the development of analytics methods that transform monitoring data into actionable information as part of the Circonus monitoring platform. Previously, he worked as a researcher for the University of Oxford. Heinrich holds a PhD in mathematics from the University of Bonn.

Presentations

Data science for effective operations 2-Day Training

Gathering telemetry data is key to operating reliable distributed systems at scale. Heinrich Hartmann explores a wide range of data science and analysis methods (both theoretical and practical) that can make you more effective at an operations task.

Michael Hausenblas is a developer advocate for Go, OpenShift, and Kubernetes at Red Hat, where he helps app ops engineers build and operate distributed services. Michael shares his experience with distributed systems and large-scale data processing through demos, blog posts, and public speaking engagements and contributes to open source software such as OpenShift and Kubernetes. Previously, Michael was a developer advocate at Mesosphere, chief data engineer at MapR Technologies, and a research fellow at the National University of Ireland, Galway, where he researched large-scale data integration and the internet of things and gained experience in advocacy and standardization (World Wide Web Consortium, IETF). In his free time, Michael contributes to open source software (mainly using Go), blogs, and hangs out on Twitter too much.

Presentations

Meet the Experts with Michael Hausenblas Meet the Experts

Michael is a Gopher. If you know what that means, you'll likely want to talk with him. He’s also happy to chat about all sorts of cloud-native topics, including containers (CRI-O, Docker, etc.), Kubernetes, OpenShift, Prometheus, and functions as a service (from Amazon Lambda to OpenWhisk).

Edward Hiley is a principal systems engineer with NHS Digital, where he has worked on national services such as SUS+, a ground-up full replacement of the current Secondary Uses Service (SUS), which involves myriad challenges, including immutable infrastructure, disputed compute clusters, and multi-data-center deployments. Previously, Edward was a solution architect for the Health and Social Care Information Centre and an associate director for the National Institute for Health and Clinical excellence.

Presentations

You had one job! Learning to cope with failures in a complex distributed system Session

What are your perceptions of NHS IT? Not great? Well the truth is very different from what you might expect. Ed Hiley and Dan Rathbone offer an overview of the technical renaissance going on in parts of the NHS, where things are being done in a modern way.

Emma Jane Hogbin Westby leads the operations team for shared digital services at the United Nations Office for the Coordination of Humanitarian Affairs (OCHA). She is the author of O’Reilly’s Git for Teams and two books on web development.

Presentations

Humane teams at home and around the world Session

Software development is a social activity that favors direct human contact, yet 21st century life can often get in the way, forcing us to reconsider our communication patterns. Daniel Young and Emma Jane Hogbin Westby explore how to build and maintain happy productive teams, regardless of geography.

Mandy Hubbard is a software engineer and QA architect at Care.com, where she is leading the creation of a CI/CD environment for end-to-end automation of microservices for Care.com’s Financial Services Division. A seasoned professional with over 17 years of QA experience, most of which was spent in fast-paced startup environments driving product quality, Mandy has played a critical role in all stages of software development and has a deep understanding of build-test-deploy processes from development to production.

Presentations

Indispensable, disposable Jenkins Session

You rely on Jenkins to manage the full stack of your continuous delivery pipeline, but why shouldn’t Jenkins itself be software defined, ephemeral, and available at the push of a button? Mandy Hubbard explains how Care.com uses a customized, script-based startup process and Joyent’s ContainerPilot with a just few edits to a Docker Compose _env file to launch Jenkins in a Docker container.

Chris Jackson is the director of cloud platforms at Pearson, where he leads a small team of incredibly talented engineers into a brave new world of containerized platforms for Pearson’s next generation of digital services. Previously, Chris spent eight years at Rackspace observing and participating in the cloud explosion through the eyes of a vendor and learning how open source software is genuinely changing our approach to everything. He is a reformed thought leader who, over a beer, will tell you that many of the world’s problems could be solved with a bit of DevOps. Chris’s wife and two young boys serve as a brilliant reminder that bettering education services is a worthy investment of his time. He also is an avid rugby, football, and Formula One fan who dreams of one day owning a car faster than his release cycles.

Presentations

The story of a startup built to disrupt an enterprise from the inside out Session

Chris Jackson explains how 175-year-old company Pearson built a tech startup within the enterprise with the aim of innovating the developer experience. Chris shares the journey from inception to B-round funding and explains how this startup is establishing the foundation of the company's future.

Kishore Jalleda is the head of production engineering in the Americas in Yahoo’s Publisher Products Unit, which includes many popular destinations like Yahoo.com, Yahoo Finance, Yahoo Sports, and Yahoo News. Previously, Kishore was the head of SRE at Zynga and worked at IMVU, one of the pioneers of continuous delivery (with cofounder Eric Ries, the author of Lean Startup).

Presentations

Want to solve overmonitoring and alert fatigue? Create the right incentives Session

Keeping your signal-to-noise ratio high is a nontrivial problem. Modern tools make it easy to overmonitor (which leads to noise). The result? Missed alarms and unhappy customers. Filtering the noise is not the answer. Kishore Jalleda explains how Yahoo reduced the alert volume from ~200K a month to a few hundred by creating the right incentives and culture.

Angie Jones is a senior software engineer in test at Twitter. Angie has developed automation strategies and frameworks for countless software products. An IBM Master Inventor, she is known for her innovative and out-of-the-box thinking, which has resulted in more than 20 patented inventions in the US and China. Angie shares her wealth of knowledge by speaking and teaching at software conferences all over the world and leading tech workshops for young girls through Black Girls Code.

Presentations

The build that cried broken: Building trust in your continuous integration tests Session

Angie Jones explains how to build stability and credibility into your continuous integration tests so that your team is able to receive the fast feedback it needs for Agile development.

Kavya Joshi is a backend and systems developer at Samsara. Her favorite aspects of being a programmer are reasoning about systems at scale and delving into the inner workings of sophisticated software. When not programming, Kavya tends to spend her time on rock walls and mountaintops.

Presentations

Keeping time in real systems Session

Kavya Joshi explores the fascinating timekeeping mechanisms used in real systems, covering the different expressions of time in the context of practical systems that use them and investigating how the timekeeping mechanism affects the properties of the entire system.

T-minus 3, 2, 1: Future-proofing production systems Keynote

Kavya Joshi shares strategies to prepare systems for flux and scale. Drawing from a range of use cases, including Facebook’s Kraken, which provides shadow traffic, and Samsara's custom load simulator, Kavya demonstrates how to improve your understanding of your systems as they run today and plan for how they'll run tomorrow.

Vasia Kalavri is a postdoctoral researcher in the Systems Group at ETH Zurich, where she is working on distributed data processing, data center performance, and graph streaming algorithms. Vasia is a PMC member of Apache Flink and a core developer of its graph processing API, Gelly. Vasia holds a PhD in distributed computing from KTH in Stockholm and UCLouvain in Belgium. She interned at Telefónica Research and data Artisans.

Presentations

Online performance analysis of distributed dataflow systems Session

Vasia Kalavri offers an overview of Strymon, a system for predictive data center analytics, and its online critical path analysis module. Strymon analyzes live traces from distributed dataflow systems like Apache Spark, Apache Flink, and TensorFlow to predict bottlenecks and provide insights on streaming application performance.

Nigel Kersten is chief technical strategist at Puppet. Nigel was responsible for the development of the initial versions of Puppet Enterprise and has since served in a variety of roles at the company, including head of product, CTO, and CIO. He has been deeply involved in Puppet’s DevOps initiatives and regularly speaks around the world about the adoption of DevOps in the enterprise and IT organizational transformation. Previously, Nigel worked at Google, where he was responsible for the design and implementation of one of the largest Puppet deployments in the world.

Presentations

The key to high performance: What the data says Session

The State of DevOps Report has shown that high-performing IT teams decisively outperform low-performing peers (with greater throughput and stability), creating value that shows up on the bottom line. Nicole Forsgren and Nigel Kersten share insights into the key leadership, technical, architectural, and product capabilities that drive these outcomes.

Nikki McDonald is a content director at O’Reilly Media, where she writes, edits, and works with the industry’s leading practitioners to develop books, online courses, and training videos to help engineers and developers collaborate more effectively and create and deploy complex distributed systems. She also cochairs O’Reilly’s Velocity Conference, held annually in San Jose, New York, and London. Nikki started out as a features editor at MacUser magazine back when people were still dialing up to the internet with AOL. She lives in Ann Arbor, MI.

Presentations

Friday opening welcome Keynote

Velocity program chairs Nikki McDonald, Ines Sombra, and James Turnbull open the second day of keynotes.

Thursday opening welcome Keynote

Velocity program chairs James Turnbull, Ines Sombra, and Nikki McDonald open the first day of keynotes.

Tyler McMullen is CTO of Fastly, where he is responsible for the system architecture and leads the company’s technology vision. As part of the founding team, Tyler built the first versions of Fastly’s instant purging system, API, and real-time analytics. Before Fastly, Tyler worked on text analysis and recommendations at Scribd. A self-described technology curmudgeon, Tyler has experience in everything from web design to kernel development and loathes all of it. Especially distributed systems.

Presentations

Edge compute: The missing pieces Keynote

Edge computing is a hot topic, but despite all the hype, there are still some major hurdles to overcome before it reaches its full potential. Tyler McMullen outlines the technical and economic challenges and explains how we can get past them.

Meet the Experts with Tyler McMullen Meet the Experts

Tyler is here to discuss edge computing and more.

Christopher Meiklejohn is an independent software engineer and a PhD student at the Université Catholique de Louvain in Belgium. Previously, Christopher worked at Basho Technologies on the distributed key-value store Riak. He loves distributed systems and programming languages and is developing Lasp, a programming language for distributed computation.

Presentations

Scaling a startup with a 21st century language Keynote

Christopher Meiklejohn is building an application that helps users select a bottle of wine based on the wines that they enjoy, using a new programming language called Martinelli. Christopher offers an overview of Martinelli, highlighting the key features of this new language that allow the fault-tolerant, highly scalable operation of his application.

Miriah Meyer is an associate professor in the School of Computing at the University of Utah, where she runs the Visualization Design Lab. Her research focuses on the design of visualization systems for helping analysts and researchers make sense of complex data. Miriah was named a University of Utah distinguished alumni, a TED fellow, and a PopTech science fellow and was included on MIT Technology Review’s TR35 list of the top young innovators.

Presentations

Why an (interactive) picture is worth a thousand numbers Keynote

Feeling overwhelmed by huge amounts of data has become the norm. Creating effective visual representations of data offloads some of the work of quickly finding interesting patterns to our powerful perceptual system. Miriah Meyer explores the role that interactive visualizations can play in helping us find meaning in mounds of data and discusses the limitations of this approach.

Lorna Mitchell is a Leeds-based developer advocate with IBM Cloud Data Services. She brings her technical expertise on a range of topics to audiences all over the world with her writing and speaking engagements, always delivered with a very practical slant. Lorna is the author of PHP Web Services (O’Reilly), PHP Master (Sitepoint), and Git Workbook (Leanpub) and is regularly published at a number of outlets, including net magazine and her blog, Lornajane.net.

Presentations

Meet the Experts with Lorna Mitchell Meet the Experts

Talk with Lorna about the best use cases for queues and how to set yourself up to survive when things go wrong.

Surviving failure in RabbitMQ

Does it matter if this message doesn't get delivered or gets delivered more than once? What about if the system keeps trying to deliver a message that will always fail or if a failure occurred earlier but now those messages can be safely handled? Lorna Mitchell details how to approach different failure scenarios, drawing on examples involving RabbitMQ.

Mindaugas Mozūras is head of engineering at Vinted, the world’s biggest preloved fashion marketplace with a community of 15 million people in nine countries, backed by $60M from Accel, Insight Venture Partners, and Hubert Burda Media, where he spends his time doing his darned best to help make second-hand the first choice worldwide. A software developer, book reader, speaker, movie lover and all-around geek, Mindaugas has made significant contributions to open source projects and is a frequent speaker at software development community events, where he shares his personal experience and that of the Vinted engineering team, who are big fans of continuous deployment, releasing code hundreds times per day into production.

Presentations

Three "last conversations" Session

Last year, Mindaugas Mozūras's company was in dire straits. Its strategy was not working. All the key metrics were drifting downward. People left. The company even did a reorg. During this time, he had many last conversations—sometimes trying to stop people from leaving, other times to let them go. Mindaugas relates three such conversations, sharing lessons on honesty and delivering bad news.

Catherine Mulligan is a research fellow at Imperial College London, where she is codirector of the Centre for Cryptocurrency Research and Engineering, which has launched several startups in the cryptocurrency space. She has 15 years’ international experience in the mobile telecommunications industry and is working extensively to apply blockchain in this space with several operators and SMEs. Cathy is coauthor of the UK Government Office for Science Report Blockchain: Beyond Bitcoin and has authored five books covering both the technology and economics of the mobile communications industries, including SAE/EPC and M2M/IoT. Cathy is a member of the Open and Agile Smart Cities (OASC) Task Force, where she leads standardization activities around ETSI, ISO, ITU, 5G, and open APIs, a fellow and expert member of the World Economic Forum’s Blockchain Council, and the vice chairman of the ETSI ISG on Context Information Management (IoT). Cathy holds a PhD and MPhil from the University of Cambridge and a BSc. (Hons 1) from UNSW, Australia.

Presentations

Blockchain and the future of distributed computing Keynote

Although the blockchain is technically a distributed system, there has been a surprising lack interest from the distributed systems community. Catherine Mulligan explores the implications of the blockchain to distributed systems and explains what needs to be addressed in order to build and maintain them effectively.

Jasvir Nagra is the product security lead at Instart Logic. Jasvir is an internationally published author, security researcher, and speaker who loves tinkering with, designing, and breaking software security systems. He is the coauthor of Surreptitious Software, a book on obfuscation, software watermarking, and tamper proofing, and the former technical lead for Caja, an open source pure JavaScript sandbox for HTML, CSS, and JavaScript, widely used at companies like Google, MySpace, Yahoo, and Magento. Jasvir’s proudest achievement is building a working tic-tac-toe player from a model railroad.

Presentations

Seeing what’s wrong just right Session

A developer hunting for a bug is like a doctor hunting for an illness. She does not need complete understanding of the body for the hunt to be successful. Jasvir Nagra and Marianna Bezler share a few painful distributed web app debugging anecdotes and an alternate approach using virtualization and visualization to get a holistic view of a program to track down elusive bugs.

Max Neunhöffer is senior software developer and architect at ArangoDB. Previously, Max was an academic mathematician at the University of St. Andrews, where he worked on the development and implementation of new algorithms in computer algebra, mainly for the open source system GAP. During this time, he juggled with mathematical big data like group orbits containing trillions of points. He studied mathematics in Heidelberg and holds a PhD from the RWTH Aachen University, where he also finished his habilitation.

Presentations

The computer science behind a modern distributed data store Session

What we see in the modern data store world is a race between different approaches to achieve distributed and resilient storage. The IoT, genomics, and applications for other fields also raise the demand for a stateful layer. Max Neunhöffer walks you through the components and the inner workings of modern open source databases like ArangoDB, Cassandra, Cockroach, and RethinkDB.

Sam Newman is an independent consultant specializing in helping people ship software fast. Sam has worked extensively with the cloud, continuous delivery, and microservices and is especially preoccupied with understanding how to more easily deploy working software into production. For the last few years, he has been exploring the capabilities of microservice architectures. He has worked with a variety of companies in multiple domains around the world, often with one foot in the developer world and another in the IT operations space. Previously, he spent over a decade at ThoughtWorks and then another year with a startup. Sam speaks frequently at conferences. He is the author of Building Microservices (O’Reilly). If you would like to get in touch, please email him.

Presentations

Confusion in the land of the serverless Session

Like any hyped technology, serverless computing promises a lot. However questions remain around its concept and implementation, especially when you start to compare how we've built systems in the past, and what serverless offers us now. Sam Newman asks (and answers), "Is serverless the future or just the emperor's new clothes?"

Sean O’Connor is the lead application developer at Bitly, where he builds systems, reviews code, and works with the awesome crew at the company to create powerful new tools to help people understand the social web. Outside of work, Sean spends most of his time biking around NYC, brewing cider, and keeping his dog out of trouble.

Presentations

The great migration Session

Data center migrations are rare but interesting events. Sean O'Connor shares a play-by-play of Bitly’s 2016 move, touching on the choices made, trade-offs, mistakes, and successes from the company's decision to turn off the lights in the old data center.

John Pillar is a technology leader at Arcadia Group, where he is transforming the company’s digital technology platforms, processes, and culture to be at the forefront of digital commerce. An evangelist with a passion for driving digital innovation and transformation, John has spent the past 19 years in ecommerce building, transforming, and leading technology teams and helping brands to be digital leaders by combining context, discovery, aspiration, and content with high-end ecommerce experiences. John’s career achievements include transforming M&S into a multichannel market-leading digital retailer as head of software engineering for its mobile, retail IT, labs, and digital stores teams. He also helped build the technology team at ASOS, the UK’s leading online fashion retailer, and helped shape the transformation of the retail mortgage industry through technology at GMAC–RFC with the first online mortgage application capturing and processing solution. His teams have won a number of awards, including the Retail Week technology award and Retail Week’s IT team of the year. John is passionate about Agile software development, continuous delivery, and cloud technologies. He has been a keynote speaker at conferences such as the National Retail Federation, Cloud World Forum, Apps World Europe, Cloud Expo Europe, and ITDF, to name a few. In his spare time, John advises technology-based startups.

Presentations

Continuous performance engineering: Moving fast without breaking things Session

With ever-increasing demands for fast business change, how can we ensure our digital channels reflect the exacting standards of performance our customers (and business owners) expect? What does this look like in an age of DevOps and continuous delivery? Thomas Barns and John Pillar share a strategy for shifting left and automating performance analysis.

Guy Podjarny is a cofounder and CEO at Snyk.io, where he focuses on securing open source code. He was previously CTO at Akamai and founder of Blaze.io. He also worked on the first web app firewall and security code analyzer. Guy is a frequent conference speaker, the author of Responsive & Fast, High Performance Images, and the upcoming Securing Third Party Code, and the creator of Mobitest. He also writes on Guypo.com and Medium.

Presentations

Serverless security: What's left to protect? Session

Serverless means handing off server management to the cloud platforms—along with their security risks. With the “pros” ensuring our servers are patched, what’s left for application owners to protect? As it turns out, quite a lot. Guy Podjarny explores the aspects of security serverless doesn’t solve, the problems it could make worse, and the tools and practices you can use to keep yourself safe.

Dan Rathbone is cofounder and technical director of Infinity Works, a 100-strong consultancy and software house based out of Leeds and London, where he builds and operates high-scale and high-performance systems for Infinity Works’ clients. Most recently, Dan has been working with NHS Digital to drive the modernization of critical national services, reengineering them using FOSS, end-to-end DevOps teams, and Agile and Lean delivery techniques. Over his career, Dan has held a number of varied roles focusing on areas from infrastructure to frontend development and most things in between.

Presentations

You had one job! Learning to cope with failures in a complex distributed system Session

What are your perceptions of NHS IT? Not great? Well the truth is very different from what you might expect. Ed Hiley and Dan Rathbone offer an overview of the technical renaissance going on in parts of the NHS, where things are being done in a modern way.

Liz Rice is the technology evangelist at container security specialists Aqua Security. Previously, she cofounded container startup Microscaling Systems, which built a real-time scaling engine and the popular image inspector MicroBadger. Liz has a wealth of software development, team, and product management experience from her years spent working on network protocols and distributed systems and in digital technology sectors such as VOD, music, and VoIP. When not building startups and writing code, Liz loves riding bikes in places with better weather than her native London.

Presentations

Cloud native: Security threat or opportunity? Keynote

Your organization wants to go cloud native, but you don't want to hit the headlines as the victim of the latest hacking scandal. Liz Rice addresses the questions you need answers to: Will your deployments be less secure or more? How do DevOps processes like CI/CD and cluster orchestration affect your security profile? And what can we all do to minimize the risk of exploits?

Meet the Experts with Liz Rice Meet the Experts

Liz would be happy to discuss anything related to containers, particularly container security best practices.

Your (container) secret's safe with me. Session

In a containerized deployment, how do you safely pass secrets like passwords and certificates between containers without compromising their safety? If orchestration means a container can run on any machine in the cluster, how do you minimize who knows your secrets? Liz Rice explores the risks and shares best practices for keeping your secrets safe.

Mike Roberts is an engineering leader and cofounder of Symphonia, a serverless and cloud technology consultancy. Mike is a longtime proponent of Agile and DevOps values and is excited by the role that cloud technologies have played in enabling such values for many high-functioning software teams. He sees serverless architectures as the next technological evolution of cloud systems and is optimistic about their ability to help teams be awesome. Mike can be reached at mike@symphonia.io.

Presentations

Building and running serverless data pipelines on AWS Session

Mike Roberts describes a real-life example where an existing data platform was rearchitected and reengineered to provide several improvements: significantly increased data capacity, reduced cost, and vastly improved development cycle time.

Raj Rohit is a senior data scientist at Episource, where he builds ML algorithms, architects data pipelines, stares at endless Linux logs, and is building the company’s DevOps team. Raj is the author of the Julia Cookbook and is also the moderator of Stack Overflow’s DevOps site.

Presentations

Lessons learned building serverless distributed systems Session

Episource just finished building a scalable, resilient serverless distributed data pipeline for coding medical charts using NLP, which scales seamlessly with the amount of data it takes in as input. Raj Rohit explores the system and the tools used to build it, such as Ansible, Lambda, and Terraform, and shares the pitfalls, failures, successes, and lessons learned along the way.

Gareth Rushgrove is a senior software engineer at Puppet Labs, where he builds interesting tools for people to better manage infrastructure. Previously, he worked for the UK Government Digital Service, focusing on infrastructure, operations, and information security. When not working, he can be found writing the DevOps Weekly newsletter or hacking on software in newfangled programming languages.

Presentations

The monorepo: Storing your source code has never been so much fun Session

The popularity of Git and GitHub has led to an explosion in the number of software repositories. But is creating a new repository always the right approach? Gareth Rushgrove offers an overview of the monorepo—putting all your product's or organization's code in a single repository—covering the advantages of monorepos and the tools to help maintain them.

Baron Schwartz is founder and CEO of VividCortex, the best way to see what your production database servers are doing. He is the lead author of High Performance MySQL and a variety of open source software.

Presentations

Quantifying scalability with the Universal Scalability Law Session

Distributed systems used to be the exception, but today they're the norm, so it's more useful than ever to be able to quantify scalability. Baron Schwartz explains how to use the Universal Scalability Law to characterize how your systems truly behave, why they don't scale like they should, and how to improve them. It's a simple, elegant solution, and, although formal, it requires no math.

Eric Sigler is the head of DevOps for PagerDuty, where he evangelizes culture, automation, measurement, and sharing. Previously, he led the developer tools team, implementing continuous deployment across the company’s infrastructure. During his 16-year career in infrastructure engineering, he’s been the technical operations manager at Minted, where he improved holiday cheer through high availability, and manager of computing systems at Missouri S&T, saving the world one student mailbox at a time.

Presentations

A postmortem of postmortems: Trends and behaviors across organizations Session

Eric Sigler shares data collected and patterns observed in postmortems across a large number of infrastructure operating organizations, covering specific trends and groupings of various types of postmortem practices, follow-on actions, and related behavior.

Dalia Simons is a server developer at Wix. An experienced software engineer, writing server code has been her passion and career for the last 10 years. Dalia believes that keeping your code simple and readable is the key to easily maintaining it over time.

Presentations

Slaying the dragon: How to rewrite a monolith into microservices and stay alive Session

Do you have an old monolith you really want to rewrite, but don’t know where to start? Dalia Simons shares ideas, tips, and strategies for rewriting an important monolith service into microservices while maintaining full availability.

Matthew Skelton is a cofounder and principal consultant at Skelton Thatcher Consulting, where he specializes in helping organizations adopt and sustain good practices for building and operating software systems, such as continuous delivery, DevOps, aspects of ITIL, and software operability. Matthew has been building, deploying, and operating commercial software systems since 1998. He curates the well-known DevOps Team Topologies Patterns and is coauthor of Database Lifecycle Management (Redgate) and Continuous Delivery with Windows and .NET (O’Reilly).

Presentations

Practical, team-focused operability techniques for distributed systems Session

Matthew Skelton shares five practical, tried-and-tested techniques for improving operability with many kinds of software systems, including the cloud, serverless, on-premises, and the IoT.

Kamil Smuga is a senior software development manager leading a team of programmers at Salesforce. Kamil is passionate about running software in production—monitoring, scaling, HA, performance, incident management, and all that jazz. He started in software quality and worked as an SRE before moving to software development.

Presentations

Scaling up your monitoring Session

Have you ever had to monitor the health of your service (server stats, application errors, etc.)? What if you had to monitor the cloud, with its hundreds of thousands of servers? Alerts can create noise and spam your team. Mihai Bojin and Kamil Smuga explain how Salesforce approaches monitoring at scale by putting customers first.

Ines Sombra is director of engineering at Fastly, where she spends her time helping the web go faster. Ines holds an MS in computology with an emphasis on cheesy ’80’s rock ballads. She has a fondness for steak, fernet, and a pug named Gordo. In a previous life, she was a data engineer.

Presentations

Friday opening welcome Keynote

Velocity program chairs Nikki McDonald, Ines Sombra, and James Turnbull open the second day of keynotes.

Thursday opening welcome Keynote

Velocity program chairs James Turnbull, Ines Sombra, and Nikki McDonald open the first day of keynotes.

Mike Strickland leads the FPGA high-performance computing vision within Intel’s Programmable Solutions Group. Mike has more than 20 years of computer, networking, and storage experience with companies such as Hewlett-Packard, Silverback Systems, and Altera (acquired by Intel). Previously, Mike led the development and launch of products in the networking, storage management, TCP/IP offload, and iSCSI spaces. He holds a BS in electrical engineering from Brown University and an MS in management from the Sloan School of Management at MIT.

Presentations

FPGA-accelerated data analytics (sponsored by Intel) Session

Microsoft has widely deployed field-programmable gate arrays (FPGAs) for accelerating search, networking, and machine learning—with a little help from Intel's software expertise and its FPGA programmers. Mike Strickland explains how a single FPGA can deliver significant acceleration for multiple workloads.

Overcoming traditional data analytics performance bottlenecks with inline acceleration (sponsored by Intel) Keynote

A new approach to data analytics acceleration is delivering benchmarked performance increases of 3X to 10X+ at the system level for traditional relational and NoSQL databases.

James Turnbull is the CTO of Empatico. A longtime member of the open source community, James is the author of nine technical books about open source software: The Terraform Book, The Art of Monitoring, The Logstash Book, The Docker Book, Pro Puppet, Pulling Strings with Puppet, Pro Linux System Administration, Pro Nagios 2.0, and Hardening Linux. He was formerly CTO at Kickstarter and an advisor at Docker. James likes food, wine, books, photography, and cats. He is not overly keen on long walks on the beach or holding hands.

Presentations

Friday opening welcome Keynote

Velocity program chairs Nikki McDonald, Ines Sombra, and James Turnbull open the second day of keynotes.

Thursday opening welcome Keynote

Velocity program chairs James Turnbull, Ines Sombra, and Nikki McDonald open the first day of keynotes.

Robin van Zijll is a site reliability engineer and product owner on the SRE team at ING Bank in Amsterdam. He has years of experience being on call for all functionalities used by retail banking customers.

Presentations

How to make a lion bulletproof: Setting up site reliability engineering (SRE) in a global financial organization Session

Did you read the O’Reilly book about Google SREs but doubt that SRE will work for your more traditional or more regulated company? Janna Brummel and Robin van Zijll explain how they implemented SRE in a global financial organization, providing an overview of methods and technologies and sharing lessons learned from a year of doing SRE.

Seth Vargo is the director of technical advocacy at HashiCorp. Previously, he worked at Chef (Opscode), CustomInk, and a few Pittsburgh-based startups. He is the author of Learning Chef. Seth is passionate about reducing inequality in technology. When he is not writing, working on open source, teaching, or speaking at conferences, Seth enjoys spending time with his friends and advising nonprofits. He loves all things bacon.

Presentations

Consul as a monitoring service Tutorial

There are two sides to monitoring: exposing problems and taking action to resolve them. Most monitoring systems handle the first, but Consul handles both. Seth Vargo explains how Consul enables self-healing infrastructure. By coupling service discovery with monitoring, Consul is able to intelligently route traffic away from unhealthy hosts or fail over to geographically different data centers.

Emile Vauge is the founder and developer at Containous and the creator of Traefik. He has more than 10 years experience developing applications for both the web and industrial uses. He’s also a certified Docker trainer. Emile’s current interests include DevOps methods and orchestration tools like Swarm, Mesos, and Kubernetes.

Presentations

Make load balancing great again Session

Emile Vauge explains how to effectively manage inbound network traffic in your container-based infrastructure with Traefik, a modern reverse proxy and load balancer made to deploy microservices with ease.

Alvaro Videla is a lead architect at Migros in Switzerland. Previously, he was a senior software engineer at Apple and a core developer for RabbitMQ. He also worked in Shanghai helping build one of Germany’s biggest dating websites. Alvaro is the coauthor of RabbitMQ in Action (Manning) and is a frequent open source contributer. Outside of work, he likes traveling with his wife, listening to and playing music, and reading books.

Presentations

What we talk about when we talk about distributed systems Session

Distributed systems are a complex. There's abundant research, but sometimes it's hard for a beginner to know where to start. Alvaro Videla discusses the foundational concepts of distributed systems and offers an overview of the best resources for getting started.

Pierre Vincent is SRE manager at Poppulo, where he helps teams embrace DevOps practices, focusing on building maintainable applications and continuously improving their processes. Based in Cork, Ireland, Pierre is passionate about enabling engineers to deliver great software with speed.

Presentations

Increasing visibility of distributed systems in production Session

Understanding the state of a running application is the key to efficiently troubleshooting production issues and ultimately anticipating outages. Pierre Vincent demonstrates how to make monitoring an integral part of development, using health checks, metrics, tracing, and other patterns to get a clearer picture of applications in production.

Salim Virji is a site reliability engineer at Google working on user-facing applications such as Drive and Spreadsheets. Salim’s experience includes planet-scale storage, low-latency distributed applications, and his favorite, distributed consensus.

Presentations

SRE classroom: A hands-on tutorial Tutorial

Salim Virji explores the key concepts behind microservices before guiding you through applying the concepts to evaluate and build systems of your own.

Mandi Walls is technical practice manager for EMEA at Chef. Mandi travels the world helping organizations increase their effectiveness using configuration management and modernizing IT practices. Previously, she ran large web properties for AOL, including AOL.com, Games.com, and Moviefone. She is a regular speaker at technical conferences and is the author of Building a DevOps Culture, published by O’Reilly. Mandi holds a master’s degree in computer science from GWU and an MBA from UNC Kenan-Flagler.

Presentations

Hands on with Chef's Habitat Tutorial

Chef's Habitat project is designed for the automation of your applications, no matter where they have to run. Mandi Walls helps you get started with Habitat and its toolset. You'll learn how to automate various application stacks with Habitat and how to export and manage Habitat-built artifacts with Docker and native Habitat runtime environments.

Meet the Experts with Mandi Walls Meet the Experts

Ask Mandi about Chef's Habitat project and how to increase your effectiveness using configuration management and modernizing IT practices.

Shannon Weyrick is director of technology at NS1. Shannon has been working in internet infrastructure since his start in 1996 at an ISP in upstate New York. He’s been programming since time immemorial and loves it to this day. Shannon can find his way around any full backend stack, but he’s focused on backend software development and has created or contributed to many open source projects throughout the years. Previously, Shannon worked at Internap and F5 Networks, architecting and developing distributed platforms for a variety of infrastructure projects.

Presentations

DDoS war games: Strengthen your team and systems by attacking them Session

DDoS mitigation is an ever-evolving art. Architectures change, attackers get more creative, and keeping your team and tools ahead of the curve is a constant battle. So why not make DDoS preparedness fun as well as practical? Shannon Weyrick explains why you should use DDoS war games to keep your team’s skillset polished, their tools in top shape, and their spirits and confidence high.

Harry Winser is a platforms developer at Rightmove, which means he gets to work on loads of internal projects and tools to help developers delivery software continuously. When not working, he can usually be found hacking on random small projects, blogging, playing guitar, or exploring.

Presentations

Consumer-driven contract testing with Pact and Docker Session

Harry Winser explains how to leverage consumer-driven contracts to achieve fully independent releases of microservices across teams and how to handle a service rollback while still serving over 47 million requests a day. Harry also demonstrates how to use the Pact framework to continuously deliver services that depend on one another and Docker to make developer testing easier.

Jamie Winsor is a lead engineer at Chef Software and the coauthor of Habitat, an open source project built upon distributed system protocol Butterfly to provide a self-healing, self-configuring, stack-agnostic, frictionless abstraction for running applications—regardless of their complexity—to software developers. Jamie has been a software engineer in the video game industry for 10 years, focusing on networked application servers on such titles as League of Legends, Lord of the Rings Online, and Dungeons and Dragons Online. One of Jamie’s responsibilities in his game development tenure was to bring what we today know as DevOps into the daily lives of the other developers on his team, which Jamie accomplished by building, evangelizing, and teaching methods to his peers. He draws on that experience today in building Habitat, as he helps enable all software developers, regardless of their experience, bring their ideas to life without investing in the details of operationalizing an application.

Presentations

Building distributed systems is accessible. I promise. Session

Understanding and building distributed systems can be a daunting task, but like most other software development patterns, distributed systems mimic concepts in the real world that you're already familiar with. Jamie Winsor walks you through building a mental model to help you understand the basics of building distributed systems based on concrete, real-world systems.

Jason Yee is a technical writer and evangelist at Datadog, where he works to inspire developers and ops engineers with the power of metrics and monitoring. He’s also a co-organizer of DevOpsDays Portland. When he’s not speaking at conferences or helping organize them, he likes to spend time on planes “travel hacking” and hunting for interesting, regional whiskey.

Presentations

Monitoring containers: Follow the data Session

Using real-world metrics data from thousands of organizations, Jason Yee explores the latest trends in container adoption and use, shares data on what types of applications organizations are running in containers, and explains how to best monitor these containerized applications.

Dan Young is CEO and cofounder of small, London-based software consultancy EngineerBetter, which places an emphasis on XP values and practices, including pair programming on colocated teams. Dan’s 15-year career has been a multidisciplinary journey through web ops, large-scale carrier networking, and product management leadership. He is driven by a strong desire to reduce friction and find more effective ways of working in large organizations.

Presentations

Humane teams at home and around the world Session

Software development is a social activity that favors direct human contact, yet 21st century life can often get in the way, forcing us to reconsider our communication patterns. Daniel Young and Emma Jane Hogbin Westby explore how to build and maintain happy productive teams, regardless of geography.