Presentation: Designing Services for Resilience Testing @Netflix

Track: DevOps: You Build It, You Run It

Location: Ballroom BC

Day of week:

Slides: Download Slides

Level: Intermediate - Advanced

Persona: Architect, Chaos/Resiliency/SRE, CTO/CIO/Leadership, Developer, DevOps Engineer, Technical Engineering Manager

Abstract

As an industry, we focus on designing microservices for availability. However, we don’t tend to speak about enabling these same services for resiliency testing. In a perfect world, you wouldn’t need resiliency testing, but that’s not the reality we are currently facing. This talk covers designing microservices for enabling resiliency testing and the moving parts you need to consider when designing them from the get go, and along their lifetime. Yes, the services may all have RESTful calls in place, but those RESTful calls may not always be wrapped in circuit breakers. Yes, the services already have circuit breakers in place, but they may not always have fallbacks enabled, service owners may not know what those fallbacks do, or know how to execute that path confidently. The audience will come away from this talk with tips and tricks on how to design their microservices for resiliency tests, examples of poorly designed services, and how to ensure these pertinent design decisions are in place on a continuous basis. The audience will also leave with how to regularly test confidence in these design decisions through new chaos experimentation techniques.

Speaker: Nora Jones

Senior Chaos Engineer @Netflix

Nora is a Senior Chaos Engineer at Netflix. She is passionate about delivering high-quality software, improving processes, and promoting efficiency within architecture. Occasionally, she pokes holes in distributed systems to make them more resilient.

Find Nora Jones at

Similar Talks

Monitoring and Tracing @Netflix Streaming Data Infrastructure

Qcon

Architect & Engineer in Real Time Data Infrastructure Team @Netflix

Allen Wang

The System of Profound Knowledge

Qcon

VP, Production Engineering @packethost

Ben Rockwood

Incident Management in the Age of DevOps & SRE

Qcon

Co-Founder and Chief Product Officer @Rundeck

Damon Edwards

DevOps & Lean Thinking Panel

Qcon

Agile Methods Coach & Advocate for Woman in Tech

Cat Swetel