Presentation: Chaos Engineering on a Budget

Track: The Art of Chaos Engineering

Location: Ballroom BC

Day of week:

Slides: Download Slides

Level: Intermediate

Persona: Chaos/Resiliency/SRE

Abstract

As the systems that support internet-scale services grow larger and ever more complex, chaos engineering has emerged as industry best practice for ensuring system resiliency. Many companies maintain entire teams devoted to chaos testing their product. But what can you do if you don't have these kinds of resources to devote to the problem? How can you get started with chaos engineering without hiring an entire team of experts?

This is the story of implementing chaos testing on a small product, and how several small and targeted early investments in chaos engineering saved huge amounts of time and effort down the road.

Speaker: Heather Nakama

Software Engineer @Microsoft - Azure Search

Heather Nakama is a Senior Software Engineer at Azure Search, a managed search-as-a-service offering from Microsoft's Azure cloud service platform. She works on the backend infrastructure that deploys, monitors, elastically scales, and automatically heals clusters hosting customer services. Heather has a passion for building distributed systems that are scalable, fault-tolerant, and reliable.

Find Heather Nakama at