Presentation: Service Ownership @Slack
This presentation is now available to view on InfoQ.com
Watch video with transcriptAbstract
As recently as 2017, developers at Slack didn’t carry a pager. They deployed to production over a hundred times a day, and a centralized operations team took the calls in the night. Most pages were not very actionable because they weren’t set up by the dev teams that knew their systems best. Heros and knowledge islands saved day over and over. Post-incident postmortems were poorly attended and did not encourage learning.
Slowly, then quickly, all that changed. Slack moved to teams of empowered developers on-call, with embedded SREs, safer production deployments, and actionable alerts. Postmortems focus on learning, and meaningful analysis of incident patterns is done at all levels of the company.
In this talk you’ll hear all about the bumps and scrapes, triumphs and pitfalls of our journey from a centralized ops team to development teams that own the full lifecycle of their systems. It wasn’t easy, but it wasn’t impossible. Hopefully, it will inspire you to try something radically different at your company too.
Similar Talks
Stateful Programming Models in Serverless Functions
Principal Engineering Manager @Microsoft, helping lead the Azure Functions Team
Chris Gillum
The System of Profound Knowledge
VP, Production Engineering @packethost
Ben Rockwood
Incident Management in the Age of DevOps & SRE
Co-Founder and Chief Product Officer @Rundeck
Damon Edwards
User & Device Identity for Microservices @ Netflix Scale
Senior Software Engineer in Product Edge Access Services Team @Netflix
Satyajit Thadeshwar
Observability in the Development Process: Not Just for Ops Anymore
Cofounder @honeycombio
Christine Yen
Scaling Patterns for Netflix's Edge
Playback Edge Engineering @Netflix
Justin Ryan
Managing Failure Modes in Microservice Architectures
VP Cloud Architecture Strategy @AWSCloud & Microservices Pioneer
Adrian Cockcroft
Beyond Microservices: Streams, State and Scalability
Software Engineer @Confluent, PMC Member @Kafka, & Committer Apache Sqoop
Gwen Shapira
Controlled Chaos: Taming Organic, Federated Growth of Microservices
Co-founder and CEO @glasnostic