How quality is created, maintained and lost in complex software systems thumbnail

How quality is created, maintained and lost in complex software systems

The July 2024 CrowdStrike outage was one of the most significant software incidents in recent memory. In this talk, Jitesh Gosai uses the event as a case study to explore what happened, why it was so disruptive, and what it reveals about how quality is created, maintained and lost in complex sociotechnical systems.

Jitesh examines the incident from multiple perspectives, showing why traditional root cause analysis often fails to explain large-scale failures and how we can instead learn from these events to build resilience into our systems. He connects lessons from the outage to the broader practice of quality engineering, showing how studying real-world incidents can help teams build healthier systems and make quality a shared responsibility.

Resources


Comments

Gary Hawkes
Glad to have caught up with this talk as on the day I was attending a workshop. Excellent talk Jitesh! 👏

Sign in to comment
Explore MoT
Leading with AI - The London Edition image
Fri, 19 Jun
A half-day educational experience to navigate the world of AI
MoT Software Quality Engineering Certificate image
Boost your career in quality engineering with the MoT Software Quality Engineering Certificate.
This Week in Quality image
Debrief the week in Quality via a community radio show hosted by Simon Tomes and members of the community
Subscribe to our newsletter