The OpenTelemetry Hero’s Journey: Correlating Application & Infrastructure Context
August 23, 2022Why Observability is THE vehicle to manage resources and spend strategically… and to ensure you have the keys to switch to efficient data-driven decisions making.
August 24, 2022During incident response it is critical that responders communicate and update shared mental models. Mutual understandings and assumptions must be constantly updated to maintain effective coordination. This is known as Common Ground. It consists of all efforts required to manage coordination for the incident. This work is secondary to the incident itself: it is a necessary binding agent and catalyst to help effective teams build reciprocity and predict each other’s actions during mitigation activities. In this lightning talk we’ll cover how Common Ground is established, the responsibilities of Incident Command to maintain Common Grounding during incidents, and ultimately how Observability is a crucial piece of sustaining Common Ground.
About The Speaker
Matt Davis
SRE Advocate
Blameless
Just as at home with analog synthesizer electronics as with Infrastructure as Code, I find joy in operating inherently chaotic complex systems. My expertise brings to bear a variegated background including data-center operations, storage hardware and distributed databases, IT security, site reliability, support services, observability systems, and techops leadership.
With degrees in music performance and composition, I have a passion for exploring the relationships between the artistic mind and operating distributed software architectures. This is reflected in my subjects for technology talks and musical output, both of which seek out diverse ways to introspect and learn from our adaptive universe.
My experiences have led me to talk about reliability and resilience at QCon, SRECon, Music City Tech, Disney JETA, RE:Deploy, and close to home at Southern California Linux Expo (SCaLE).
See the Session and Resources
To view the video you must have a CMG membership or have a virtual conference registration. Sign up today!
- [Blog] A Guide to Understanding Observability & Monitoring in SRE Practices
- [Webinar] Incident Readiness and Observability
- [Video] SRE: From Theory to Practice: What’s difficult about problem detection?
Observability, Incident Response, and Common Ground that Binds
For existing members sign in here.