Jun

03

2022

SRE Incident Response

huayting 3 Jun 2022 05:01 LEARNING » e-learning - Tutorial

SRE Incident Response
SRE Incident Response
Instructors: Emil Stolarsky, Jaime Woo
November 2021 | Duration: 2h 1m
Video: MP4 1920x1080 48 KHz | English
Size: 613 MB

Incidents are costly. When your system goes down, you must work quickly, efficiently, and effectively to get things back up. The gold standard process is the incident management system (IMS), developed by American firefighters in the 1970s. IMS is now used by militaries, emergency personnel, and—in the domain of site reliability engineering (SRE)—companies like Google. Responding efficiently and effectively can make the difference between meeting your service-level objectives (SLOs) and blowing right past them—which is why effective incident response is a core pillar of SRE.

Just as important are the preparation done beforehand and the analysis that occurs afterward. During nonincident times, organizations should be safely testing how services may fail (such as with game days), planning who responds when things break, and crafting playbooks for common actions and responses. Postincident, measuring and evaluating incident response is crucial to determine what works and what doesn't.

Incident Labs' Emil Stolarsky and Jaime Woo show you how to create a successful incident response strategy, from preparation and training to running IMS during the incident to evaluating the response and sharing lessons learned throughout your organization. Our services will never be perfect, and they'll all break eventually. What makes us SREs is how we prepare for those days when things break, how we respond, and what we learn.

What you'll learn and how you can apply it

By the end of this live online course, you'll understand

The importance of a centralized command structure for incident response
The value of preparation for incident response through training
Why resilience engineering principles are key to successful postincident response

And you'll be able to

Run IMS within your company's incident response strategy
Prepare for incidents through the use of game days and chaos engineering
Run effective postincident review meetings and share those lessons company wide
This recording of a live event is for you because.
You're an operator or SRE who wants to better respond to incidents.
You work within a company that does incident response well but could be better.
You want to become a leader inside your company in helping teams learn from incidents.
Prerequisites
An understanding of core SRE principles (as covered by either of the recommended resources below)


https://www.oreilly.com/live-events/sre-incident-response/0636920390770/0636920399414/



PLEASE SUPPORT ME BY CLICK ONE OF MY LINKS IF YOU WANT BUYING OR EXTENDING YOUR ACCOUNT
https://nitro.download/view/C04CCE450B7022E/SRE_INCIDENT_RESPONSE.rar

https://rapidgator.net/file/76b851d19e85a191dd9c6bcedaeb8745/SRE_INCIDENT_RESPONSE.rar.html

https://uploadgig.com/file/download/bc0e920d0c0979da/SRE_INCIDENT_RESPONSE.rar

High Speed Download

Add Comment

  • People and smileys emojis
    Animals and nature emojis
    Food and drinks emojis
    Activities emojis
    Travelling and places emojis
    Objects emojis
    Symbols emojis
    Flags emojis