Developing a Google SRE Culture
In many IT organizations, incentives are not aligned between developers, who strive for agility, and operators, who focus on stability. Site reliability engineering, or SRE, is how Google aligns incentives between development and operations and does mission-critical production support. Adoption of SRE cultural and technical practices can help improve collaboration between the business and IT. This course introduces key practices of Google SRE and the important role IT and business leaders play in the success of SRE organizational adoption.
What you'll learn
In many IT organizations, incentives are not aligned between developers, who strive for agility, and operators, who focus on stability. Site reliability engineering, or SRE, is how Google aligns incentives between development and operations and does mission-critical production support. Adoption of SRE cultural and technical practices can help improve collaboration between the business and IT. This course introduces key practices of Google SRE and the important role IT and business leaders play in the success of SRE organizational adoption.
Table of contents
- Module introduction 1m
- The value of SRE to your organization 2m
- Accept failure as normal with blameless postmortems 5m
- Blamelessness and Psychological Safety 8m
- Reduce organizational silos with SLOs and Error Budgets 7m
- Unify vision, foster collaboration and communications, and share knowledge 12m
- Module 3: Key Points, Reflection Activity, and Resources 0m