Computer Measurement Group
Site Reliability Engineering
Part 1 - Introduction
What is SRE? Is SRE different from DevOps? Are they competing standards?
Reliability vs Availability. How are they measured?
What are the essential tools of SRE?
What is toil? How to eliminate toil?
How does Automation fit into SRE?
Measuring vs Monitoring vs Alerting - How are they different?
Part 2 - Defining Critical Practices: The Elements of Reliable Service
Postmortem / Root Cause Analysis
Testing and Release Procedures
Part 3 - Types of SRE Implementations & CRE: SRE – Processes and Best Practices
Processes and Best Practices