The SRE Manifesto
Site Reliability Engineering Practices
Table of Contents
Practice Code | Practice Name | Proposed by | Link |
---|---|---|---|
AUT100 | Reduce Operational Toil | Rod Anami | Doc |
DSC100 | Time-series Data Analysis through Percentiles | Rod Anami | Doc |
DOE100 | Infrastructure Provisioning through Code | Rod Anami | Doc |
OBS100 | Observability Golden Signals | Rod Anami | Doc |
STH100 | Solution Full Stack Analysis for Reliability | Rod Anami | Doc |
Practice Areas
Area | Area Description |
---|---|
Automation | Good practices for automating operational and engineering work. |
Data Science | Practices for analyzing MELT (metrics, events, logs, traces) data and applying mathematical models and statistical methods. |
DevOps | Practices that overlap with the DevOps framework. |
Observability | Good practices for improving monitoring and visibility into a system's internal states. |
Systems Thinking | Good practices for a systemic approach to reliability. |