Question 1

What is the difference between SRE and DevOps?

Accepted Answer

DevOps is a culture and set of practices that emphasizes collaboration between development and operations. SRE is a specific implementation of DevOps principles, originated at Google, with concrete practices like SLOs, error budgets, and toil reduction. You can think of SRE as "class SRE implements DevOps."

Question 2

What is an error budget?

Accepted Answer

The error budget is the complement of your SLO: the amount of unreliability the SLO allows. If your SLO is 99.9% availability, your error budget is 0.1%, or about 43 minutes of downtime in a 30-day month. The team can "spend" this budget on risky deployments or experiments. When it is exhausted, the focus shifts from shipping changes to improving reliability.
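
As a rough illustration of the arithmetic, the sketch below computes the downtime allowed by an availability SLO and how much of it is left after some downtime has occurred. The function names and the 30-day window are assumptions chosen for this example, not part of any standard tool.

```python
def error_budget_minutes(slo_percent: float, window_days: int = 30) -> float:
    """Return the allowed downtime, in minutes, for an availability SLO.

    The 30-day default window is an assumption; use whatever window
    your SLO is actually defined over.
    """
    if not 0 < slo_percent <= 100:
        raise ValueError("slo_percent must be in the range (0, 100]")
    window_minutes = window_days * 24 * 60
    # The budget is the share of the window the SLO does NOT cover.
    return window_minutes * (100 - slo_percent) / 100


def budget_remaining(slo_percent: float, downtime_minutes: float,
                     window_days: int = 30) -> float:
    """Return the unspent budget in minutes; negative means overspent."""
    return error_budget_minutes(slo_percent, window_days) - downtime_minutes


if __name__ == "__main__":
    print(f"Budget for 99.9%: {error_budget_minutes(99.9):.1f} min")
    print(f"Left after 30 min of downtime: {budget_remaining(99.9, 30.0):.1f} min")
```

A negative result from budget_remaining would be the signal, in this simplified model, that the budget is exhausted and the focus should shift to reliability.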

Question 3

What is toil in SRE?

Accepted Answer

Toil is manual, repetitive, automatable work that scales linearly with system size, such as restarting services, manually provisioning accounts, or running routine maintenance scripts by hand. SRE aims to keep toil below 50% of an engineer's time, leaving the rest for engineering work, such as automation, that reduces future toil.

Site Reliability Engineering Explained
