You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
Service Level Objectives (SLOs) provide a framework for defining and measuring reliability from the user's perspective. Rather than chasing 100% uptime (which is impossible and infinitely expensive), SLOs set explicit reliability targets that balance user happiness with engineering velocity.
| Term | Definition | Example |
|---|---|---|
| SLI (Service Level Indicator) | A quantitative measure of service behaviour | 99.2% of requests complete in under 500ms |
| SLO (Service Level Objective) | A target value or range for an SLI | 99.5% of requests should complete in under 500ms |
| SLA (Service Level Agreement) | A contract with consequences for missing an SLO | "99.9% uptime or we refund 10% of your bill" |
| Error Budget | The allowed amount of unreliability (1 - SLO) | 0.5% of requests can fail over 30 days |
SLI (measurement) → SLO (target) → SLA (contract) → Error Budget (remaining allowance)
Good SLIs measure what users actually experience:
| SLI Type | Metric | Good Event |
|---|---|---|
| Availability | Successful requests / total requests | Non-5xx response |
| Latency | Requests faster than threshold / total requests | Response in < 500ms |
| Correctness | Correct responses / total responses | Response matches expected output |
| Throughput | Successful operations / time | Orders processed per minute |
| Freshness | Data updated within threshold / total data | Dashboard refreshed in < 60s |
| SLO Target | Monthly Error Budget | Monthly Downtime Equivalent |
|---|---|---|
| 99% | 1% | ~7.3 hours |
| 99.5% | 0.5% | ~3.6 hours |
| 99.9% | 0.1% | ~43.8 minutes |
| 99.95% | 0.05% | ~21.9 minutes |
| 99.99% | 0.01% | ~4.4 minutes |
Tip: "Nines" beyond 99.9% are exponentially more expensive to achieve. Each additional nine roughly costs 10x more.
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.