Back to Blog

Tag

#reliability

4 articles
Article preview
Real Domains December 9, 2024 5 min read

SRE for Small Teams — What Actually Pays Back

Google's SRE book is 500 pages long and targets 100-engineer orgs. For a 10-person team, the pragmatic subset that delivers most of the benefit at a fraction of the cost.

Article preview
Core Patterns December 11, 2023 5 min read

Dead Letter Queues — Handling the Unhandleable

What DLQs are, why you must have one for every message consumer, and the operational patterns that keep bad messages from blocking the good ones.

Article preview
Core Patterns June 21, 2023 5 min read

Transactional Outbox — The Pattern, End to End

Why "save to DB, then publish to Kafka" is almost always wrong, and the outbox pattern that fixes it — with real Java code, schema, and production considerations.