How to choose database transaction boundaries in backend APIs so writes stay atomic without holding locks across external calls, slow work, retries, or post-commit side effects.
How retry budgets keep microservice retries useful without letting clients amplify overload, including per-request limits, client retry ratios, token buckets, retry metadata, and production metrics.
How correlation IDs in microservices connect logs, traces, queues, and background jobs across service boundaries without pretending to replace real tracing or metrics.
Observability vs logging in production, with a practical guide to when logs, metrics, traces, and correlation IDs answer different debugging questions.
A practical OpenTelemetry guide for backend engineers: what to instrument first, how traces, metrics, logs, context propagation, attributes, sampling, and collectors make production debugging clearer.