Performance Investigation Runbook

1. Identify Slow Operations

1
2
3
4
5
6
SELECT service_name, operation, avg(duration_ms) as avg_latency
FROM traces
WHERE _timestamp > now() - interval '1 hour'
GROUP BY service_name, operation
ORDER BY avg_latency DESC
LIMIT 20

2. Analyze Latency Distribution

3. Trace Slow Requests

4. Root Cause

5. Optimize