Runbooks
Service Not Ready
docker compose ps
docker compose logs -f <service>
./scripts/dev/wait-for-services.sh
- Verify dependent endpoints (etcd, minio, storage-engine).
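When a dependent endpoint is slow to come up, a small retry helper can make the readiness check repeatable. The following is a minimal sketch, not the contents of scripts/dev/wait-for-services.sh; the function name `retry_until_ok` and the example URLs and ports in the comments are assumptions and should be adjusted to match the local compose file.

```shell
#!/usr/bin/env bash
# Hypothetical helper: retry_until_ok ATTEMPTS DELAY_SECONDS COMMAND [ARGS...]
# Runs COMMAND until it exits 0 or ATTEMPTS is exhausted.
retry_until_ok() {
  local attempts="$1" delay="$2"
  shift 2
  local i=1
  while [ "$i" -le "$attempts" ]; do
    if "$@"; then
      return 0
    fi
    echo "attempt ${i}/${attempts} failed: $*" >&2
    # Sleep only between attempts, not after the last one.
    if [ "$i" -lt "$attempts" ]; then
      sleep "$delay"
    fi
    i=$((i + 1))
  done
  return 1
}

# Example calls (assumed default ports, published on localhost):
#   retry_until_ok 30 2 curl -fsS http://localhost:2379/health            # etcd
#   retry_until_ok 30 2 curl -fsS http://localhost:9000/minio/health/live # MinIO
```

The helper takes the probe as an arbitrary command, so the same loop can wrap a `curl` health check, a `docker compose exec` probe, or any other check that signals readiness through its exit code.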
Gateway Command Failures
- Check gateway logs for command + auth context.
- Confirm command exists in gateway dispatch (handler/mod.rs).
- Check coordinator/storage-engine downstream health.
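The dispatch check above can be done with a quick text search. This is a rough sketch that assumes the command name appears literally, as a whole word, somewhere in the dispatch file; the helper name `command_in_dispatch` is hypothetical, and the default path is the one listed under Source of Truth.

```shell
#!/usr/bin/env bash
# Hypothetical helper: command_in_dispatch COMMAND [DISPATCH_FILE]
# Prints matching lines with line numbers; exits non-zero when nothing matches.
command_in_dispatch() {
  local command_name="$1"
  local dispatch_file="${2:-services/gateway/src/proxy/handler/mod.rs}"
  # -F: treat the name as a fixed string; -w: whole-word match; -n: show line numbers.
  grep -n -w -F -- "$command_name" "$dispatch_file"
}
```

Run from the repository root, for example `command_in_dispatch find`. No output and a non-zero exit status suggest the command is not wired into dispatch, although a textual miss is not proof if the dispatch is generated or split across files.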
Wrong Hot/Cold/Federated Result Behavior
- Verify collection policy command state.
- Re-run targeted matrix/E2E scenario.
- Compare with compatibility matrix expectations.
Backup/Restore Incident
- List backups and inspect backup status.
- Run restore in dry-run/staging first.
- Validate data correctness before cutting production traffic back over.
Pre-Release Checklist
make fmt
make lint
make test
make test-integration
make test-compatibility
make e2e-all-local
make mql-exhaustive
make mql-coverage-audit
make test-architecture
make check-guardrails
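The checklist can also be run as a single fail-fast loop. The sketch below assumes the targets are meant to run in the order listed and that the first failure should stop the run; `run_checklist` and the `MAKE_CMD` override are hypothetical names introduced for illustration.

```shell
#!/usr/bin/env bash
# Pre-release targets, in the order given in the checklist.
PRE_RELEASE_TARGETS="fmt lint test test-integration test-compatibility e2e-all-local mql-exhaustive mql-coverage-audit test-architecture check-guardrails"

# Hypothetical helper: runs each target and stops at the first failure.
# MAKE_CMD is an assumed override for dry runs (e.g. MAKE_CMD=echo).
run_checklist() {
  local make_cmd="${MAKE_CMD:-make}"
  local target
  for target in $PRE_RELEASE_TARGETS; do
    echo "==> make ${target}"
    if ! "$make_cmd" "$target"; then
      echo "FAILED: make ${target}" >&2
      return 1
    fi
  done
  echo "All pre-release checks passed."
}
```

Calling `run_checklist` from the repository root runs the full sequence; `MAKE_CMD=echo run_checklist` prints the targets without executing them.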
Source of Truth
services/gateway/src/proxy/handler/mod.rs
services/coordinator/internal/server/server.go
services/controller/internal/grpc/backup_service.go
scripts/dev/wait-for-services.sh