Operations: Observability and Troubleshooting

Health and readiness

  • GET /health reports backend liveness.
  • Endpoint status is available only when the periodic healthcheck jobs are running.
  • Quota alerts and quota history are available only when the periodic quota monitor jobs are running.
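
A liveness probe against GET /health can be sketched as below. This is a minimal illustration, not the product's own tooling: the function names, the base URL and the assumption that a healthy backend answers with HTTP 200 are all hypothetical. The fetch parameter exists only so the probe can be exercised without a network.

```python
import urllib.error
import urllib.request


def _http_status(url, timeout):
    """Perform a GET and return the HTTP status code."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.status


def check_liveness(base_url, timeout=5.0, fetch=_http_status):
    """Return True if GET <base_url>/health answers with HTTP 200.

    Assumption: 200 means "alive". Any other status, a connection
    error, or a timeout is treated as "not alive".
    """
    url = base_url.rstrip("/") + "/health"
    try:
        return fetch(url, timeout) == 200
    except (urllib.error.URLError, OSError):
        # HTTPError (4xx/5xx) is a subclass of URLError, so it lands here too.
        return False
```

A caller would run something like check_liveness("http://backend.example:8000") from a cron job or an external monitor. Note that this covers only liveness; endpoint status and quota data still depend on the scheduled jobs listed above.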

Logs

Collect backend logs centrally and include:

  • request path and status
  • selected context/endpoint when relevant
  • backend errors (including upstream storage denials)
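
One way to carry these fields is a JSON line per request, which most central log collectors can parse without custom rules. The sketch below uses only the Python standard library. The field names (path, status, context, endpoint, error) and the helper names are illustrative assumptions, not the backend's actual log schema.

```python
import json
import logging


def build_log_record(path, status, context=None, endpoint=None, error=None):
    """Assemble the fields listed above; optional ones are omitted when unset."""
    record = {"path": path, "status": status}
    if context is not None:
        record["context"] = context
    if endpoint is not None:
        record["endpoint"] = endpoint
    if error is not None:
        # For example an upstream storage denial such as "AccessDenied".
        record["error"] = error
    return record


def log_request(logger, path, status, **optional):
    """Emit one JSON line; entries that carry an error are logged at ERROR level."""
    record = build_log_record(path, status, **optional)
    level = logging.ERROR if "error" in record else logging.INFO
    logger.log(level, json.dumps(record, sort_keys=True))
    return record
```

For instance, log_request(logging.getLogger("backend"), "/api/buckets", 403, endpoint="primary", error="AccessDenied") writes a single ERROR line that can be filtered by path, status or error in the central log store.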

Frequent failure classes

  • AccessDenied: the storage backend denied the request because of a policy or permission rule.
  • Missing menu or page: feature flag or capability mismatch.
  • Stale metrics, billing, or quota history: scheduler or token misconfiguration.
  • Missing quota alert emails: SMTP misconfiguration, missing user opt-in, or a global watch mismatch.
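
The list above can be kept as a small lookup table for on-call tooling or runbook scripts. This is only a sketch of the idea: the dictionary keys and the function name are hypothetical, and the check descriptions simply restate the causes listed above.

```python
# Hypothetical keys; the values restate the causes listed in this section.
FIRST_CHECKS = {
    "access_denied": ["storage policy", "storage permissions"],
    "missing_menu_or_page": ["feature flags", "capabilities"],
    "stale_metrics_billing_quota_history": ["scheduler", "token configuration"],
    "missing_quota_alert_emails": ["SMTP configuration", "user opt-in", "global watch"],
}


def first_checks(failure_class):
    """Return the areas to inspect first, or an empty list for an unknown class."""
    return FIRST_CHECKS.get(failure_class, [])
```

For example, first_checks("missing_quota_alert_emails") returns the three areas to inspect before digging into the backend logs.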