Skip to content

[Backend] Add Keeper Health Degradation States Beyond Simple Up or Down #241

@ayomideadeniran

Description

@ayomideadeniran

Contributor Focus: [Operational Clarity] Make backend health reporting more informative than a binary status
ETA: 2 days

Context

Real backend health often degrades gradually rather than failing all at once.

Problem

A simple healthy or unhealthy signal hides useful information about partial outages or slow performance.

Task Breakdown

  1. Define meaningful degraded states such as stale polling, partial RPC failure, or retry backlog pressure.
  2. Extend health endpoints to expose these states.
  3. Keep status outputs machine-readable and human-readable.
  4. Document recommended operator interpretation for each state.
  5. Add tests covering state transitions.

Acceptance Criteria

  • Health output distinguishes degraded modes from total failure.
  • Operators can identify the nature of the problem faster.
  • Status transitions are predictable and documented.
  • Existing health behavior is not broken for simple consumers.

Metadata

Metadata

Assignees

Labels

Stellar WaveIssues in the Stellar wave programbackendGenerated from backend issue tags: backendcomplexGenerated from frontend issue tags: complexhealthchecksGenerated from backend issue tags: healthcheckskeeperGenerated from backend issue tags: keeperobservabilityGenerated from frontend issue tags: observability

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions