Monitor and nudge stalled agents

witness-observer · skill · setup · L3 · ★64

What it does

Monitors polecat (worker agent) health, detects stalls, sends nudges to stalled agents, and escalates to the mayor once the nudge count threshold is reached.

Best for

Passive observation of worker agent health where escalation to a supervisor (mayor) is needed but the observer never modifies work or resolves conflicts.

Inputs

· rig name (agent pool to monitor)
· patrol interval (seconds between health checks)
· stall threshold (seconds of inactivity)
· nudge count threshold (when to escalate)

Outputs

· nudge messages to stalled agents
· stall escalations to mayor (durable mail)
· health report summaries (RigHealthReport)

Requires

· StateManager for agent list and hook queries
· AgentMessage mail protocol
· timestamp comparison (stall detection)
· durable file storage for escalations
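
The timestamp comparison used for stall detection can be sketched as follows. The function name `find_stalled` and the plain dict of timestamps are assumptions; in the real skill the timestamps would come from StateManager hook queries, whose API is not shown here.

```python
from datetime import datetime, timedelta, timezone


def find_stalled(last_hook_update, now, stall_threshold_s):
    """Return the agents whose last hook update is older than the threshold.

    last_hook_update: mapping of agent name -> timezone-aware datetime
    (a hypothetical stand-in for whatever StateManager returns).
    """
    limit = timedelta(seconds=stall_threshold_s)
    return sorted(
        name for name, stamp in last_hook_update.items() if now - stamp > limit
    )


now = datetime(2025, 1, 1, 12, 0, tzinfo=timezone.utc)
hooks = {
    "polecat-1": now - timedelta(seconds=30),    # recently active
    "polecat-2": now - timedelta(seconds=1200),  # idle for 20 minutes
}
stalled = find_stalled(hooks, now, stall_threshold_s=900)
```

With a 900-second threshold only `polecat-2` is reported. Passing `now` in explicitly, rather than reading the clock inside the function, makes the check easy to test.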

Preconditions

· StateManager initialized
· Multiple polecats active in rig
· Hook timestamps updated regularly by agents
· Mayor listening for escalation mail

Failure modes

· Hook timestamps not updated: false-positive stalls (the observer nudges anyway)
· Agent responds to a nudge but doesn't reset its timestamp: it is counted as stalled again
· Mayor unreachable: escalation mail queues but is never processed
· Patrol interval too short: observer becomes bottleneck
· Patrol interval too long: stalls detected late
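
The stale-timestamp failure modes above follow from how nudges are counted. The sketch below shows one possible patrol pass; `PatrolState`, `patrol_once`, and the rule "escalate once the nudge limit is reached" are assumptions, and the observer only updates its own bookkeeping, never the agents.

```python
from dataclasses import dataclass, field


@dataclass
class PatrolState:
    """Observer-side bookkeeping only; agents themselves are never modified."""

    nudge_counts: dict = field(default_factory=dict)


def patrol_once(idle_seconds, state, stall_threshold_s, nudge_limit):
    """Run one patrol pass.

    idle_seconds: mapping of agent name -> seconds since its last hook update.
    Returns (agents to nudge, agents to escalate to the mayor).
    """
    nudges, escalations = [], []
    for agent, idle in sorted(idle_seconds.items()):
        if idle <= stall_threshold_s:
            state.nudge_counts.pop(agent, None)  # fresh timestamp resets the count
            continue
        sent = state.nudge_counts.get(agent, 0)
        if sent >= nudge_limit:
            escalations.append(agent)            # nudges exhausted: tell the mayor
        else:
            state.nudge_counts[agent] = sent + 1
            nudges.append(agent)
    return nudges, escalations


state = PatrolState()
# The agent's timestamp never changes, so every pass sees it as stalled.
first = patrol_once({"polecat-2": 1000}, state, stall_threshold_s=900, nudge_limit=2)
second = patrol_once({"polecat-2": 1300}, state, stall_threshold_s=900, nudge_limit=2)
third = patrol_once({"polecat-2": 1600}, state, stall_threshold_s=900, nudge_limit=2)
```

Here the first two passes nudge and the third escalates, even if the agent actually resumed work after the first nudge. The count only resets when the observer sees a fresh hook timestamp.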

Trust signals

· Strictly read-only: observer never modifies agent work or state
· Graduated escalation: warning→alert→critical with configurable thresholds
· Patrol loop design avoids tight polling (default 5-minute interval)
· Health report aggregation provides rig-wide visibility
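
Graduated escalation and rig-wide aggregation might look like the sketch below. It assumes that severity is derived from the number of nudges already sent and that a report is a simple count per level. The thresholds, `classify`, and `summarize` are hypothetical, and the dict returned here is not the actual shape of RigHealthReport.

```python
from collections import Counter
from enum import Enum


class Severity(Enum):
    OK = "ok"
    WARNING = "warning"
    ALERT = "alert"
    CRITICAL = "critical"


def classify(nudges_sent, warning_at=1, alert_at=2, critical_at=3):
    """Map a nudge count to a severity level using configurable thresholds."""
    if nudges_sent >= critical_at:
        return Severity.CRITICAL
    if nudges_sent >= alert_at:
        return Severity.ALERT
    if nudges_sent >= warning_at:
        return Severity.WARNING
    return Severity.OK


def summarize(nudge_counts):
    """Aggregate per-agent nudge counts into a count of agents per severity."""
    tally = Counter(classify(count).value for count in nudge_counts.values())
    return {level.value: tally.get(level.value, 0) for level in Severity}


report = summarize({"polecat-1": 0, "polecat-2": 1, "polecat-3": 5})
```

With the assumed thresholds, this report counts one agent each at ok, warning, and critical, and none at alert.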