B15 — Watchdog dashboard tile (operator visibility for B10/B12/B14) #431
As an operator,
I want a single panel listing PRs the orchestrator considers "needs human attention" (B10's dead-letter, B12's stash branches, B14's metric spikes),
so that I can see at a glance what to fix without grepping logs.
The hardening features (B10/B12/B14) are silent by themselves — they auto-recover but the operator has no way to see what's been recovered or what's stuck. This story surfaces it.
Acceptance criteria
Backend
GET /watchdog/status endpoint returning:

    {
      "dead_letter_prs": [
        { "number": 420, "repo": "charles/claude-hooks", "reason": "B10:silent_failure_count>=3", "since_ts": "..." }
      ],
      "worktree_recoveries": [
        { "path": "...", "stash_branch": "worktree-recovery/abc1234", "ts": "..." }
      ],
      "session_resume_failures_24h": 12,
      "escalations_today": 3
    }

Frontend
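As a rough illustration of how the tile could reduce the /watchdog/status payload to a single state, here is a minimal sketch. It assumes (the story does not say this) that any dead-letter PR or worktree recovery counts as "needs human attention" and that the green message is the empty state. The function name `tile_state` and the `resume_failure_threshold` parameter are hypothetical.

```python
from typing import Any, Optional


def tile_state(status: dict[str, Any],
               resume_failure_threshold: Optional[int] = None) -> str:
    """Return "ok" when nothing needs a human, else "attention".

    Assumption: any dead-letter PR or worktree recovery needs attention.
    The threshold is hypothetical; when it is None, the 24 h session
    resume failure count is ignored.
    """
    if status.get("dead_letter_prs"):
        return "attention"
    if status.get("worktree_recoveries"):
        return "attention"
    failures = status.get("session_resume_failures_24h", 0)
    if resume_failure_threshold is not None and failures >= resume_failure_threshold:
        return "attention"
    return "ok"


# Payload shaped like the example in the acceptance criteria, with empty lists.
example = {
    "dead_letter_prs": [],
    "worktree_recoveries": [],
    "session_resume_failures_24h": 12,
    "escalations_today": 3,
}
print(tile_state(example))      # ok
print(tile_state(example, 10))  # attention
```

Whether the metric counters should ever flip the tile out of the green state is a design decision for the implementer; the sketch leaves it off by default.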
/agents (or /planner/board toolbar, operator's choice during impl).
🟢 Orchestrator self-healed everything with a link to last 24 h history.

Operator actions
Retry (re-dispatch with fresh counter) and Snooze 24h (suppress watchdog for that PR for 24 h).
Inspect (shows the diff in a drawer) and Drop (force-deletes the branch).

Tests
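The Retry and Snooze 24h semantics described under Operator actions could be exercised along these lines. This is only a sketch: `WatchdogState` is a hypothetical in-memory stand-in for whatever store the orchestrator actually uses, and it assumes "fresh counter" means resetting the failure count to zero.

```python
from datetime import datetime, timedelta, timezone

SNOOZE_WINDOW = timedelta(hours=24)


class WatchdogState:
    """Hypothetical in-memory store for per-PR watchdog bookkeeping."""

    def __init__(self) -> None:
        self.failure_counts: dict[int, int] = {}
        self.snoozed_until: dict[int, datetime] = {}

    def retry(self, pr: int) -> None:
        # "Fresh counter": reset the failure count before re-dispatching.
        self.failure_counts[pr] = 0

    def snooze(self, pr: int, now: datetime) -> None:
        # Suppress the watchdog for this PR until the window elapses.
        self.snoozed_until[pr] = now + SNOOZE_WINDOW

    def is_suppressed(self, pr: int, now: datetime) -> bool:
        until = self.snoozed_until.get(pr)
        return until is not None and now < until


# Arbitrary fixed timestamp so the checks are deterministic.
now = datetime(2024, 1, 1, tzinfo=timezone.utc)
state = WatchdogState()
state.failure_counts[420] = 3
state.retry(420)
state.snooze(420, now)

assert state.failure_counts[420] == 0
assert state.is_suppressed(420, now + timedelta(hours=23))
assert not state.is_suppressed(420, now + timedelta(hours=24))
```

Passing `now` explicitly keeps the 24 h boundary testable without patching the clock.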
/watchdog/retry/<pr>.

Out of scope
References
docs/specs/automation-hardening.md §4 B15.
flow:dead-letter.
worktree-recovery/<sha>.
claude_session_resume_failures_total.

Dependencies