charles/claude-hooks

Fork

You've already forked claude-hooks

Code Issues 10 Pull requests Projects Releases Packages 1 Wiki Activity Actions

feat(monitor): mid-flight steering — inject user messages into a running agent #224

New issue

Closed

opened 2026-04-21 12:03:27 +00:00 by claude-desktop · 0 comments

claude-desktop commented

2026-04-21 12:03:27 +00:00

Collaborator

Copy link

User story

As an operator, I want to interrupt a running agent mid-turn and send it a redirect message ("no, don't touch sessions.ts, focus on db.ts") without killing the task and losing its context. Today the only recourse is /cancel + re-dispatch, which wastes the accumulated reasoning and costs a full re-prompt.

Acceptance criteria

SDK integration

Switch agent-runner.ts from one-shot query() to the streaming input variant of @anthropic-ai/claude-agent-sdk so new user messages can be pushed into a live conversation. Existing session-chain + resume semantics remain intact.
On every dispatch, keep a bounded in-memory queue of pending operator messages keyed by task_id. The SDK iterator drains from this queue each turn boundary.

Steer endpoint

POST /task/:id/steer with body { message: string }. Auth-gated (M18-8). Returns 202 when the message is queued; 409 Conflict if the task has settled; 413 Payload Too Large if the body exceeds a sane cap (suggest 8 KB).
The queued message is delivered at the next turn boundary of the in-flight SDK conversation — ideally as a regular user role message so the agent treats it as operator input, not as tool output.
An SSE steer_queued / steer_delivered envelope emits on /events so the Monitor UI can show the message appearing in the transcript with a distinct visual.

Interrupt-and-steer UI wiring

The task detail page (per the Penpot handoff) implements the "Interrupt + Steer" composer. Pressing Enter fires POST /task/:id/steer; the panel flips to a pending state until the service echoes steer_delivered.
The transcript renders operator-injected messages with a dedicated role color (suggest --color-role-operator, add to tokens if missing) so agent vs. operator turns are distinguishable.
Cancel remains a separate button with a confirm prompt — steer is additive, cancel is terminal.

Safety rails

Rate-limit steer to N messages per minute per task (suggest 6 — one every ~10 s). Further submissions return 429 Too Many Requests. Prevents a runaway loop of chained steers from overloading the SDK.
Queue is capped at 10 messages. Overflow drops the oldest and logs a warning — operator should see a toast.
Steer is refused on the foreman agent — the foreman already has a full chat surface via /foreman/chat; steering would be a confusing second channel. Return 400 with a hint pointing at the Planner page.

Verification

Unit test in apps/server/src/agent-runner.test.ts — mocked SDK with streaming input, verify a posted steer message lands in the next iterator yield.
Integration test: dispatch a long-running task (mocked SDK), POST /steer, assert the transcript receives the injected user message and the agent's next assistant turn references it.
Manual: dispatch a dev task on a dummy issue, steer it mid-run with "also update CLAUDE.md", confirm the resulting PR includes the CLAUDE.md edit.

Dependencies

Blocked by the Penpot mockup ticket (task-detail redesign). The implementer should ship against the agreed-upon visual and copy.

Out of scope

Multi-party chat (agents responding to each other). This is strictly operator-to-single-agent.
Persisted steering history — the queue is in-memory; the transcript captures what was delivered.
Retroactively steering an already-settled task — for that, use re-dispatch (companion ticket) and re-prompt fresh.

References

@anthropic-ai/claude-agent-sdk — streaming input mode.
apps/server/src/agent-runner.ts — current one-shot query invocation.
apps/server/src/worker.ts — task lifecycle + abort signal wiring.
packages/shared/src/sse.ts — SSE envelope shapes (add steer_queued / steer_delivered).
Companion tickets: Penpot mockup (blocker), re-dispatch button (sibling — different feature but same surface area).

## User story As an operator, I want to **interrupt a running agent mid-turn and send it a redirect message** ("no, don't touch sessions.ts, focus on db.ts") without killing the task and losing its context. Today the only recourse is `/cancel` + re-dispatch, which wastes the accumulated reasoning and costs a full re-prompt. ## Acceptance criteria ### SDK integration - [ ] Switch `agent-runner.ts` from one-shot `query()` to the **streaming input** variant of `@anthropic-ai/claude-agent-sdk` so new user messages can be pushed into a live conversation. Existing session-chain + resume semantics remain intact. - [ ] On every dispatch, keep a bounded in-memory queue of pending operator messages keyed by `task_id`. The SDK iterator drains from this queue each turn boundary. ### Steer endpoint - [ ] `POST /task/:id/steer` with body `{ message: string }`. Auth-gated (M18-8). Returns `202` when the message is queued; `409 Conflict` if the task has settled; `413 Payload Too Large` if the body exceeds a sane cap (suggest 8 KB). - [ ] The queued message is delivered at the next turn boundary of the in-flight SDK conversation — ideally as a regular `user` role message so the agent treats it as operator input, not as tool output. - [ ] An SSE `steer_queued` / `steer_delivered` envelope emits on `/events` so the Monitor UI can show the message appearing in the transcript with a distinct visual. ### Interrupt-and-steer UI wiring - [ ] The task detail page (per the Penpot handoff) implements the "Interrupt + Steer" composer. Pressing Enter fires `POST /task/:id/steer`; the panel flips to a pending state until the service echoes `steer_delivered`. - [ ] The transcript renders operator-injected messages with a dedicated role color (suggest `--color-role-operator`, add to tokens if missing) so agent vs. operator turns are distinguishable. - [ ] Cancel remains a separate button with a confirm prompt — steer is additive, cancel is terminal. ### Safety rails - [ ] Rate-limit steer to N messages per minute per task (suggest 6 — one every ~10 s). Further submissions return `429 Too Many Requests`. Prevents a runaway loop of chained steers from overloading the SDK. - [ ] Queue is capped at 10 messages. Overflow drops the oldest and logs a warning — operator should see a toast. - [ ] Steer is refused on the **foreman** agent — the foreman already has a full chat surface via `/foreman/chat`; steering would be a confusing second channel. Return `400` with a hint pointing at the Planner page. ### Verification - [ ] Unit test in `apps/server/src/agent-runner.test.ts` — mocked SDK with streaming input, verify a posted steer message lands in the next iterator yield. - [ ] Integration test: dispatch a long-running task (mocked SDK), POST /steer, assert the transcript receives the injected `user` message and the agent's next `assistant` turn references it. - [ ] Manual: dispatch a dev task on a dummy issue, steer it mid-run with "also update CLAUDE.md", confirm the resulting PR includes the CLAUDE.md edit. ## Dependencies - **Blocked by the Penpot mockup ticket** (task-detail redesign). The implementer should ship against the agreed-upon visual and copy. ## Out of scope - Multi-party chat (agents responding to each other). This is strictly operator-to-single-agent. - Persisted steering history — the queue is in-memory; the transcript captures what was delivered. - Retroactively steering an already-settled task — for that, use re-dispatch (companion ticket) and re-prompt fresh. ## References - `@anthropic-ai/claude-agent-sdk` — streaming input mode. - `apps/server/src/agent-runner.ts` — current one-shot query invocation. - `apps/server/src/worker.ts` — task lifecycle + abort signal wiring. - `packages/shared/src/sse.ts` — SSE envelope shapes (add `steer_queued` / `steer_delivered`). - Companion tickets: Penpot mockup (blocker), re-dispatch button (sibling — different feature but same surface area).

claude-desktop added the

area:dashboard

type:user-story

labels

2026-04-21 12:03:40 +00:00

code-lead was assigned by claude-desktop

2026-04-21 12:03:42 +00:00

code-lead added a new dependency

2026-04-21 12:03:44 +00:00

#223 design(monitor): Penpot mockup — task detail page redesign with mid-flight steering

code-lead referenced this issue from a commit

2026-04-21 12:24:58 +00:00

feat(monitor): mid-flight steering — inject user messages into a running agent (#224)

code-lead referenced this issue from a pull request that will close it,

2026-04-21 12:25:18 +00:00

feat(monitor): mid-flight steering — inject user messages into a running agent #227

claude-desktop referenced this issue

2026-04-21 12:28:36 +00:00

chore(web): fit the dashboard to 100vh on md+ screens — scroll per component, not globally #228

code-lead referenced this issue from a commit

2026-04-21 12:42:19 +00:00

feat(monitor): mid-flight steering — inject user messages into a running agent (#224)

claude-desktop referenced this issue

2026-04-21 12:48:55 +00:00

feat(agents): token economy — caveman mode, prompt caching, model tiering, cost cap as last resort #231

code-lead referenced this issue from a commit

2026-04-21 12:49:42 +00:00

feat(monitor): mid-flight steering — inject user messages into a running agent (#224)

code-lead referenced this issue from a commit

2026-04-21 13:29:32 +00:00

feat(monitor): mid-flight steering — inject user messages into a running agent (#227)

code-lead removed a dependency

2026-04-21 15:10:07 +00:00

#223 design(monitor): Penpot mockup — task detail page redesign with mid-flight steering

claude-desktop closed this issue

2026-04-21 15:10:12 +00:00

claude-desktop referenced this issue

2026-04-21 21:52:38 +00:00

feat(janitor): periodic reconciler — detect stuck issues/PRs/tasks and self-heal #269

No Branch/Tag specified

main

chore/sync-pre-push-from-forge-base

fix/flows-yaml-dispatch-identity

feat/board-tap-to-assign

dev/1107

code-lead/1106

code-lead/1108

dev/1104

code-lead/1103

code-lead/1080

dev/1087

feat/flows-yaml-ci-events

chore/board-drop-stalled-and-density-controls

fix/flows-yaml-routes-always-register

flows-yaml/api-defaults

dev/1023

fix/event-log-history-bleed

fix/janitor-fix-ci-logs-and-cap

dev/1022

fix/board-card-provider

code-lead/1036

dev/1025

code-lead/1020

dev/1017

code-lead/1026

feat/web-shortcut-registry-1018

dev/1015

code-lead/1009

code-lead/1008

dev/975

dev/969

dev/973

dev/967

code-lead/968

code-lead/953

dev/970

dev/976

code-lead/966

code-lead/956

code-lead/951

dev/962

dev/963

dev/977

dev/955

dev/983

dev/961

dev/974

code-lead/950

code-lead/939

dev/941

dev/940

dev/937

dev/938

dev/936

dev/935

feat/web-i18n-fr-locale

feat/spec-editor-ui-polish

chore/drop-legacy-compat

fix/skills-drop-preview-pane

fix/882-skills-safety-rail

dev/911

dev/909

dev/923

dev/917

dev/915

feat/879-sr11-m2-drop-legacy-skill

code-lead/873

dev/881

code-lead/869

dev/867

code-lead/845

code-lead/843

code-lead/844

dev/837

dev/861

dev/849

code-lead/837

code-lead/842

fix/dedup-rebase-inflight

dev/838

code-lead/847

dev/833

code-lead/848

pr/838

code-lead/841

feat/settings-save-bar/836

code-lead/840

dev/846

code-lead/839

dev/832

fix/board-sse-stale-cache

dev/834

dev/835

feat/settings-breadcrumbs

feat/forge-oauth-credentials

refactor/service-config-consolidation

feat/agent-tokens-to-secrets

feat/gitlab-oauth-to-db

feat/authelia-rip-and-voice-fixes

fix/rebase-storm-and-dead-letter

code-lead/797

code-lead/796

dev/811

code-lead/798

dev/810

code-lead/795

dev/808

code-lead/794

dev/805

dev/802

dev/803

feat/avatar-menu-settings-entry

feat/per-agent-token-tracking

dev/793

dev/747

dev/752

code-lead/790

code-lead/759

dev/756

dev/760

dev/741

dev/767

dev/740

dev/709

dev/644

dev/637

boss/614

dev/600

dev/611

dev/585

fix/login-bonus-fixes

boss/544

dev/542

refactor/api-prefix-and-session-gate

dev/489

boss/531

boss/518

dev/499

boss/516

dev/530

dev/517

dev/519

dev/515

dev/522

dev/503

dev/471

boss/329

dev/417

dev/418

dev/402

boss/327

dev/334

dev/332

boss/326

boss/325

dev/331

boss/324

boss/323

boss/322

dev/294

test/s11-task-analytics

dev/262

boss/270

dev/268

foreman/ui-consolidation-spec

dev/234

boss/196

boss/176

boss/164

fix/124-session-persist-bind

boss/52

dev/87

boss/73

dev/77

dev/81

dev/82

boss/79

dev/42

dev/35

boss/7

No results found.

Labels

Clear labels

area:agents

Agent types, pool scheduling, per-instance config

area:dashboard

Dashboard UI and observability surfaces

area:database

DB layer — schema, migrations, ORM, raw SQL

area:design

UI/UX mockup work — routes to designer agent

area:design-review

Design review dispatch — routes to design-reviewer agent

area:flows

Flow runner — YAML loader, executor, op registry, expression eval

area:infra

Deployment, isolation, containers, systemd units

area:meta

Tracking, scaffolding, project setup

area:security

Security — routes to reviewer-security (opus)

area:sessions

Session-id store, Claude SDK resume logic

area:webhook

Forgejo webhook routing and handlers

area:workdir

Clone cache, worktrees, git identity

security

Security-sensitive issue

Tracking or decisions, not implementation work

No labels

Milestone

Clear milestone

No items

No milestone

Projects

Clear projects

No items

No project

Assignees

Clear assignees

No assignees

code-lead

1 participant

Notifications

Due date

The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference

charles/claude-hooks#224

Reference in a new issue

Repository

charles/claude-hooks

Title

Body

No description provided.

Delete branch "%!s()"

Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?

Rows
Columns