charles/claude-hooks

Fork

You've already forked claude-hooks

Code Issues 10 Pull requests Projects Releases Packages 1 Wiki Activity Actions

agents: synthesize shell_output_delta for claude-code via container log stream #957

New issue

Closed

opened 2026-05-08 12:06:30 +00:00 by claude-desktop · 6 comments

claude-desktop commented

2026-05-08 12:06:30 +00:00

Collaborator

Copy link

User story

As an operator watching a long-running shell command (e.g. bun test, just qa) inside a claude-code agent, I want to see live stdout/stderr just like I would for cursor's ShellToolCall, so the dashboard doesn't pretend the agent is idle while a 5-minute test suite is actually running.

Context

The Claude Code SDK does not expose stdout deltas for Bash invocations — the dashboard sees one event at start and one at completion. Cursor's SDK exposes ShellOutputDeltaUpdate for the same situation. We own the docker container the agent runs in, so we can synthesize equivalent delta events for claude-code by tailing the container's process tree or piping the shell wrapper's output through a sidecar.

This is invasive and should land after the cursor delta + tool-kind-taxonomy work — render path needs to exist first.

Acceptance criteria

Approach (decide during design phase, document the choice in the PR)

Option A — bash wrapper. The PreToolUse hook intercepts every Bash invocation, replaces the command with bash -c '<cmd>' 2> >(tee /dev/stderr | logger -t claude-hooks-shell-${call_id}) | tee /dev/stdout | logger -t claude-hooks-shell-${call_id} (or simpler: redirect to a per-call FIFO). Server tails journalctl --user -t claude-hooks-shell-${call_id} (or the FIFO) and emits shell_output_delta events with the right callId.
Option B — container shim binary. Replace the in-container shell wrapper with a small Bun script that streams stdout/stderr lines to a unix socket on the host, indexed by call_id. Server reads the socket.
Option C — docker exec sidecar. Per-task sidecar container docker exec -it claude-hooks-${agent} sh -c 'tail -f /tmp/shell-${call_id}.log', tailed by the runner.

The PR proposes one, motivates the choice on simplicity / robustness / hook compatibility, and benchmarks at least one realistic workload (e.g. bun test --watch for 30 s).

Wire format

Reuses the same ShellOutputDeltaEvent shape from the cursor delta-streaming issue. UI doesn't know which provider produced it.

Edge cases

Lines longer than the safe buffer size are split — UI re-stitches by call_id.
Tool calls that exit before any stdout is captured emit a single zero-byte synthetic delta + the existing completion event so the widget renders an empty pane, not a perpetually-loading spinner.
Aborted tasks must close the stream cleanly — no zombie tailers per cancelled call_id.

Tests

Synthetic shell command emitting 1k lines, asserting all lines arrive in order with the right call_id.
Abort test: cancel mid-stream, assert tailer goroutine/process exits.
Multi-task isolation: two tasks running concurrent shells must not cross-contaminate streams.

Out of scope

Streaming structured tool inputs (Edit, Write, etc.) for claude-code. Only Bash benefits from live output.
Replacing the existing extractProgress short text — that stays as the "single line of progress" complement to the live pane.

References

Parent: #950
Cursor analog: ShellOutputDeltaUpdate in delta-types.d.ts
Existing PreToolUse hook context: docs/plugins.md + apps/server/src/... (search for "rtk" and "PreToolUse").

## User story As an operator watching a long-running shell command (e.g. `bun test`, `just qa`) inside a claude-code agent, I want to see live stdout/stderr just like I would for cursor's `ShellToolCall`, so the dashboard doesn't pretend the agent is idle while a 5-minute test suite is actually running. ## Context The Claude Code SDK does not expose stdout deltas for `Bash` invocations — the dashboard sees one event at start and one at completion. Cursor's SDK exposes `ShellOutputDeltaUpdate` for the same situation. We own the docker container the agent runs in, so we can synthesize equivalent delta events for claude-code by tailing the container's process tree or piping the shell wrapper's output through a sidecar. This is invasive and should land *after* the cursor delta + tool-kind-taxonomy work — render path needs to exist first. ## Acceptance criteria ### Approach (decide during design phase, document the choice in the PR) - [ ] **Option A** — bash wrapper. The PreToolUse hook intercepts every `Bash` invocation, replaces the command with `bash -c '<cmd>' 2> >(tee /dev/stderr | logger -t claude-hooks-shell-${call_id}) | tee /dev/stdout | logger -t claude-hooks-shell-${call_id}` (or simpler: redirect to a per-call FIFO). Server tails `journalctl --user -t claude-hooks-shell-${call_id}` (or the FIFO) and emits `shell_output_delta` events with the right `callId`. - [ ] **Option B** — container shim binary. Replace the in-container shell wrapper with a small Bun script that streams stdout/stderr lines to a unix socket on the host, indexed by call_id. Server reads the socket. - [ ] **Option C** — docker exec sidecar. Per-task sidecar container `docker exec -it claude-hooks-${agent} sh -c 'tail -f /tmp/shell-${call_id}.log'`, tailed by the runner. The PR proposes one, motivates the choice on simplicity / robustness / hook compatibility, and benchmarks at least one realistic workload (e.g. `bun test --watch` for 30 s). ### Wire format - [ ] Reuses the same `ShellOutputDeltaEvent` shape from the cursor delta-streaming issue. UI doesn't know which provider produced it. ### Edge cases - [ ] Lines longer than the safe buffer size are split — UI re-stitches by call_id. - [ ] Tool calls that exit before any stdout is captured emit a single zero-byte synthetic delta + the existing completion event so the widget renders an empty pane, not a perpetually-loading spinner. - [ ] Aborted tasks must close the stream cleanly — no zombie tailers per cancelled call_id. ### Tests - [ ] Synthetic shell command emitting 1k lines, asserting all lines arrive in order with the right call_id. - [ ] Abort test: cancel mid-stream, assert tailer goroutine/process exits. - [ ] Multi-task isolation: two tasks running concurrent shells must not cross-contaminate streams. ## Out of scope - Streaming structured tool inputs (Edit, Write, etc.) for claude-code. Only `Bash` benefits from live output. - Replacing the existing `extractProgress` short text — that stays as the "single line of progress" complement to the live pane. ## References - Parent: #950 - Cursor analog: `ShellOutputDeltaUpdate` in `delta-types.d.ts` - Existing PreToolUse hook context: `docs/plugins.md` + `apps/server/src/...` (search for "rtk" and "PreToolUse").

claude-desktop added the

area:agents

type:user-story

labels

2026-05-08 12:07:02 +00:00

claude-desktop added a new dependency

2026-05-08 12:19:19 +00:00

#951 agents: stream cursor InteractionUpdate deltas (text, thinking, tool, shell, token, summary)

claude-desktop added a new dependency

2026-05-08 12:19:19 +00:00

#954 shared: canonical ToolKind taxonomy mapped from both Anthropic + Cursor tool names

claude-desktop referenced this issue

2026-05-08 12:33:27 +00:00

cursor-sdk-adapter: visibility parity + cancel-race fix + stall watchdog #950

code-lead was assigned by architect

2026-05-08 15:11:39 +00:00

architect commented

2026-05-08 15:11:40 +00:00

Collaborator

Copy link

🤖 Auto-assigned to code-lead (heuristic: area:agents → code-lead (architecture-touching)). Reply /unassign to reroute.

🤖 Auto-assigned to **code-lead** (heuristic: area:agents → code-lead (architecture-touching)). Reply `/unassign` to reroute.

code-lead was unassigned by architect

2026-05-08 15:36:11 +00:00

code-lead was assigned by architect

2026-05-08 15:36:12 +00:00

architect commented

2026-05-08 16:06:13 +00:00

Collaborator

Copy link

🧹 janitor: this ticket has been idle-assigned since 2026-05-08T15:36:12.000Z. Re-dispatching.

code-lead was unassigned by architect

2026-05-08 16:06:14 +00:00

code-lead was assigned by architect

2026-05-08 16:06:14 +00:00

code-lead commented

2026-05-08 19:05:53 +00:00

Collaborator

Copy link

🦵 @charles kicked the queue — re-running implement on @code-lead.

code-lead commented

2026-05-08 19:09:22 +00:00

Collaborator

Copy link

🦵 @charles kicked the queue — re-running implement on @code-lead.

code-lead commented

2026-05-08 19:12:31 +00:00

Collaborator

Copy link

🦵 @charles kicked the queue — re-running implement on @code-lead.

code-lead referenced this issue from a commit

2026-05-08 19:19:53 +00:00

feat(agents): synthesize shell_output_delta for claude-code via container log stream

code-lead referenced this issue from a pull request that will close it,

2026-05-08 19:21:21 +00:00

feat(agents): synthesize shell_output_delta for claude-code via container log stream #994

charles closed this issue

2026-05-08 19:39:01 +00:00

charles referenced this issue from a commit

2026-05-08 19:39:03 +00:00

feat(agents): synthesize shell_output_delta for claude-code via container log stream (#994)

code-lead commented

2026-05-08 19:57:35 +00:00

Collaborator

Copy link

⚠️ duplicate dispatch — issue was closed by PR #994 (merged at 2026-05-08T19:39:01Z) while a parallel code-lead/957 implementation was in progress.

A fully tested implementation landed locally on branch code-lead/957 (b904196 feat(agents): synthesise shell_output_delta for claude-code via container tail) covering all acceptance criteria — Option A bash wrapper via PreToolUse, ShellDeltaSource abstraction with MemoryShellDeltaSource test double + DockerExecShellDeltaSource production, per-agent shell-tee-hook.py rendered by agent-env-sync/render-for-instance.ts, integration into SdkClaudeAgent.runTask, 50 passing tests covering 1k-line ordering, abort cleanup, multi-call isolation, and the python hook's safety guards.

Not pushing the branch — PR #994 already shipped the same surface (different layout: single /tmp/shell-stream/stream.log + base64 frames + ShellStreamTailer vs. per-call /state/shell-deltas/<callId>.{stdout,stderr} + python hook + ShellDeltaSource). Leaving the local commit intact for reference; operator can git diff between branches if a comparative review of the two designs is useful.

Reporting and stopping.

⚠️ duplicate dispatch — issue was closed by PR #994 (merged at 2026-05-08T19:39:01Z) while a parallel `code-lead/957` implementation was in progress. A fully tested implementation landed locally on branch `code-lead/957` (`b904196 feat(agents): synthesise shell_output_delta for claude-code via container tail`) covering all acceptance criteria — Option A bash wrapper via PreToolUse, `ShellDeltaSource` abstraction with `MemoryShellDeltaSource` test double + `DockerExecShellDeltaSource` production, per-agent `shell-tee-hook.py` rendered by `agent-env-sync/render-for-instance.ts`, integration into `SdkClaudeAgent.runTask`, 50 passing tests covering 1k-line ordering, abort cleanup, multi-call isolation, and the python hook's safety guards. Not pushing the branch — PR #994 already shipped the same surface (different layout: single `/tmp/shell-stream/stream.log` + base64 frames + `ShellStreamTailer` vs. per-call `/state/shell-deltas/<callId>.{stdout,stderr}` + python hook + `ShellDeltaSource`). Leaving the local commit intact for reference; operator can `git diff` between branches if a comparative review of the two designs is useful. Reporting and stopping.

claude-desktop referenced this issue

2026-05-11 16:34:00 +00:00

fix(tasks): persist task_history row at task START, not only at finish — restart-kills leave no audit trail #1107

No Branch/Tag specified

main

chore/sync-pre-push-from-forge-base

fix/flows-yaml-dispatch-identity

feat/board-tap-to-assign

dev/1107

code-lead/1106

code-lead/1108

dev/1104

code-lead/1103

code-lead/1080

dev/1087

feat/flows-yaml-ci-events

chore/board-drop-stalled-and-density-controls

fix/flows-yaml-routes-always-register

flows-yaml/api-defaults

dev/1023

fix/event-log-history-bleed

fix/janitor-fix-ci-logs-and-cap

dev/1022

fix/board-card-provider

code-lead/1036

dev/1025

code-lead/1020

dev/1017

code-lead/1026

feat/web-shortcut-registry-1018

dev/1015

code-lead/1009

code-lead/1008

dev/975

dev/969

dev/973

dev/967

code-lead/968

code-lead/953

dev/970

dev/976

code-lead/966

code-lead/956

code-lead/951

dev/962

dev/963

dev/977

dev/955

dev/983

dev/961

dev/974

code-lead/950

code-lead/939

dev/941

dev/940

dev/937

dev/938

dev/936

dev/935

feat/web-i18n-fr-locale

feat/spec-editor-ui-polish

chore/drop-legacy-compat

fix/skills-drop-preview-pane

fix/882-skills-safety-rail

dev/911

dev/909

dev/923

dev/917

dev/915

feat/879-sr11-m2-drop-legacy-skill

code-lead/873

dev/881

code-lead/869

dev/867

code-lead/845

code-lead/843

code-lead/844

dev/837

dev/861

dev/849

code-lead/837

code-lead/842

fix/dedup-rebase-inflight

dev/838

code-lead/847

dev/833

code-lead/848

pr/838

code-lead/841

feat/settings-save-bar/836

code-lead/840

dev/846

code-lead/839

dev/832

fix/board-sse-stale-cache

dev/834

dev/835

feat/settings-breadcrumbs

feat/forge-oauth-credentials

refactor/service-config-consolidation

feat/agent-tokens-to-secrets

feat/gitlab-oauth-to-db

feat/authelia-rip-and-voice-fixes

fix/rebase-storm-and-dead-letter

code-lead/797

code-lead/796

dev/811

code-lead/798

dev/810

code-lead/795

dev/808

code-lead/794

dev/805

dev/802

dev/803

feat/avatar-menu-settings-entry

feat/per-agent-token-tracking

dev/793

dev/747

dev/752

code-lead/790

code-lead/759

dev/756

dev/760

dev/741

dev/767

dev/740

dev/709

dev/644

dev/637

boss/614

dev/600

dev/611

dev/585

fix/login-bonus-fixes

boss/544

dev/542

refactor/api-prefix-and-session-gate

dev/489

boss/531

boss/518

dev/499

boss/516

dev/530

dev/517

dev/519

dev/515

dev/522

dev/503

dev/471

boss/329

dev/417

dev/418

dev/402

boss/327

dev/334

dev/332

boss/326

boss/325

dev/331

boss/324

boss/323

boss/322

dev/294

test/s11-task-analytics

dev/262

boss/270

dev/268

foreman/ui-consolidation-spec

dev/234

boss/196

boss/176

boss/164

fix/124-session-persist-bind

boss/52

dev/87

boss/73

dev/77

dev/81

dev/82

boss/79

dev/42

dev/35

boss/7

No results found.

Labels

Clear labels

area:agents

Agent types, pool scheduling, per-instance config

area:dashboard

Dashboard UI and observability surfaces

area:database

DB layer — schema, migrations, ORM, raw SQL

area:design

UI/UX mockup work — routes to designer agent

area:design-review

Design review dispatch — routes to design-reviewer agent

area:flows

Flow runner — YAML loader, executor, op registry, expression eval

area:infra

Deployment, isolation, containers, systemd units

area:meta

Tracking, scaffolding, project setup

area:security

Security — routes to reviewer-security (opus)

area:sessions

Session-id store, Claude SDK resume logic

area:webhook

Forgejo webhook routing and handlers

area:workdir

Clone cache, worktrees, git identity

security

Security-sensitive issue

Tracking or decisions, not implementation work

No labels

Milestone

Clear milestone

No items

No milestone

Projects

Clear projects

No items

No project

Assignees

Clear assignees

No assignees

code-lead

3 participants

Notifications

Due date

The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Depends on

#951 agents: stream cursor InteractionUpdate deltas (text, thinking, tool, shell, token, summary)

charles/claude-hooks

#954 shared: canonical ToolKind taxonomy mapped from both Anthropic + Cursor tool names

charles/claude-hooks

Reference

charles/claude-hooks#957

Reference in a new issue

Repository

charles/claude-hooks

Title

Body

No description provided.

Delete branch "%!s()"

Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?

Rows
Columns