Claude Code Supervisor
Supervise Claude Code sessions running in tmux. Uses Claude Code hooks with bash pre-filtering (Option D) and fast LLM triage to detect errors, stuck agents, and task completion. Harness-agnostic — wo
Supervise Claude Code sessions running in tmux. Uses Claude Code hooks with bash pre-filtering (Option D) and fast LLM triage to detect errors, stuck agents, and task completion. Harness-agnostic — wo
Real data. Real impact.
Emerging
Developers
Per week
Open source
Skills give you superpowers. Install in 30 seconds.
Bridge between Claude Code's lifecycle hooks and your agent harness.
Claude Code (in tmux) │ Stop / Error / Notification ▼ Bash pre-filter (Option D) │ obvious cases handled directly │ ambiguous cases pass through ▼ Fast LLM triage (claude -p with Haiku, or local LLM) │ classifies: FINE | NEEDS_NUDGE | STUCK | DONE | ESCALATE │ FINE → logged silently ▼ Notify command (configurable) │ openclaw wake, webhook, ntfy, script, etc. ▼ Agent harness decides + acts │ nudge (send-keys to tmux), wait, escalate to human
{baseDir}/scripts/install-hooks.sh /path/to/your/project
Creates:
.claude/hooks/supervisor/ — hook scripts + triage.claude/settings.json — wired into Claude Code lifecycle.claude-code-supervisor.yml — configuration (edit this)Edit
.claude-code-supervisor.yml:
triage: command: "claude -p --no-session-persistence" # or: ollama run llama3.2 model: "claude-haiku-4-20250414"notify: command: "openclaw gateway call wake --params" # or: curl, ntfy, script
Create
~/.openclaw/workspace/supervisor-state.json (or wherever your harness keeps state):
{ "sessions": { "my-task": { "socket": "/tmp/openclaw-tmux-sockets/openclaw.sock", "tmuxSession": "my-task", "projectDir": "/path/to/project", "goal": "Fix issue #42", "successCriteria": "Tests pass, committed", "maxNudges": 5, "escalateAfterMin": 60, "status": "running" } } }
SOCKET="/tmp/openclaw-tmux-sockets/openclaw.sock" tmux -S "$SOCKET" new -d -s my-task tmux -S "$SOCKET" send-keys -t my-task "cd /path/to/project && claude 'Fix issue #42'" Enter
Hooks fire automatically. Triage assesses. You get notified only when it matters.
Not every hook event needs an LLM call. Bash catches the obvious cases first:
| Signal | Bash decision | LLM triage? |
|---|---|---|
| Always needs attention | ✅ Yes |
+ shell prompt back | Agent might be done | ✅ Yes |
+ no prompt | Agent is mid-work | ❌ Skip |
| Normal | ❌ Skip |
| Signal | Bash decision | LLM triage? |
|---|---|---|
| API 429 / rate limit | Transient, will resolve | ❌ Log only |
| API 500 | Agent likely stuck | ✅ Yes |
| Other tool error | Unknown severity | ✅ Yes |
| Signal | Bash decision | LLM triage? |
|---|---|---|
| Internal, transient | ❌ Skip |
| Needs decision | ✅ Yes |
| Agent waiting | ✅ Yes |
The LLM returns one of:
| Verdict | Meaning | Typical action |
|---|---|---|
| FINE | Agent is working normally | Log silently, no notification |
| NEEDS_NUDGE | Transient error, should continue | Send "continue" to tmux |
| STUCK | Looping or not progressing | Try different approach or escalate |
| DONE | Task completed successfully | Report to human |
| ESCALATE | Needs human judgment | Notify human with context |
Wake events arrive with the prefix
cc-supervisor: followed by the classification:
cc-supervisor: NEEDS_NUDGE | error:api_500 | cwd=/home/user/project | ... cc-supervisor: DONE | stopped:end_turn:prompt_back | cwd=/home/user/project | ...
tmux -S "$SOCKET" send-keys -t "$SESSION" "continue — the API error was transient" Enter
See
references/escalation-rules.md for when to nudge vs escalate and quiet hours.
Hooks depend on Claude Code being alive. If the session hard-crashes, hits account limits, or the process gets OOM-killed, no hooks fire. The watchdog catches this.
scripts/watchdog.sh is a pure bash script (no LLM, no Claude Code dependency) that:
supervisor-state.json for all "running" sessionslastWatchdogAt in state for trackingRun it on a timer. Choose your poison:
System cron:
*/15 * * * * /path/to/claude-code-supervisor/scripts/watchdog.sh
OpenClaw cron:
{ "schedule": { "kind": "every", "everyMs": 900000 }, "payload": { "kind": "systemEvent", "text": "cc-supervisor: watchdog — run /path/to/scripts/watchdog.sh and report" }, "sessionTarget": "main" }
systemd timer, launchd, or whatever runs periodically on your box.
The watchdog is deliberately dumb — no LLM, no complex logic, just "is the process still there?" This means it works even when the triage model is down, the API is melting, or your account hit its limit. Belts and suspenders.
scripts/install-hooks.sh — one-command setup per projectscripts/hooks/on-stop.sh — Stop event handler with bash pre-filterscripts/hooks/on-error.sh — PostToolUseFailure handler with bash pre-filterscripts/hooks/on-notify.sh — Notification handler with bash pre-filterscripts/triage.sh — LLM triage (called by hooks for ambiguous cases)scripts/lib.sh — shared config loading and notification functionsscripts/watchdog.sh — dead session detector (pure bash, no LLM dependency)references/state-patterns.md — terminal output pattern matching guidereferences/escalation-rules.md — when to nudge vs escalate vs waitsupervisor.yml.example — example configurationNo automatic installation available. Please visit the source repository for installation instructions.
View Installation Instructions1,500+ AI skills, agents & workflows. Install in 30 seconds. Part of the Torly.ai family.
© 2026 Torly.ai. All rights reserved.