Lukas May 7b93cfe7d7 feat: Remove checkpoint task types — per-phase review is sufficient

Checkpoint tasks (human-verify, decision, human-action) silently blocked
auto-dispatch with no UI to resolve them. Per-phase review + initiative
review already cover human verification, making checkpoints redundant.

Removed from: schema, dispatch manager, tRPC validators, detail prompt,
frontend types, tests, and docs.

2026-03-05 21:30:22 +01:00

15 KiB

Agent Module

apps/server/agent/ — Agent lifecycle management, output parsing, multi-provider support, and account failover.

File Inventory

File	Purpose
`types.ts`	Core types: `AgentInfo`, `AgentManager` interface, `SpawnOptions`, `StreamEvent`
`manager.ts`	`MultiProviderAgentManager` — main orchestrator class
`process-manager.ts`	`AgentProcessManager` — worktree creation, command building, detached spawn
`output-handler.ts`	`OutputHandler` — JSONL stream parsing, completion detection, proposal creation, task dedup, task dependency persistence
`file-tailer.ts`	`FileTailer` — watches output files, fires parser + raw content callbacks
`file-io.ts`	Input and output file handling: frontmatter writing, signal.json reading, Tiptap conversion. Output files support an `action` field (create/update/delete) for chat-mode CRUD.
`markdown-to-tiptap.ts`	Markdown to Tiptap JSON conversion using MarkdownManager
`index.ts`	Public exports, `ClaudeAgentManager` deprecated alias

Sub-modules

Directory	Purpose
`providers/`	Provider registry, presets (7 providers), config types
`providers/parsers/`	Provider-specific output parsers (Claude JSONL, generic line)
`accounts/`	Account discovery, config dir setup, credential management, usage API
`credentials/`	`AccountCredentialManager` — credential injection per account
`lifecycle/`	`LifecycleController` — retry policy, signal recovery, missing signal instructions
`prompts/`	Mode-specific prompt builders (execute, discuss, plan, detail, refine, chat) + shared blocks (test integrity, deviation rules, git workflow, session startup, progress tracking) + inter-agent communication instructions

Key Flows

Spawning an Agent

tRPC procedure calls agentManager.spawn(options)
Manager generates alias (adjective-animal), creates DB record
AgentProcessManager.createWorktree() — creates git worktree at .cw-worktrees/agent/<alias>/
file-io.writeInputFiles() — writes .cw/input/ with assignment files (initiative, pages, phase, task) and read-only context dirs (context/phases/, context/tasks/)
Provider config builds spawn command via buildSpawnCommand()
spawnDetached() — launches detached child process with file output redirection
FileTailer watches output file, fires onEvent (parsed stream events) and onRawContent (raw JSONL chunks) callbacks
onRawContent → DB insert via createLogChunkCallback() → agent:output event emitted (single emission point)
OutputHandler.handleStreamEvent() processes parsed events (session tracking, result capture — no event emission)
DB record updated with PID, output file path, session ID
agent:spawned event emitted
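
The alias and worktree steps at the start of this flow can be sketched as follows. This is a minimal illustration, not the manager's code: the word lists, the collision check, and the helper names `pick`, `generateAlias`, and `worktreePath` are assumptions. Only the adjective-animal shape and the `.cw-worktrees/agent/<alias>/` location come from the steps above.

```typescript
// Hypothetical word lists; the real lists are not shown in this document.
const ADJECTIVES = ["brave", "calm", "eager", "swift"] as const;
const ANIMALS = ["otter", "falcon", "lynx", "heron"] as const;

/** Pick one element using an injectable random source in [0, 1). */
function pick<T>(items: readonly T[], rand: () => number): T {
  return items[Math.floor(rand() * items.length)];
}

/**
 * Build an adjective-animal alias that is not already taken.
 * The retry-on-collision behaviour is an assumption of this sketch.
 */
function generateAlias(
  taken: ReadonlySet<string>,
  rand: () => number = Math.random,
): string {
  for (let attempt = 0; attempt < 100; attempt++) {
    const alias = `${pick(ADJECTIVES, rand)}-${pick(ANIMALS, rand)}`;
    if (!taken.has(alias)) return alias;
  }
  throw new Error("no free alias found");
}

/** Worktree location for an alias, as described in the spawn flow. */
function worktreePath(alias: string): string {
  return `.cw-worktrees/agent/${alias}/`;
}
```

Injecting the random source keeps the function deterministic under test; production code would simply rely on the default.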

Completion Detection

Polling detects process exit, FileTailer.stop() flushes remaining output
OutputHandler.handleCompletion() triggered
Path resolution: Uses ActiveAgent.agentCwd (recorded at spawn) to locate signal.json. Standalone agents run in a workspace/ subdirectory under agent-workdirs/<alias>/, so the base getAgentWorkdir() path won't contain .cw/output/signal.json. Reconciliation and crash detection paths also probe for the workspace/ subdirectory when .cw/output is missing at the base level.
Primary path: Reads .cw/output/signal.json from agent worktree
Signal contains { status: "done"|"questions"|"error", result?, questions?, error? }
Agent DB status updated accordingly (idle, waiting_for_input, crashed)
For done: proposals created from structured output; agent:stopped emitted
For questions: parsed and stored as pendingQuestions; agent:waiting emitted
Fallback: If signal.json missing, lifecycle controller retries with instruction injection
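
The mapping from signal status to agent status and event can be written down as a small pure function. The sketch below assumes the signal shape quoted above; `parseSignal`, `resolveCompletion`, and `CompletionOutcome` are hypothetical names, and the source does not name an event for the error case, so the sketch returns `null` there.

```typescript
// Shape of .cw/output/signal.json as described in this section.
interface Signal {
  status: "done" | "questions" | "error";
  result?: unknown;
  questions?: unknown;
  error?: unknown;
}

type AgentStatus = "idle" | "waiting_for_input" | "crashed";

interface CompletionOutcome {
  agentStatus: AgentStatus;
  // No event is documented for "error", so null is used in that case.
  event: "agent:stopped" | "agent:waiting" | null;
}

/** Parse raw file content; returns null when the content is not a valid signal. */
function parseSignal(raw: string): Signal | null {
  try {
    const value: unknown = JSON.parse(raw);
    if (typeof value !== "object" || value === null) return null;
    const status = (value as { status?: unknown }).status;
    if (status === "done" || status === "questions" || status === "error") {
      return value as Signal;
    }
    return null;
  } catch {
    return null; // malformed JSON is treated like a missing signal
  }
}

/** Map a signal to the DB status and event listed in the steps above. */
function resolveCompletion(signal: Signal): CompletionOutcome {
  switch (signal.status) {
    case "done":
      return { agentStatus: "idle", event: "agent:stopped" };
    case "questions":
      return { agentStatus: "waiting_for_input", event: "agent:waiting" };
    case "error":
      return { agentStatus: "crashed", event: null };
  }
}
```

A `null` from `parseSignal` corresponds to the fallback path, where the lifecycle controller would retry with instruction injection.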

Account Failover

On usage-limit error, markAccountExhausted(id, until) called
findNextAvailable(provider) returns least-recently-used non-exhausted account
Agent re-spawned with new account's credentials
agent:account_switched event emitted
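
A minimal sketch of the selection rule, assuming accounts are plain records held in an array. The `Account` fields and the extra `accounts` and `now` parameters are assumptions made to keep the example self-contained; the real `markAccountExhausted(id, until)` and `findNextAvailable(provider)` presumably read from the account store instead.

```typescript
// Hypothetical account record; field names are assumptions of this sketch.
interface Account {
  id: string;
  provider: string;
  lastUsedAt: number; // epoch milliseconds
  exhaustedUntil: number | null; // null means not exhausted
}

/** Record that an account hit its usage limit until the given time. */
function markAccountExhausted(accounts: Account[], id: string, until: number): void {
  const account = accounts.find((a) => a.id === id);
  if (account) account.exhaustedUntil = until;
}

/** Least-recently-used account for the provider that is not currently exhausted. */
function findNextAvailable(
  accounts: readonly Account[],
  provider: string,
  now: number,
): Account | undefined {
  return accounts
    .filter((a) => a.provider === provider)
    .filter((a) => a.exhaustedUntil === null || a.exhaustedUntil <= now)
    .sort((a, b) => a.lastUsedAt - b.lastUsedAt)[0];
}
```

An `undefined` result means every account for that provider is exhausted; how the manager reacts in that case is not covered here.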

Resume Flow

tRPC resumeAgent called with answers: Record<string, string>
Manager looks up agent's session ID and provider config
buildResumeCommand() creates resume command with session flag
formatAnswersAsPrompt(answers) converts answers to prompt text
New detached process spawned, same worktree, incremented session number
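
To make the resume steps concrete, here is a rough sketch of turning answers into prompt text and assembling a resume command. The prompt wording, the argument order, and the name `buildResumeArgv` are assumptions; only the `Record<string, string>` input and the `--resume <id>` and `-p` flags of the `claude` provider come from this document.

```typescript
/**
 * Convert question-id/answer pairs into prompt text.
 * The exact wording is hypothetical; the source only says answers become a prompt.
 */
function formatAnswersAsPrompt(answers: Record<string, string>): string {
  const lines = Object.entries(answers).map(
    ([questionId, answer]) => `[${questionId}]: ${answer}`,
  );
  return ["Here are the answers to your questions:", ...lines].join("\n");
}

/**
 * Assemble argv for a provider that resumes via `--resume <id>` and takes
 * its prompt via `-p` (the claude row of the provider table).
 */
function buildResumeArgv(command: string, sessionId: string, prompt: string): string[] {
  return [command, "--resume", sessionId, "-p", prompt];
}
```

Returning an argv array rather than a single shell string avoids quoting problems when answers contain spaces or quotes.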

Provider Configuration

Providers defined in providers/presets.ts:

Provider	Command	Resume	Prompt Mode
claude	`claude`	`--resume <id>`	native (`-p`)
claude-code	`claude`	`--resume <id>`	native
codex	`codex`	none	flag (`--prompt`)
aider	`aider`	none	flag (`--message`)
cline	`cline`	none	flag
continue	`continue`	none	flag
cursor-agent	`cursor`	none	flag

Each provider config specifies: command, args, resumeStyle, promptMode, structuredOutput, sessionId extraction, nonInteractive options.
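
A reduced preset shape might look like the sketch below. It models only three rows of the table (those whose flags are spelled out) and only a subset of the fields listed above. The `"flag"` value for `resumeStyle`, the `resumeFlag` and `promptFlag` field names, and the `supportsResume` helper are assumptions; this is not the content of `providers/presets.ts`.

```typescript
type ResumeStyle = "flag" | "none"; // "flag" is an assumed name; "none" appears in the docs
type PromptMode = "native" | "flag";

interface ProviderPreset {
  command: string;
  resumeStyle: ResumeStyle;
  resumeFlag?: string;
  promptMode: PromptMode;
  promptFlag?: string;
}

// Values copied from the provider table; rows without explicit flags are omitted.
const PRESETS: Record<string, ProviderPreset> = {
  claude: { command: "claude", resumeStyle: "flag", resumeFlag: "--resume", promptMode: "native", promptFlag: "-p" },
  codex: { command: "codex", resumeStyle: "none", promptMode: "flag", promptFlag: "--prompt" },
  aider: { command: "aider", resumeStyle: "none", promptMode: "flag", promptFlag: "--message" },
};

/** Build argv for a fresh spawn of the given provider. */
function buildSpawnCommand(provider: string, prompt: string): string[] {
  const preset = PRESETS[provider];
  if (!preset) throw new Error(`unknown provider: ${provider}`);
  const args = preset.promptFlag ? [preset.promptFlag, prompt] : [prompt];
  return [preset.command, ...args];
}

/** True when the provider can resume an existing session. */
function supportsResume(provider: string): boolean {
  const preset = PRESETS[provider];
  return preset !== undefined && preset.resumeStyle !== "none";
}
```

Keeping presets as data means a new provider is a new table row rather than a new code path.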

Output Parsing

The OutputHandler processes JSONL streams from Claude CLI:

init event → session ID extracted and persisted
text_delta events → no-op in handler (output streaming handled by DB log chunks)
result event → final result with structured data captured on ActiveAgent
Signal file (signal.json) → authoritative completion status

Output event flow: FileTailer.onRawContent() → DB insertChunk() → EventBus.emit('agent:output'). This is the single emission point — no events from handleStreamEvent() or processLine().

For providers without structured output, the generic line parser accumulates raw text.
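
The handler behaviour described above can be approximated with a line buffer plus a small state update. In this sketch the event field names (`type`, `sessionId`, `result`) and the names `JsonlBuffer`, `SessionState`, and `applyStreamEvent` are assumptions; the actual Claude CLI event schema is not reproduced in this document.

```typescript
/** Accumulates raw chunks and yields one parsed value per complete JSONL line. */
class JsonlBuffer {
  private pending = "";

  push(chunk: string): unknown[] {
    const lines = (this.pending + chunk).split("\n");
    // The last element is an incomplete line (or "") and waits for more data.
    this.pending = lines.pop() ?? "";
    const events: unknown[] = [];
    for (const line of lines) {
      if (line.trim() === "") continue;
      try {
        events.push(JSON.parse(line));
      } catch {
        // Malformed lines are skipped in this sketch.
      }
    }
    return events;
  }
}

interface SessionState {
  sessionId?: string;
  result?: unknown;
}

/** Session tracking and result capture only; no events are emitted here. */
function applyStreamEvent(state: SessionState, event: unknown): void {
  if (typeof event !== "object" || event === null) return;
  const e = event as { type?: unknown; sessionId?: unknown; result?: unknown };
  if (e.type === "init" && typeof e.sessionId === "string") {
    state.sessionId = e.sessionId;
  } else if (e.type === "result") {
    state.result = e.result;
  }
  // text_delta and anything else: intentionally a no-op.
}
```

Buffering the trailing partial line matters because a file tailer can deliver a chunk that ends in the middle of a JSON object.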

Credential Management

AccountCredentialManager in credentials/ handles OAuth token lifecycle:

read() — extracts claudeAiOauth from .credentials.json. Only accessToken is required; refreshToken and expiresAt may be null (setup tokens).
isExpired() — returns false when expiresAt is null (setup tokens never "expire" from our perspective).
ensureValid() — if expired and refreshToken exists, refreshes. If expired with no refreshToken, returns invalid with error.
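
These three rules reduce to a few lines of logic. The sketch below is illustrative only: it assumes `expiresAt` is an epoch-millisecond timestamp, and `readCredentials` and `planEnsureValid` are hypothetical stand-ins that return a decision instead of performing the refresh.

```typescript
interface OAuthCredentials {
  accessToken: string;
  refreshToken: string | null;
  expiresAt: number | null; // assumed to be epoch milliseconds
}

/** Extract claudeAiOauth from the JSON text of .credentials.json. */
function readCredentials(json: string): OAuthCredentials | null {
  let parsed: { claudeAiOauth?: Partial<OAuthCredentials> };
  try {
    parsed = JSON.parse(json) as { claudeAiOauth?: Partial<OAuthCredentials> };
  } catch {
    return null;
  }
  const oauth = parsed.claudeAiOauth;
  // Only accessToken is required; the other fields may be absent or null.
  if (!oauth || typeof oauth.accessToken !== "string") return null;
  return {
    accessToken: oauth.accessToken,
    refreshToken: oauth.refreshToken ?? null,
    expiresAt: oauth.expiresAt ?? null,
  };
}

/** Setup tokens have no expiry, so they are never reported as expired. */
function isExpired(c: OAuthCredentials, now: number = Date.now()): boolean {
  return c.expiresAt !== null && c.expiresAt <= now;
}

type EnsureValidPlan = "valid" | "refresh" | "invalid";

/** Decide what ensureValid() would need to do for these credentials. */
function planEnsureValid(c: OAuthCredentials, now: number = Date.now()): EnsureValidPlan {
  if (!isExpired(c, now)) return "valid";
  return c.refreshToken !== null ? "refresh" : "invalid";
}
```

Separating the decision from the network call keeps the expiry rules testable without a token endpoint.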

Setup Tokens

Setup tokens (from claude setup-token) are long-lived OAuth access tokens with no refresh token or expiry. Register via:

cw account add --token <token> --email user@example.com

Stored as credentials: {"claudeAiOauth":{"accessToken":"<token>"}} and configJson: {"hasCompletedOnboarding":true}.

Auto-Resume for Conversations

When Agent A asks Agent B a question via cw ask and Agent B is idle, the conversation router automatically resumes Agent B's session. This mirrors the resumeForCommit() pattern.

Flow

createConversation tRPC procedure creates the conversation record
Target resolution prefers running agents, falls back to idle (previously only matched running)
After creation, checks if target agent is idle → calls agentManager.resumeForConversation()
Agent resumes with a prompt to: answer via cw answer, drain pending conversations via cw listen, then complete

Guards

Agent must be idle status with a valid sessionId
Provider must support resume (resumeStyle !== 'none')
Worktree must still exist (existsSync check)
In-memory conversationResumeLocks Set prevents double-resume race when multiple conversations arrive simultaneously
Resume failure is caught and logged — conversation is always created even if resume fails
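
Taken together, the guards amount to one predicate plus a lock that is taken before any asynchronous work starts. The sketch below is an assumption-laden illustration: `ResumeCandidate`, `canResumeForConversation`, and `tryResumeForConversation` are hypothetical names, the worktree check is injected instead of calling `existsSync` directly, and releasing the lock in `finally` is a choice made for this example.

```typescript
interface ResumeCandidate {
  id: string;
  status: string;
  sessionId: string | null;
  resumeStyle: string;
  worktreePath: string;
}

// In-memory lock set, mirroring the conversationResumeLocks guard described above.
const conversationResumeLocks = new Set<string>();

/** All guards from the list above, evaluated synchronously. */
function canResumeForConversation(
  agent: ResumeCandidate,
  worktreeExists: (path: string) => boolean,
): boolean {
  return (
    agent.status === "idle" &&
    agent.sessionId !== null &&
    agent.sessionId !== "" &&
    agent.resumeStyle !== "none" &&
    worktreeExists(agent.worktreePath) &&
    !conversationResumeLocks.has(agent.id)
  );
}

/** Resume at most once per agent; failures are logged, never thrown. */
async function tryResumeForConversation(
  agent: ResumeCandidate,
  worktreeExists: (path: string) => boolean,
  resume: () => Promise<void>,
  log: (message: string) => void,
): Promise<boolean> {
  if (!canResumeForConversation(agent, worktreeExists)) return false;
  conversationResumeLocks.add(agent.id); // taken before the first await
  try {
    await resume();
    return true;
  } catch (err) {
    log(`resume failed for ${agent.id}: ${String(err)}`);
    return false;
  } finally {
    conversationResumeLocks.delete(agent.id);
  }
}
```

Because the lock is added synchronously, two conversations created in the same tick cannot both pass the guard for the same agent.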

Auto-Cleanup & Commit Retries

After an agent completes (status → idle), tryAutoCleanup checks if its project worktrees have uncommitted changes:

CleanupManager.getDirtyWorktreePaths() runs git status --porcelain in each project subdirectory (not the parent agent-workdirs/<alias>/ dir), returns { name, absPath }[]
If all clean → worktrees and logs removed immediately
If dirty → resumeForCommit() resumes the agent's session with a prompt that lists the absolute paths of the dirty subdirectories and tells the agent to stage with git add -u (tracked files only), so unrelated files are not staged
The agent changes (cd) into each listed absolute path and commits tracked changes only
On next completion, cleanup runs again. MAX_COMMIT_RETRIES (1) limits retries — after that the workdir is left in place with a warning

The retry counter is cleaned up on: successful removal, max retries exceeded, or unexpected error. It is not cleaned up when a commit retry is successfully launched (so the counter persists across the retry cycle).
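
The retry bookkeeping above is easy to get subtly wrong, so here is a minimal sketch of the decision. `decideCleanup`, `isDirty`, `CleanupAction`, and the `commitRetries` map are hypothetical names; the value of `MAX_COMMIT_RETRIES` and the counter rules are taken from this section, and the unexpected-error path is left out.

```typescript
const MAX_COMMIT_RETRIES = 1;

type CleanupAction = "remove" | "resume-for-commit" | "leave-with-warning";

interface DirtyWorktree {
  name: string;
  absPath: string;
}

// Per-agent retry counter, kept in memory for this sketch.
const commitRetries = new Map<string, number>();

/** `git status --porcelain` prints nothing when the working tree is clean. */
function isDirty(porcelainOutput: string): boolean {
  return porcelainOutput.trim().length > 0;
}

/** Decide what to do after an agent reaches idle. */
function decideCleanup(agentId: string, dirty: readonly DirtyWorktree[]): CleanupAction {
  if (dirty.length === 0) {
    commitRetries.delete(agentId); // cleaned up on successful removal
    return "remove";
  }
  const used = commitRetries.get(agentId) ?? 0;
  if (used >= MAX_COMMIT_RETRIES) {
    commitRetries.delete(agentId); // cleaned up when max retries are exceeded
    return "leave-with-warning";
  }
  // Counter is kept (and incremented) while a commit retry is in flight.
  commitRetries.set(agentId, used + 1);
  return "resume-for-commit";
}
```

With `MAX_COMMIT_RETRIES` at 1, a dirty agent gets exactly one commit attempt before its workdir is left in place.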

Log Chunks

Agent output is persisted to agent_log_chunks table and drives all live streaming:

onRawContent callback fires for every raw JSONL chunk from FileTailer
DB insert → agent:output event emission (single source of truth for UI)
No FK to agents — survives agent deletion
Session tracking: spawn=1, resume=previousMax+1
Read path (getAgentOutput tRPC): concatenates all DB chunks (no file fallback)
Live path (onAgentOutput subscription): listens for agent:output events
Frontend: initial query loads from DB, subscription accumulates raw JSONL, both parsed via parseAgentOutput()
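
The single-emission-point design can be illustrated with an in-memory stand-in for the table and the event bus. Everything named here (`LogChunkStore`, its methods, the `LogChunk` fields) is an assumption made for the sketch; only the ordering (insert, then emit), the session numbering rule, and the concatenating read path come from the list above.

```typescript
interface LogChunk {
  agentId: string; // plain value, no foreign key to an agents table
  sessionNumber: number;
  content: string; // raw JSONL chunk
}

type OutputListener = (chunk: LogChunk) => void;

class LogChunkStore {
  private chunks: LogChunk[] = [];
  private listeners: OutputListener[] = [];

  /** Live path: subscribe to agent:output-style notifications. */
  onAgentOutput(listener: OutputListener): void {
    this.listeners.push(listener);
  }

  /** spawn = 1, resume = previous maximum + 1. */
  nextSessionNumber(agentId: string): number {
    const previous = this.chunks
      .filter((c) => c.agentId === agentId)
      .map((c) => c.sessionNumber);
    return previous.length === 0 ? 1 : Math.max(...previous) + 1;
  }

  /** Persist first, then notify: the only place output events originate. */
  insertChunk(chunk: LogChunk): void {
    this.chunks.push(chunk);
    for (const listener of this.listeners) listener(chunk);
  }

  /** Read path: concatenate all stored chunks for the agent. */
  getAgentOutput(agentId: string): string {
    return this.chunks
      .filter((c) => c.agentId === agentId)
      .map((c) => c.content)
      .join("");
  }
}
```

Emitting only after the insert means a subscriber that reloads from the store never sees less than it was notified about.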

Inter-Agent Communication

Agents can communicate with each other via the conversations table, coordinated through CLI commands.

Prompt Integration

buildInterAgentCommunication(agentId, mode) function in prompts/shared.ts generates per-agent communication instructions. Called in manager.ts after agent record creation — the actual agent ID is injected directly into the prompt (no manifest.json indirection).

Mode-aware branching:

Planning modes (plan, refine): Minimal block — just the agent ID and cw ask syntax for emergencies. These agents define high-level structure, not implementation details, so real-time coordination is almost never needed.
Execution + coordination modes (execute, detail, discuss, verify, merge, review): Full protocol including:
1. Commands table with accurate CLI behavior descriptions
2. Numbered shell recipe for background listener lifecycle (start → check → answer → restart → cleanup)
3. Targeting guidance (--agent-id vs --task-id vs --phase-id)
4. Decision criteria: when to ask (uncommitted interfaces, shared file conflicts) and when NOT to ask (answer in codebase, answer in input files, not blocked, confirming approach)
5. Good/bad examples using <example label> pattern
6. Answering guidelines (be specific — include code snippets, file paths, type signatures)

Agent Identity

manifest.json includes agentId and agentName fields. The manager passes these from the DB record after agent creation. The agent ID is also injected directly into the prompt's communication instructions.

CLI Commands

cw listen --agent-id <id>

Subscribes to the onPendingConversation SSE stream, prints the first pending conversation as JSON, then exits with code 0
First yields any existing pending conversations from DB, then listens for conversation:created events
Output: { conversationId, fromAgentId, question, phaseId?, taskId? }

cw ask <question> --from <agentId> --agent-id|--task-id|--phase-id <target>

Creates conversation, subscribes to onConversationAnswer SSE, prints answer text to stdout when answered
Target resolution: --agent-id (direct), --task-id (find agent running task), --phase-id (find agent in phase)

cw answer <answer> --conversation-id <id>

Calls answerConversation, prints { conversationId, status: "answered" }

Prompt Architecture

Mode-specific prompts in prompts/ use XML tags as top-level structural delimiters, with markdown formatting inside tags. This separates first-order instructions from second-order content (task descriptions, examples, templates) per Anthropic best practices. The old apps/server/agent/prompts.ts (flat markdown) has been deleted.

XML Tag Structure

All prompts follow a consistent tag ordering:

<role> — agent identity and mode
<task> — dynamic task content (execute mode only)
<input_files> — file format documentation
<codebase_exploration> — codebase grounding instructions (architect modes only)
<output_format> — what to produce, file paths, frontmatter
<id_generation> — ID creation via cw id
<signal_format> — completion signaling
<session_startup> — startup verification steps
Mode-specific tags (see below)
Rules/constraints tags
<progress_tracking> / <context_management>
<definition_of_done> — completion checklist
<workspace> — workspace layout (appended by manager)
<inter_agent_communication> — per-agent CLI instructions (appended by manager)

Shared Blocks (`prompts/shared.ts`)

Constant / Function	XML Tag	Content
`SIGNAL_FORMAT`	`<signal_format>`	Done/questions/error via `.cw/output/signal.json`
`INPUT_FILES`	`<input_files>`	Manifest, assignment files, context files
`ID_GENERATION`	`<id_generation>`	`cw id` usage for generating entity IDs
`TEST_INTEGRITY`	`<test_integrity>`	No self-validating tests, no assertion mutation, no skipping, independent tests, full suite runs
`SESSION_STARTUP`	`<session_startup>`	Confirm working directory, check git state, establish green test baseline, read assignment
`PROGRESS_TRACKING`	`<progress_tracking>`	Maintain `.cw/output/progress.md` after each commit — survives context compaction
`DEVIATION_RULES`	`<deviation_rules>`	Typo→fix, bug→fix if small, missing dep→coordinate, architectural mismatch→STOP
`GIT_WORKFLOW`	`<git_workflow>`	Specific file staging (no `git add .`), no force-push, check status first
`CODEBASE_EXPLORATION`	`<codebase_exploration>`	Architect-mode codebase grounding: read project docs, explore structure, check existing patterns, use subagents for parallel exploration
`CONTEXT_MANAGEMENT`	`<context_management>`	Parallel file reads, cross-reference to progress tracking
`buildInterAgentCommunication()`	`<inter_agent_communication>`	Per-agent CLI instructions for `cw listen`, `cw ask`, `cw answer`

Mode-Specific Tags

Mode	File	Mode-Specific Tags
execute	`execute.ts`	`<task>`, `<execution_protocol>`, `<anti_patterns>`, `<scope_rules>`
plan	`plan.ts`	`<phase_design>`, `<dependencies>`, `<file_ownership>`, `<specificity>`, `<existing_context>`
detail	`detail.ts`	`<task_body_requirements>`, `<file_ownership>`, `<task_sizing>`, `<existing_context>`
discuss	`discuss.ts`	`<analysis_method>`, `<question_quality>`, `<decision_quality>`, `<question_categories>`, `<rules>`
refine	`refine.ts`	`<improvement_priorities>`, `<rules>`
chat	`chat.ts`	`<chat_history>`, `<instruction>` — iterative refinement loop, uses action field (create/update/delete) in output files, signals "questions" after each change to stay alive

Examples within mode-specific tags use <examples> > <example label="good"> / <example label="bad"> nesting.

Execute Prompt Dispatch

buildExecutePrompt(taskDescription?) accepts an optional task description wrapped in a <task> tag. The dispatch manager (apps/server/dispatch/manager.ts) wraps task.description || task.name in buildExecutePrompt() so execute agents receive full system context alongside their task. The <workspace> and <inter_agent_communication> blocks are appended by the agent manager at spawn time.
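
A stripped-down sketch of this wrapping, with placeholder prompt text: the `ROLE_BLOCK` constant, the `TaskLike` type, and `promptForTask` are hypothetical, and the real `buildExecutePrompt()` assembles many more blocks than shown here.

```typescript
// Placeholder for the real <role> block, whose text is not reproduced here.
const ROLE_BLOCK = "<role>\n(placeholder role text)\n</role>";

/** Wrap an optional task description in a <task> tag after the role block. */
function buildExecutePrompt(taskDescription?: string): string {
  const blocks = [ROLE_BLOCK];
  if (taskDescription) {
    blocks.push(`<task>\n${taskDescription}\n</task>`);
  }
  return blocks.join("\n\n");
}

// Hypothetical minimal task shape used by the dispatch side of this sketch.
interface TaskLike {
  name: string;
  description?: string | null;
}

/** Dispatch-side call: prefer the description, fall back to the task name. */
function promptForTask(task: TaskLike): string {
  return buildExecutePrompt(task.description || task.name);
}
```

Using `||` rather than `??` means an empty description also falls back to the task name, which matches the expression quoted above.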
