docs(04): update agent plans with names and CLI approach

Key changes:
- Add agent names (human-readable like 'gastown') instead of UUID-only
- Use Claude CLI with --output-format json instead of SDK streaming
- Session ID extracted from CLI JSON output, not SDK init message
- Add waiting_for_input status for AskUserQuestion scenarios
- Resume flow is for answering agent questions, not general resumption
- CLI commands use names as primary identifier
This commit is contained in:
Lukas May
2026-01-30 19:53:29 +01:00
parent 781fbd0b23
commit ffe5acceff
5 changed files with 427 additions and 447 deletions

View File

@@ -43,11 +43,12 @@ Add agents table to schema.ts following existing patterns:
```typescript
export const agents = sqliteTable('agents', {
id: text('id').primaryKey(),
name: text('name').notNull().unique(), // Human-readable name (e.g., 'gastown', 'chinatown')
taskId: text('task_id')
.references(() => tasks.id, { onDelete: 'set null' }), // Task may be deleted
sessionId: text('session_id').notNull(), // Claude SDK session ID for resumption
sessionId: text('session_id'), // Claude CLI session ID for resumption (null until first run completes)
worktreeId: text('worktree_id').notNull(), // WorktreeManager worktree ID
status: text('status', { enum: ['idle', 'running', 'stopped', 'crashed'] })
status: text('status', { enum: ['idle', 'running', 'waiting_for_input', 'stopped', 'crashed'] })
.notNull()
.default('idle'),
createdAt: integer('created_at', { mode: 'timestamp' }).notNull(),
@@ -55,6 +56,8 @@ export const agents = sqliteTable('agents', {
});
```
Note: `name` is unique, human-readable identifier. `sessionId` nullable - populated from CLI JSON output after first run. `waiting_for_input` status is for when agent pauses on AskUserQuestion.
Add relations:
```typescript
export const agentsRelations = relations(agents, ({ one }) => ({
@@ -87,11 +90,12 @@ Create AgentRepository port interface following same pattern as TaskRepository:
// src/db/repositories/agent-repository.ts
import type { Agent, NewAgent } from '../schema.js';
export type AgentStatus = 'idle' | 'running' | 'stopped' | 'crashed';
export type AgentStatus = 'idle' | 'running' | 'waiting_for_input' | 'stopped' | 'crashed';
export interface AgentRepository {
create(agent: Omit<NewAgent, 'createdAt' | 'updatedAt'>): Promise<Agent>;
findById(id: string): Promise<Agent | null>;
findByName(name: string): Promise<Agent | null>; // Lookup by human-readable name
findByTaskId(taskId: string): Promise<Agent | null>;
findBySessionId(sessionId: string): Promise<Agent | null>;
findAll(): Promise<Agent[]>;
@@ -121,11 +125,13 @@ Create DrizzleAgentRepository following existing patterns (see drizzle/task.ts):
Tests should cover:
- create() returns agent with timestamps
- create() rejects duplicate names
- findById() returns null for non-existent
- findByName() finds agent by human-readable name
- findByTaskId() finds agent by task
- findBySessionId() finds agent by session
- findAll() returns all agents
- findByStatus() filters correctly
- findByStatus() filters correctly (including waiting_for_input)
- updateStatus() changes status and updatedAt
- updateSessionId() changes sessionId
- delete() removes agent

View File

@@ -49,12 +49,14 @@ Create new agent module with port interface following WorktreeManager pattern:
* AgentManager is the PORT. Implementations are ADAPTERS.
*/
export type AgentStatus = 'idle' | 'running' | 'stopped' | 'crashed';
export type AgentStatus = 'idle' | 'running' | 'waiting_for_input' | 'stopped' | 'crashed';
/**
* Options for spawning a new agent
*/
export interface SpawnAgentOptions {
/** Human-readable name for the agent (e.g., 'gastown', 'chinatown') */
name: string;
/** Task ID to assign to agent */
taskId: string;
/** Initial prompt/instruction for the agent */
@@ -69,13 +71,15 @@ export interface SpawnAgentOptions {
export interface AgentInfo {
/** Unique identifier for this agent */
id: string;
/** Human-readable name for the agent */
name: string;
/** Task this agent is working on */
taskId: string;
/** Claude SDK session ID for resumption */
sessionId: string;
/** Claude CLI session ID for resumption (null until first run completes) */
sessionId: string | null;
/** WorktreeManager worktree ID */
worktreeId: string;
/** Current status */
/** Current status (waiting_for_input = paused on AskUserQuestion) */
status: AgentStatus;
/** When the agent was created */
createdAt: Date;
@@ -145,13 +149,22 @@ export interface AgentManager {
get(agentId: string): Promise<AgentInfo | null>;
/**
* Resume an idle agent with a new prompt.
* Get a specific agent by name.
*
* @param name - Agent name (human-readable)
* @returns Agent if found, null otherwise
*/
getByName(name: string): Promise<AgentInfo | null>;
/**
* Resume an agent that's waiting for input.
*
* Used when agent paused on AskUserQuestion and user provides response.
* Uses stored session ID to continue with full context.
* Agent must be in 'idle' status.
* Agent must be in 'waiting_for_input' status.
*
* @param agentId - Agent to resume
* @param prompt - New instruction for the agent
* @param prompt - User's response to continue the agent
*/
resume(agentId: string, prompt: string): Promise<void>;
@@ -191,8 +204,8 @@ export interface AgentSpawnedEvent extends DomainEvent {
type: 'agent:spawned';
payload: {
agentId: string;
name: string;
taskId: string;
sessionId: string;
worktreeId: string;
};
}
@@ -201,8 +214,9 @@ export interface AgentStoppedEvent extends DomainEvent {
type: 'agent:stopped';
payload: {
agentId: string;
name: string;
taskId: string;
reason: 'user_requested' | 'task_complete' | 'error';
reason: 'user_requested' | 'task_complete' | 'error' | 'waiting_for_input';
};
}
@@ -210,6 +224,7 @@ export interface AgentCrashedEvent extends DomainEvent {
type: 'agent:crashed';
payload: {
agentId: string;
name: string;
taskId: string;
error: string;
};
@@ -219,10 +234,22 @@ export interface AgentResumedEvent extends DomainEvent {
type: 'agent:resumed';
payload: {
agentId: string;
name: string;
taskId: string;
sessionId: string;
};
}
export interface AgentWaitingEvent extends DomainEvent {
type: 'agent:waiting';
payload: {
agentId: string;
name: string;
taskId: string;
sessionId: string;
question: string; // The question being asked
};
}
```
Update the DomainEventType union to include new event types.

View File

@@ -4,14 +4,14 @@ plan: 03
type: execute
wave: 2
depends_on: ["04-01", "04-02"]
files_modified: [package.json, src/agent/manager.ts, src/agent/manager.test.ts, src/agent/index.ts]
files_modified: [src/agent/manager.ts, src/agent/manager.test.ts, src/agent/index.ts]
autonomous: true
---
<objective>
Implement ClaudeAgentManager adapter using the Claude Agent SDK.
Implement ClaudeAgentManager adapter using Claude CLI with JSON output.
Purpose: Provide concrete implementation of AgentManager that spawns real Claude agents.
Purpose: Provide concrete implementation of AgentManager that spawns real Claude agents via CLI.
Output: ClaudeAgentManager adapter with comprehensive tests.
</objective>
@@ -33,45 +33,51 @@ Output: ClaudeAgentManager adapter with comprehensive tests.
@src/git/manager.ts
@src/db/repositories/agent-repository.ts
@src/events/types.ts
@src/process/manager.ts
</context>
<tasks>
<task type="auto">
<name>Task 1: Install Claude Agent SDK</name>
<files>package.json</files>
<action>
Install the Claude Agent SDK:
```bash
npm install @anthropic-ai/claude-agent-sdk
```
Verify installation by checking package.json includes the dependency.
Note: SDK requires Node.js 18+ (already satisfied by project).
</action>
<verify>npm ls @anthropic-ai/claude-agent-sdk shows package installed</verify>
<done>@anthropic-ai/claude-agent-sdk added to dependencies</done>
</task>
<task type="auto">
<name>Task 2: Implement ClaudeAgentManager adapter</name>
<name>Task 1: Implement ClaudeAgentManager adapter</name>
<files>src/agent/manager.ts, src/agent/index.ts</files>
<action>
Create ClaudeAgentManager implementing AgentManager port:
Create ClaudeAgentManager implementing AgentManager port.
**Key insight:** Use `claude -p "prompt" --output-format json` CLI mode, not SDK streaming.
The session_id is returned in the JSON result:
```json
{
"type": "result",
"subtype": "success",
"session_id": "f38b6614-d740-4441-a123-0bb3bea0d6a9",
"result": "..."
}
```
Use existing ProcessManager pattern (execa) but with JSON output parsing.
```typescript
// src/agent/manager.ts
import { query } from '@anthropic-ai/claude-agent-sdk';
import { execa, type ResultPromise } from 'execa';
import { randomUUID } from 'crypto';
import type { AgentManager, AgentInfo, SpawnAgentOptions, AgentResult, AgentStatus } from './types.js';
import type { AgentRepository } from '../db/repositories/agent-repository.js';
import type { WorktreeManager } from '../git/types.js';
import type { EventBus, AgentSpawnedEvent, AgentStoppedEvent, AgentCrashedEvent, AgentResumedEvent } from '../events/index.js';
import type { EventBus, AgentSpawnedEvent, AgentStoppedEvent, AgentCrashedEvent, AgentResumedEvent, AgentWaitingEvent } from '../events/index.js';
interface ClaudeCliResult {
type: 'result';
subtype: 'success' | 'error';
is_error: boolean;
session_id: string;
result: string;
total_cost_usd?: number;
}
interface ActiveAgent {
abortController: AbortController;
subprocess: ResultPromise;
result?: AgentResult;
}
@@ -85,145 +91,150 @@ export class ClaudeAgentManager implements AgentManager {
) {}
async spawn(options: SpawnAgentOptions): Promise<AgentInfo> {
const { taskId, prompt, cwd } = options;
const { name, taskId, prompt, cwd } = options;
const agentId = randomUUID();
const branchName = `agent/${agentId}`;
const branchName = `agent/${name}`; // Use name for branch
// Check name uniqueness
const existing = await this.repository.findByName(name);
if (existing) {
throw new Error(`Agent with name '${name}' already exists`);
}
// 1. Create isolated worktree
const worktree = await this.worktreeManager.create(agentId, branchName);
// 2. Create agent record (session ID set after SDK init)
// 2. Create agent record (session ID null until first run completes)
const agent = await this.repository.create({
id: agentId,
name,
taskId,
sessionId: '', // Updated after SDK init
sessionId: null,
worktreeId: worktree.id,
status: 'running',
});
// 3. Start agent execution
const abortController = new AbortController();
this.activeAgents.set(agentId, { abortController });
// 3. Start Claude CLI in background
const subprocess = execa('claude', [
'-p', prompt,
'--output-format', 'json',
], {
cwd: cwd ?? worktree.path,
detached: true,
stdio: ['ignore', 'pipe', 'pipe'], // Capture stdout/stderr
});
// Run agent in background (non-blocking)
this.runAgent(agentId, prompt, cwd ?? worktree.path, abortController.signal)
.catch(error => this.handleAgentError(agentId, error));
this.activeAgents.set(agentId, { subprocess });
// Emit spawned event
if (this.eventBus) {
const event: AgentSpawnedEvent = {
type: 'agent:spawned',
timestamp: new Date(),
payload: {
agentId,
name,
taskId,
worktreeId: worktree.id,
},
};
this.eventBus.emit(event);
}
// Handle completion in background
this.handleAgentCompletion(agentId, subprocess);
return this.toAgentInfo(agent);
}
private async runAgent(
agentId: string,
prompt: string,
cwd: string,
signal: AbortSignal
): Promise<void> {
private async handleAgentCompletion(agentId: string, subprocess: ResultPromise): Promise<void> {
try {
let sessionId: string | undefined;
const { stdout, stderr } = await subprocess;
const agent = await this.repository.findById(agentId);
if (!agent) return;
for await (const message of query({
prompt,
options: {
allowedTools: ['Read', 'Write', 'Edit', 'Bash', 'Glob', 'Grep'],
permissionMode: 'bypassPermissions',
cwd,
}
})) {
// Check for abort
if (signal.aborted) {
throw new Error('Agent stopped by user');
}
// Parse JSON result
const result: ClaudeCliResult = JSON.parse(stdout);
// Capture session ID from init message
if (message.type === 'system' && message.subtype === 'init') {
sessionId = message.session_id;
await this.repository.updateSessionId(agentId, sessionId);
// Emit spawned event now that we have session ID
if (this.eventBus) {
const agent = await this.repository.findById(agentId);
if (agent) {
const event: AgentSpawnedEvent = {
type: 'agent:spawned',
timestamp: new Date(),
payload: {
agentId,
taskId: agent.taskId ?? '',
sessionId,
worktreeId: agent.worktreeId,
},
};
this.eventBus.emit(event);
}
}
}
// Handle result
if (message.type === 'result') {
const active = this.activeAgents.get(agentId);
if (active) {
active.result = {
success: message.subtype === 'success',
message: message.subtype === 'success'
? 'Task completed successfully'
: 'Task failed',
};
}
}
// Store session_id for potential resume
if (result.session_id) {
await this.repository.updateSessionId(agentId, result.session_id);
}
// Agent completed successfully
// Store result
const active = this.activeAgents.get(agentId);
if (active) {
active.result = {
success: result.subtype === 'success',
message: result.result,
};
}
// Update status to idle (ready for next prompt or resume)
await this.repository.updateStatus(agentId, 'idle');
if (this.eventBus) {
const agent = await this.repository.findById(agentId);
if (agent) {
const event: AgentStoppedEvent = {
type: 'agent:stopped',
timestamp: new Date(),
payload: {
agentId,
taskId: agent.taskId ?? '',
reason: 'task_complete',
},
};
this.eventBus.emit(event);
}
const event: AgentStoppedEvent = {
type: 'agent:stopped',
timestamp: new Date(),
payload: {
agentId,
name: agent.name,
taskId: agent.taskId ?? '',
reason: 'task_complete',
},
};
this.eventBus.emit(event);
}
} catch (error) {
throw error;
await this.handleAgentError(agentId, error);
}
}
private async handleAgentError(agentId: string, error: unknown): Promise<void> {
const errorMessage = error instanceof Error ? error.message : String(error);
const agent = await this.repository.findById(agentId);
if (!agent) return;
// Check if this was a user-requested stop
if (errorMessage === 'Agent stopped by user') {
await this.repository.updateStatus(agentId, 'stopped');
return;
}
// Check if this is a "waiting for input" scenario (agent asked AskUserQuestion)
// The CLI exits with a specific pattern when waiting for user input
if (errorMessage.includes('waiting for input') || errorMessage.includes('user_question')) {
await this.repository.updateStatus(agentId, 'waiting_for_input');
// Crashed
await this.repository.updateStatus(agentId, 'crashed');
if (this.eventBus) {
const agent = await this.repository.findById(agentId);
if (agent) {
const event: AgentCrashedEvent = {
type: 'agent:crashed',
if (this.eventBus) {
const event: AgentWaitingEvent = {
type: 'agent:waiting',
timestamp: new Date(),
payload: {
agentId,
name: agent.name,
taskId: agent.taskId ?? '',
error: errorMessage,
sessionId: agent.sessionId ?? '',
question: errorMessage, // Would need to parse actual question
},
};
this.eventBus.emit(event);
}
return;
}
// Actual crash
await this.repository.updateStatus(agentId, 'crashed');
if (this.eventBus) {
const event: AgentCrashedEvent = {
type: 'agent:crashed',
timestamp: new Date(),
payload: {
agentId,
name: agent.name,
taskId: agent.taskId ?? '',
error: errorMessage,
},
};
this.eventBus.emit(event);
}
// Store error result
const active = this.activeAgents.get(agentId);
if (active) {
active.result = {
@@ -241,7 +252,7 @@ export class ClaudeAgentManager implements AgentManager {
const active = this.activeAgents.get(agentId);
if (active) {
active.abortController.abort();
active.subprocess.kill('SIGTERM');
this.activeAgents.delete(agentId);
}
@@ -253,6 +264,7 @@ export class ClaudeAgentManager implements AgentManager {
timestamp: new Date(),
payload: {
agentId,
name: agent.name,
taskId: agent.taskId ?? '',
reason: 'user_requested',
},
@@ -271,18 +283,23 @@ export class ClaudeAgentManager implements AgentManager {
return agent ? this.toAgentInfo(agent) : null;
}
async getByName(name: string): Promise<AgentInfo | null> {
const agent = await this.repository.findByName(name);
return agent ? this.toAgentInfo(agent) : null;
}
async resume(agentId: string, prompt: string): Promise<void> {
const agent = await this.repository.findById(agentId);
if (!agent) {
throw new Error(`Agent '${agentId}' not found`);
}
if (agent.status !== 'idle') {
throw new Error(`Agent '${agentId}' is not idle (status: ${agent.status})`);
if (agent.status !== 'waiting_for_input') {
throw new Error(`Agent '${agent.name}' is not waiting for input (status: ${agent.status})`);
}
if (!agent.sessionId) {
throw new Error(`Agent '${agentId}' has no session to resume`);
throw new Error(`Agent '${agent.name}' has no session to resume`);
}
// Get worktree path
@@ -291,20 +308,28 @@ export class ClaudeAgentManager implements AgentManager {
throw new Error(`Worktree '${agent.worktreeId}' not found`);
}
// Update status to running
await this.repository.updateStatus(agentId, 'running');
// Create new abort controller
const abortController = new AbortController();
this.activeAgents.set(agentId, { abortController });
// Start CLI with --resume flag
const subprocess = execa('claude', [
'-p', prompt,
'--resume', agent.sessionId,
'--output-format', 'json',
], {
cwd: worktree.path,
detached: true,
stdio: ['ignore', 'pipe', 'pipe'],
});
this.activeAgents.set(agentId, { subprocess });
// Emit resumed event
if (this.eventBus) {
const event: AgentResumedEvent = {
type: 'agent:resumed',
timestamp: new Date(),
payload: {
agentId,
name: agent.name,
taskId: agent.taskId ?? '',
sessionId: agent.sessionId,
},
@@ -312,65 +337,7 @@ export class ClaudeAgentManager implements AgentManager {
this.eventBus.emit(event);
}
// Run with resume option
this.runAgentResume(agentId, prompt, worktree.path, agent.sessionId, abortController.signal)
.catch(error => this.handleAgentError(agentId, error));
}
private async runAgentResume(
agentId: string,
prompt: string,
cwd: string,
sessionId: string,
signal: AbortSignal
): Promise<void> {
try {
for await (const message of query({
prompt,
options: {
allowedTools: ['Read', 'Write', 'Edit', 'Bash', 'Glob', 'Grep'],
permissionMode: 'bypassPermissions',
cwd,
resume: sessionId,
}
})) {
if (signal.aborted) {
throw new Error('Agent stopped by user');
}
if (message.type === 'result') {
const active = this.activeAgents.get(agentId);
if (active) {
active.result = {
success: message.subtype === 'success',
message: message.subtype === 'success'
? 'Task completed successfully'
: 'Task failed',
};
}
}
}
await this.repository.updateStatus(agentId, 'idle');
if (this.eventBus) {
const agent = await this.repository.findById(agentId);
if (agent) {
const event: AgentStoppedEvent = {
type: 'agent:stopped',
timestamp: new Date(),
payload: {
agentId,
taskId: agent.taskId ?? '',
reason: 'task_complete',
},
};
this.eventBus.emit(event);
}
}
} catch (error) {
throw error;
}
this.handleAgentCompletion(agentId, subprocess);
}
async getResult(agentId: string): Promise<AgentResult | null> {
@@ -378,9 +345,19 @@ export class ClaudeAgentManager implements AgentManager {
return active?.result ?? null;
}
private toAgentInfo(agent: { id: string; taskId: string | null; sessionId: string; worktreeId: string; status: string; createdAt: Date; updatedAt: Date }): AgentInfo {
private toAgentInfo(agent: {
id: string;
name: string;
taskId: string | null;
sessionId: string | null;
worktreeId: string;
status: string;
createdAt: Date;
updatedAt: Date;
}): AgentInfo {
return {
id: agent.id,
name: agent.name,
taskId: agent.taskId ?? '',
sessionId: agent.sessionId,
worktreeId: agent.worktreeId,
@@ -399,14 +376,14 @@ export { ClaudeAgentManager } from './manager.js';
```
</action>
<verify>npm run build passes with no TypeScript errors</verify>
<done>ClaudeAgentManager adapter implemented and exported</done>
<done>ClaudeAgentManager adapter implemented using CLI with JSON output</done>
</task>
<task type="auto">
<name>Task 3: Write tests for AgentManager</name>
<name>Task 2: Write tests for AgentManager</name>
<files>src/agent/manager.test.ts</files>
<action>
Create unit tests for ClaudeAgentManager. Since we can't actually spawn Claude agents in tests, mock the SDK:
Create unit tests for ClaudeAgentManager. Mock execa since we can't spawn real Claude CLI:
```typescript
// src/agent/manager.test.ts
@@ -416,14 +393,13 @@ import type { AgentRepository } from '../db/repositories/agent-repository.js';
import type { WorktreeManager, Worktree } from '../git/types.js';
import { EventEmitterBus } from '../events/index.js';
// Mock the Claude Agent SDK
vi.mock('@anthropic-ai/claude-agent-sdk', () => ({
query: vi.fn(),
// Mock execa
vi.mock('execa', () => ({
execa: vi.fn(),
}));
import { query } from '@anthropic-ai/claude-agent-sdk';
const mockQuery = vi.mocked(query);
import { execa } from 'execa';
const mockExeca = vi.mocked(execa);
describe('ClaudeAgentManager', () => {
let manager: ClaudeAgentManager;
@@ -433,13 +409,14 @@ describe('ClaudeAgentManager', () => {
const mockWorktree: Worktree = {
id: 'worktree-123',
branch: 'agent/test',
branch: 'agent/gastown',
path: '/tmp/worktree',
isMainWorktree: false,
};
const mockAgent = {
id: 'agent-123',
name: 'gastown',
taskId: 'task-456',
sessionId: 'session-789',
worktreeId: 'worktree-123',
@@ -449,12 +426,12 @@ describe('ClaudeAgentManager', () => {
};
beforeEach(() => {
// Reset mocks
vi.clearAllMocks();
mockRepository = {
create: vi.fn().mockResolvedValue(mockAgent),
findById: vi.fn().mockResolvedValue(mockAgent),
findByName: vi.fn().mockResolvedValue(null), // No duplicate by default
findByTaskId: vi.fn().mockResolvedValue(mockAgent),
findBySessionId: vi.fn().mockResolvedValue(mockAgent),
findAll: vi.fn().mockResolvedValue([mockAgent]),
@@ -482,151 +459,139 @@ describe('ClaudeAgentManager', () => {
});
describe('spawn', () => {
it('creates worktree and agent record', async () => {
// Mock query to complete immediately
mockQuery.mockImplementation(async function* () {
yield { type: 'system', subtype: 'init', session_id: 'sess-123' };
yield { type: 'result', subtype: 'success' };
});
it('creates worktree and agent record with name', async () => {
const mockSubprocess = {
pid: 123,
kill: vi.fn(),
then: () => Promise.resolve({ stdout: '{"type":"result","subtype":"success","session_id":"sess-123","result":"done"}', stderr: '' }),
catch: () => mockSubprocess,
};
mockExeca.mockReturnValue(mockSubprocess as any);
const result = await manager.spawn({
name: 'gastown',
taskId: 'task-456',
prompt: 'Test task',
});
expect(mockWorktreeManager.create).toHaveBeenCalledWith(
expect.any(String),
expect.stringContaining('agent/')
'agent/gastown' // Uses name for branch
);
expect(mockRepository.create).toHaveBeenCalled();
expect(result.taskId).toBe('task-456');
expect(mockRepository.create).toHaveBeenCalledWith(
expect.objectContaining({ name: 'gastown' })
);
expect(result.name).toBe('gastown');
});
it('emits AgentSpawned event', async () => {
it('rejects duplicate agent names', async () => {
mockRepository.findByName = vi.fn().mockResolvedValue(mockAgent);
await expect(manager.spawn({
name: 'gastown',
taskId: 'task-456',
prompt: 'Test',
})).rejects.toThrow("Agent with name 'gastown' already exists");
});
it('emits AgentSpawned event with name', async () => {
const events: any[] = [];
eventBus.subscribe((event) => events.push(event));
mockQuery.mockImplementation(async function* () {
yield { type: 'system', subtype: 'init', session_id: 'sess-123' };
yield { type: 'result', subtype: 'success' };
});
const mockSubprocess = {
pid: 123,
kill: vi.fn(),
then: () => Promise.resolve({ stdout: '{"type":"result","subtype":"success","session_id":"sess-123","result":"done"}', stderr: '' }),
catch: () => mockSubprocess,
};
mockExeca.mockReturnValue(mockSubprocess as any);
await manager.spawn({ taskId: 'task-456', prompt: 'Test' });
await manager.spawn({ name: 'gastown', taskId: 'task-456', prompt: 'Test' });
// Wait for async event
await new Promise(resolve => setTimeout(resolve, 10));
expect(events.some(e => e.type === 'agent:spawned')).toBe(true);
const spawnedEvent = events.find(e => e.type === 'agent:spawned');
expect(spawnedEvent).toBeDefined();
expect(spawnedEvent.payload.name).toBe('gastown');
});
});
describe('stop', () => {
it('stops running agent', async () => {
// First spawn an agent
mockQuery.mockImplementation(async function* () {
yield { type: 'system', subtype: 'init', session_id: 'sess-123' };
// Hang here - never yield result
await new Promise(() => {});
});
const mockSubprocess = {
pid: 123,
kill: vi.fn(),
then: () => new Promise(() => {}), // Never resolves
catch: () => mockSubprocess,
};
mockExeca.mockReturnValue(mockSubprocess as any);
const agent = await manager.spawn({ taskId: 'task-456', prompt: 'Test' });
await manager.stop(agent.id);
await manager.spawn({ name: 'gastown', taskId: 'task-456', prompt: 'Test' });
await manager.stop(mockAgent.id);
expect(mockRepository.updateStatus).toHaveBeenCalledWith(agent.id, 'stopped');
});
it('throws for non-existent agent', async () => {
mockRepository.findById = vi.fn().mockResolvedValue(null);
await expect(manager.stop('not-found')).rejects.toThrow("Agent 'not-found' not found");
expect(mockSubprocess.kill).toHaveBeenCalledWith('SIGTERM');
expect(mockRepository.updateStatus).toHaveBeenCalledWith(mockAgent.id, 'stopped');
});
});
describe('list', () => {
it('returns all agents', async () => {
it('returns all agents with names', async () => {
const agents = await manager.list();
expect(mockRepository.findAll).toHaveBeenCalled();
expect(agents).toHaveLength(1);
expect(agents[0].id).toBe('agent-123');
expect(agents[0].name).toBe('gastown');
});
});
describe('get', () => {
it('returns agent by id', async () => {
const agent = await manager.get('agent-123');
describe('getByName', () => {
it('finds agent by name', async () => {
mockRepository.findByName = vi.fn().mockResolvedValue(mockAgent);
expect(mockRepository.findById).toHaveBeenCalledWith('agent-123');
expect(agent?.id).toBe('agent-123');
});
const agent = await manager.getByName('gastown');
it('returns null for non-existent agent', async () => {
mockRepository.findById = vi.fn().mockResolvedValue(null);
const agent = await manager.get('not-found');
expect(agent).toBeNull();
expect(mockRepository.findByName).toHaveBeenCalledWith('gastown');
expect(agent?.name).toBe('gastown');
});
});
describe('resume', () => {
it('resumes idle agent with existing session', async () => {
mockQuery.mockImplementation(async function* () {
yield { type: 'result', subtype: 'success' };
it('resumes agent waiting for input', async () => {
mockRepository.findById = vi.fn().mockResolvedValue({
...mockAgent,
status: 'waiting_for_input',
});
await manager.resume('agent-123', 'Continue work');
const mockSubprocess = {
pid: 123,
kill: vi.fn(),
then: () => Promise.resolve({ stdout: '{"type":"result","subtype":"success","session_id":"sess-123","result":"continued"}', stderr: '' }),
catch: () => mockSubprocess,
};
mockExeca.mockReturnValue(mockSubprocess as any);
expect(mockRepository.updateStatus).toHaveBeenCalledWith('agent-123', 'running');
await manager.resume(mockAgent.id, 'User response');
expect(mockExeca).toHaveBeenCalledWith('claude', [
'-p', 'User response',
'--resume', 'session-789',
'--output-format', 'json',
], expect.any(Object));
});
it('throws if agent not idle', async () => {
it('rejects if agent not waiting for input', async () => {
mockRepository.findById = vi.fn().mockResolvedValue({
...mockAgent,
status: 'running',
});
await expect(manager.resume('agent-123', 'Continue')).rejects.toThrow('is not idle');
});
it('throws if no session to resume', async () => {
mockRepository.findById = vi.fn().mockResolvedValue({
...mockAgent,
sessionId: '',
});
await expect(manager.resume('agent-123', 'Continue')).rejects.toThrow('no session to resume');
});
});
describe('getResult', () => {
it('returns result after completion', async () => {
mockQuery.mockImplementation(async function* () {
yield { type: 'system', subtype: 'init', session_id: 'sess-123' };
yield { type: 'result', subtype: 'success' };
});
const agent = await manager.spawn({ taskId: 'task-456', prompt: 'Test' });
// Wait for completion
await new Promise(resolve => setTimeout(resolve, 10));
const result = await manager.getResult(agent.id);
expect(result?.success).toBe(true);
});
it('returns null for unknown agent', async () => {
const result = await manager.getResult('unknown');
expect(result).toBeNull();
await expect(manager.resume(mockAgent.id, 'Response')).rejects.toThrow('not waiting for input');
});
});
});
```
Tests mock the Claude Agent SDK since we can't spawn real agents in tests.
Tests mock execa since we can't spawn real Claude CLI in tests.
</action>
<verify>npm test -- src/agent/manager.test.ts passes all tests</verify>
<done>ClaudeAgentManager tests pass, verifying spawn, stop, list, get, resume, and getResult</done>
<done>ClaudeAgentManager tests pass, verifying spawn with names, stop, list, getByName, resume</done>
</task>
</tasks>
@@ -635,9 +600,11 @@ Tests mock the Claude Agent SDK since we can't spawn real agents in tests.
Before declaring plan complete:
- [ ] npm run build succeeds without errors
- [ ] npm test passes all agent manager tests
- [ ] @anthropic-ai/claude-agent-sdk installed
- [ ] ClaudeAgentManager implements all AgentManager methods
- [ ] Events emitted on spawn, stop, crash, resume
- [ ] ClaudeAgentManager uses CLI with --output-format json
- [ ] Session ID extracted from CLI JSON output
- [ ] Agent names enforced (unique, used for branches)
- [ ] waiting_for_input status handled for AskUserQuestion scenarios
- [ ] Events include agent name
</verification>
<success_criteria>

View File

@@ -89,25 +89,36 @@ Add agent procedures following existing patterns:
import { z } from 'zod';
// Input schemas
// Input schemas - support lookup by name OR id
const spawnAgentInput = z.object({
name: z.string(), // Human-readable name (required)
taskId: z.string(),
prompt: z.string(),
cwd: z.string().optional(),
});
const stopAgentInput = z.object({
agentId: z.string(),
const agentIdentifier = z.object({
name: z.string().optional(), // Lookup by name (preferred)
id: z.string().optional(), // Or by ID
}).refine(data => data.name || data.id, {
message: 'Either name or id must be provided',
});
const resumeAgentInput = z.object({
agentId: z.string(),
name: z.string().optional(),
id: z.string().optional(),
prompt: z.string(),
}).refine(data => data.name || data.id, {
message: 'Either name or id must be provided',
});
const getAgentInput = z.object({
agentId: z.string(),
});
// Helper to resolve agent by name or id
async function resolveAgent(ctx: Context, input: { name?: string; id?: string }) {
if (input.name) {
return ctx.agentManager.getByName(input.name);
}
return ctx.agentManager.get(input.id!);
}
// Add to router
export const appRouter = router({
@@ -118,6 +129,7 @@ export const appRouter = router({
.input(spawnAgentInput)
.mutation(async ({ ctx, input }) => {
const agent = await ctx.agentManager.spawn({
name: input.name,
taskId: input.taskId,
prompt: input.prompt,
cwd: input.cwd,
@@ -126,10 +138,12 @@ export const appRouter = router({
}),
stopAgent: procedure
.input(stopAgentInput)
.input(agentIdentifier)
.mutation(async ({ ctx, input }) => {
await ctx.agentManager.stop(input.agentId);
return { success: true };
const agent = await resolveAgent(ctx, input);
if (!agent) throw new Error('Agent not found');
await ctx.agentManager.stop(agent.id);
return { success: true, name: agent.name };
}),
listAgents: procedure
@@ -138,22 +152,32 @@ export const appRouter = router({
}),
getAgent: procedure
.input(getAgentInput)
.input(agentIdentifier)
.query(async ({ ctx, input }) => {
return ctx.agentManager.get(input.agentId);
return resolveAgent(ctx, input);
}),
getAgentByName: procedure
.input(z.object({ name: z.string() }))
.query(async ({ ctx, input }) => {
return ctx.agentManager.getByName(input.name);
}),
resumeAgent: procedure
.input(resumeAgentInput)
.mutation(async ({ ctx, input }) => {
await ctx.agentManager.resume(input.agentId, input.prompt);
return { success: true };
const agent = await resolveAgent(ctx, input);
if (!agent) throw new Error('Agent not found');
await ctx.agentManager.resume(agent.id, input.prompt);
return { success: true, name: agent.name };
}),
getAgentResult: procedure
.input(getAgentInput)
.input(agentIdentifier)
.query(async ({ ctx, input }) => {
return ctx.agentManager.getResult(input.agentId);
const agent = await resolveAgent(ctx, input);
if (!agent) return null;
return ctx.agentManager.getResult(agent.id);
}),
});
```
@@ -171,23 +195,27 @@ Export updated AppRouter type for client.
Add CLI commands for agent management using existing tRPC client pattern:
```typescript
// Add commands to CLI
// Add commands to CLI - use NAMES as primary identifier (like gastown)
// cw agent spawn <taskId> <prompt>
// Spawns a new agent for the given task
// cw agent spawn --name <name> --task <taskId> <prompt>
// Example: cw agent spawn --name gastown --task task-123 "Fix the auth bug"
program
.command('agent spawn <taskId> <prompt>')
.command('agent spawn <prompt>')
.description('Spawn a new agent to work on a task')
.requiredOption('--name <name>', 'Human-readable name for the agent (e.g., gastown)')
.requiredOption('--task <taskId>', 'Task ID to assign to agent')
.option('--cwd <path>', 'Working directory for agent')
.action(async (taskId: string, prompt: string, options: { cwd?: string }) => {
.action(async (prompt: string, options: { name: string; task: string; cwd?: string }) => {
const client = await getTrpcClient();
try {
const agent = await client.spawnAgent.mutate({
taskId,
name: options.name,
taskId: options.task,
prompt,
cwd: options.cwd,
});
console.log(`Agent spawned: ${agent.id}`);
console.log(`Agent '${agent.name}' spawned`);
console.log(` ID: ${agent.id}`);
console.log(` Task: ${agent.taskId}`);
console.log(` Status: ${agent.status}`);
console.log(` Worktree: ${agent.worktreeId}`);
@@ -197,15 +225,16 @@ program
}
});
// cw agent stop <agentId>
// cw agent stop <name>
// Example: cw agent stop gastown
program
.command('agent stop <agentId>')
.description('Stop a running agent')
.action(async (agentId: string) => {
.command('agent stop <name>')
.description('Stop a running agent by name')
.action(async (name: string) => {
const client = await getTrpcClient();
try {
await client.stopAgent.mutate({ agentId });
console.log(`Agent ${agentId} stopped`);
const result = await client.stopAgent.mutate({ name });
console.log(`Agent '${result.name}' stopped`);
} catch (error) {
console.error('Failed to stop agent:', error);
process.exit(1);
@@ -226,7 +255,8 @@ program
}
console.log('Agents:');
for (const agent of agents) {
console.log(` ${agent.id} [${agent.status}] - Task: ${agent.taskId}`);
const status = agent.status === 'waiting_for_input' ? 'WAITING' : agent.status.toUpperCase();
console.log(` ${agent.name} [${status}] - ${agent.taskId}`);
}
} catch (error) {
console.error('Failed to list agents:', error);
@@ -234,21 +264,23 @@ program
}
});
// cw agent get <agentId>
// cw agent get <name>
// Example: cw agent get gastown
program
.command('agent get <agentId>')
.description('Get agent details')
.action(async (agentId: string) => {
.command('agent get <name>')
.description('Get agent details by name')
.action(async (name: string) => {
const client = await getTrpcClient();
try {
const agent = await client.getAgent.query({ agentId });
const agent = await client.getAgent.query({ name });
if (!agent) {
console.log(`Agent ${agentId} not found`);
console.log(`Agent '${name}' not found`);
return;
}
console.log(`Agent: ${agent.id}`);
console.log(`Agent: ${agent.name}`);
console.log(` ID: ${agent.id}`);
console.log(` Task: ${agent.taskId}`);
console.log(` Session: ${agent.sessionId}`);
console.log(` Session: ${agent.sessionId ?? '(none)'}`);
console.log(` Worktree: ${agent.worktreeId}`);
console.log(` Status: ${agent.status}`);
console.log(` Created: ${agent.createdAt}`);
@@ -259,29 +291,31 @@ program
}
});
// cw agent resume <agentId> <prompt>
// cw agent resume <name> <response>
// Example: cw agent resume gastown "Use option A"
program
.command('agent resume <agentId> <prompt>')
.description('Resume an idle agent with a new prompt')
.action(async (agentId: string, prompt: string) => {
.command('agent resume <name> <response>')
.description('Resume an agent that is waiting for input')
.action(async (name: string, response: string) => {
const client = await getTrpcClient();
try {
await client.resumeAgent.mutate({ agentId, prompt });
console.log(`Agent ${agentId} resumed`);
const result = await client.resumeAgent.mutate({ name, prompt: response });
console.log(`Agent '${result.name}' resumed`);
} catch (error) {
console.error('Failed to resume agent:', error);
process.exit(1);
}
});
// cw agent result <agentId>
// cw agent result <name>
// Example: cw agent result gastown
program
.command('agent result <agentId>')
.command('agent result <name>')
.description('Get agent execution result')
.action(async (agentId: string) => {
.action(async (name: string) => {
const client = await getTrpcClient();
try {
const result = await client.getAgentResult.query({ agentId });
const result = await client.getAgentResult.query({ name });
if (!result) {
console.log('No result available (agent may still be running)');
return;

View File

@@ -10,118 +10,64 @@
3. How do we persist agent sessions across terminal close/reopen?
4. How do we manage the process lifecycle?
## Key Finding: Claude Agent SDK
## Key Finding: Claude CLI with JSON Output
**The Claude Agent SDK** (formerly Claude Code SDK) is the correct approach. It provides:
Use **Claude CLI** with `--output-format json` flag, not the SDK. The session_id is returned in the JSON result output:
- Programmatic agent spawning via `query()` function
- Built-in tool execution (no need to implement tool loop)
- Session persistence via session IDs
- Streaming message output
- Multi-turn conversations
```json
{
"type": "result",
"subtype": "success",
"is_error": false,
"duration_ms": 7899,
"session_id": "f38b6614-d740-4441-a123-0bb3bea0d6a9",
"result": "Claude's response...",
"total_cost_usd": 0.08348025
}
```
### Installation
### CLI Invocation Pattern
```bash
npm install @anthropic-ai/claude-agent-sdk
```
# Spawn agent with JSON output
claude -p "Fix the auth bug in src/auth.ts" --output-format json
Requires Node.js 18+.
### API Patterns
**Single-turn / autonomous task (V1 - recommended for our use case):**
```typescript
import { query } from "@anthropic-ai/claude-agent-sdk";
for await (const message of query({
prompt: "Find and fix the bug in auth.py",
options: {
allowedTools: ["Read", "Edit", "Bash"],
permissionMode: "bypassPermissions" // For automation
}
})) {
if ("result" in message) console.log(message.result);
}
```
**Multi-turn sessions (V2 preview):**
```typescript
import { unstable_v2_createSession } from '@anthropic-ai/claude-agent-sdk'
await using session = unstable_v2_createSession({
model: 'claude-sonnet-4-5-20250929'
})
await session.send('Read the auth module')
for await (const msg of session.stream()) {
// Process messages
}
# Resume a session (when agent was waiting for input)
claude -p "Use option A" --resume f38b6614-d740-4441-a123-0bb3bea0d6a9 --output-format json
```
### Session Persistence (AGENT-04)
Sessions can be captured and resumed:
**Key insight:** Session ID is primarily important when an agent stops due to waiting for a question (AskUserQuestion tool) and needs to be resumed with the user's response.
```typescript
let sessionId: string | undefined;
Flow:
1. Agent runs and uses AskUserQuestion → CLI exits with session_id
2. Store session_id in database, set status to `waiting_for_input`
3. User provides response via `cw agent resume <name> "response"`
4. Resume with `claude -p "response" --resume <session_id> --output-format json`
// First query: capture session ID
for await (const message of query({
prompt: "Read the authentication module",
options: { allowedTools: ["Read", "Glob"] }
})) {
if (message.type === "system" && message.subtype === "init") {
sessionId = message.session_id; // Store this!
}
}
### Agent Names
// Later: resume with full context
for await (const message of query({
prompt: "Now find all places that call it",
options: { resume: sessionId }
})) {
// Full context preserved
}
Agents use human-readable names (e.g., "gastown", "chinatown") instead of UUIDs for better UX:
```bash
# Spawn with name
cw agent spawn --name gastown --task task-123 "Fix the auth bug"
# Reference by name
cw agent stop gastown
cw agent get gastown
cw agent resume gastown "Use option A"
```
**Decision:** Store `session_id` in database for persistence across terminal sessions.
### Key Options
| Option | Purpose |
|--------|---------|
| `allowedTools` | Array of tool names (Read, Edit, Bash, Glob, etc.) |
| `permissionMode` | "bypassPermissions" for automation, "acceptEdits" for safer mode |
| `resume` | Session ID to resume from |
| `maxTurns` | Maximum agentic turns before stopping |
| `cwd` | Working directory for agent operations |
### Message Types
| Type | Subtype | Contains |
|------|---------|----------|
| `system` | `init` | `session_id` - capture this for persistence |
| `assistant` | - | `message.content` with text blocks |
| `result` | `success`/`error` | Final result |
### Working Directory & Worktrees
The SDK accepts `cwd` option for working directory. We'll set this to the worktree path:
Each agent gets an isolated worktree. The CLI runs with `cwd` set to the worktree path:
```typescript
for await (const message of query({
prompt: task.description,
options: {
allowedTools: ["Read", "Edit", "Bash", "Glob", "Grep"],
cwd: worktree.path, // Agent operates in isolated worktree
permissionMode: "bypassPermissions"
}
})) {
// ...
}
execa('claude', ['-p', prompt, '--output-format', 'json'], {
cwd: worktree.path, // Agent operates in isolated worktree
});
```
## Architecture Decision