Codewalkers

Author	SHA1	Message	Date
Lukas May	2b62160c95	fix: Polish settings pages — display font, status tokens, spacing Apply font-display to headings across settings layout and health page. Replace text-destructive with text-status-error-fg for consistency with the design system status tokens. Increase projects page section spacing from space-y-4 to space-y-6.	2026-03-04 07:50:30 +01:00
Lukas May	7e60cbfff9	feat: Premium design overhaul — typography, atmosphere, animations, component polish - Add Plus Jakarta Sans as display font for headings - Add subtle noise texture overlay + indigo radial gradient for depth - New keyframe animations: glow-pulse, fade-in-up, scale-in, slide-in-right - Card: interactive hover-lift + selected ring variants - Button: scale micro-interactions, destructive glow, transition-all - Header: logo upgrade with wordmark, animated nav indicator bar, glass search button, gradient shadow depth - StatusDot: glow halos per status variant (active/success/error/warning/urgent) - HealthDot: glow effects for connected/disconnected/reconnecting states - Card hover-lift and status glow CSS utilities	2026-03-04 07:30:06 +01:00
Lukas May	dd86f12057	feat: Add page-level entrance animations using motion library Subtle fade-in + y-offset animations on mount for all main pages (initiatives list, initiative detail, agents, inbox) and staggered card animations for initiative and agent lists.	2026-03-04 07:28:53 +01:00
Lukas May	af092ba16a	feat: Add task execution graph within phase detail panel Tasks are now grouped by dependency depth using the same groupPhasesByDependencyLevel utility. Parallel tasks are wrapped in dashed containers, sequential layers connected by status-aware lines. Replaces the flat TaskRow list and DependencyIndicator callout bars.	2026-03-04 07:20:44 +01:00
Lukas May	9f88d5b433	feat: Replace flat phase sidebar with vertical execution graph Phases are now grouped by dependency depth using groupPhasesByDependencyLevel. Single-phase layers render as compact nodes, multi-phase layers are wrapped in a dashed "PARALLEL" container. Connectors between layers turn green when prior layers are all completed. Staggered entrance animation per layer.	2026-03-04 05:44:23 +01:00
Lukas May	6a9d9e3452	feat: Redesign task and phase dependency display in plans tab Replace plain text dependency indicators with visual, status-aware components: - New DependencyChip/PhaseNumberBadge components with status-colored styling - Sidebar shows compact numbered circles for phase deps instead of text - Detail panel uses bordered cards with phase badges and status indicators - Task dependency callout bars with resolved/total counters - Collapse mechanism for tasks with 3+ dependencies (+N more button) - Full dark mode support via semantic status tokens	2026-03-04 05:28:11 +01:00
Lukas May	9e7c246280	fix: Switch auto-spawn from discuss to refine agent, surface in UI The auto-spawned agent on initiative creation was using discuss mode (Q&A) when it should use refine mode (expand content). Now: - Description seeds root page as tiptap content (split on double newlines) - Spawns refine agent with the populated page in inputContext - getActiveRefineAgent broadened to also surface discuss agents (for CLI-spawned discuss agents) - RefineAgentPanel shows mode-appropriate label for discuss vs refine	2026-03-03 14:25:43 +01:00
Lukas May	b38a2ec034	fix: Pass task with description to auto-spawned discuss agent The discuss agent spawned on initiative creation received only the initiative in its inputContext, missing the task that carries the user's description. The agent started without knowing what to discuss.	2026-03-03 14:13:45 +01:00
Lukas May	3d04cb2081	fix: Surface active architect agents in initiative activity state Auto-spawned discuss/plan/refine agents were invisible because: 1. listInitiatives only filtered for mode='detail' agents 2. deriveInitiativeActivity returned 'idle' for zero phases before checking for active agents Broadened agent filter to all architect modes (discuss, plan, detail, refine), moved active agent check before zero-phases early return, and added 'discussing'/'refining' activity states with pulsing indicators.	2026-03-03 14:12:20 +01:00
Lukas May	e4289659cd	chore: Re-record full-flow cassettes after task dependency changes Prompt changes in detail.ts invalidated the old cassette hashes. Re-recorded all 4 cassettes with updated prompt content. Replay verified passing in 12s.	2026-03-03 13:56:47 +01:00
Lukas May	9b91ffe0e5	fix: Persist and expose task dependencies from detail output Detail agents define task dependencies in YAML frontmatter but they were silently dropped — never written to the task_dependencies table. This caused all tasks to dispatch in parallel regardless of intended ordering, and the frontend showed no dependency information. - Add fileIdToDbId mapping and second-pass dependency creation in output-handler.ts (mirrors existing phase dependency pattern) - Add task_dependency to changeset entry entityType enum - Add listPhaseTaskDependencies tRPC procedure for batch querying - Wire blockedBy in PhaseDetailPanel and PhaseWithTasks from real data - Clarify dependency semantics in detail prompt	2026-03-03 13:46:29 +01:00
Lukas May	536cdf08a1	feat: Propagate task summaries and input context to execution agents Execution agents were spawning blind — no input files, no knowledge of what predecessor tasks accomplished. This adds three capabilities: 1. summary column on tasks table — completeTask() reads the finishing agent's result.message and stores it on the task record 2. dispatchNext() gathers full initiative context (initiative, phase, sibling tasks, pages) and passes it as inputContext so agents get .cw/input/task.md, initiative.md, phase.md, and context directories 3. context/tasks/*.md files now include the summary field in frontmatter so dependent agents can see what prior agents accomplished	2026-03-03 13:42:37 +01:00
Lukas May	86a1912959	feat: Add description field and auto-spawn discuss agent on initiative creation	2026-03-03 13:40:37 +01:00
Lukas May	9edc93a268	feat: Auto-resume idle agents for inter-agent conversations When an agent asks a question via `cw ask` targeting an idle agent, the conversation router now auto-resumes the idle agent's session so it can answer. Previously, questions to idle agents sat unanswered forever because target resolution only matched running agents. Changes: - Add `resumeForConversation()` to AgentManager interface and implement on MultiProviderAgentManager (mirrors resumeForCommit pattern) - Relax createConversation target resolution: prefer running, fall back to idle (was running-only) - Trigger auto-resume after conversation creation for idle targets - Add concurrency lock (conversationResumeLocks Set) to prevent double-resume race conditions	2026-03-03 13:29:39 +01:00
Lukas May	938700d45d	feat: Make inter-agent communication prompt mode-aware Planning modes (plan, refine) get a minimal block with just cw ask syntax. Execution modes get the full protocol: commands table, shell recipe for listener lifecycle, targeting guidance, when/when-not decision criteria, good/bad examples, and answering guidelines.	2026-03-03 13:26:47 +01:00
Lukas May	3e678f2591	fix: Show detailing indicator by including detail tasks in listInitiativeTasks listInitiativeTasks was filtering out detail tasks server-side, so the detailAgentByPhase mapping could never resolve agent.taskId to a phaseId. Move the filter to client-side (displayTasks) so detail tasks are available for agent mapping but excluded from counts and display grouping.	2026-03-03 13:25:29 +01:00
Lukas May	0ab7b54ad7	feat: Show detailing status in pipeline tab phase groups Thread detail agent info through PipelineGraph → PipelineStageColumn → PipelinePhaseGroup. Phase groups now show spinner + "Detailing…" when a detail agent is active and "Review changes" when finished with no tasks.	2026-03-03 13:13:07 +01:00
Lukas May	411700d37d	feat: Show detailing status in initiative overview and phase sidebar Add 'detailing' activity state derived from active detail agents (mode=detail, status running/waiting_for_input). Initiative cards show pulsing "Detailing" indicator. Phase sidebar items show spinner during active detailing and "Review changes" when the agent finishes.	2026-03-03 13:08:05 +01:00
Lukas May	96386e1c3d	feat: Replace initiative card N+1 queries with server-computed activity indicator listInitiatives now returns an activity object (state, activePhase, phase counts) derived server-side from phases, eliminating per-card listPhases queries. Initiative cards show a StatusDot with pulse animation + label instead of a static StatusBadge. Removed redundant View and Spawn Architect buttons from cards. Added variant override prop to StatusDot.	2026-03-03 12:49:07 +01:00
Lukas May	b74b59b906	fix: Align subscription status mapping with tRPC state machine tRPC subscriptions use connecting/pending/error/idle — not success. The old code mapped pending→isConnecting and waited for success (which never fires), causing AgentOutputViewer to permanently show "Connecting...". Now: connecting→isConnecting, pending→isConnected, idle→disconnected.	2026-03-03 12:46:19 +01:00
Lukas May	c8f370583a	feat: Add codebase exploration to architect agent prompts Architect agents (discuss, plan, detail, refine) were producing generic analysis disconnected from the actual codebase. They had full tool access in their worktrees but were never instructed to explore the code. - Add CODEBASE_EXPLORATION shared constant: read project docs, explore structure, check existing patterns, use subagents for parallel exploration - Inject into all 4 architect prompts after INPUT_FILES - Strengthen discuss prompt: analysis method references codebase, examples cite specific paths, definition_of_done requires codebase references - Fix spawnArchitectDiscuss to pass full context (pages/phases/tasks) via gatherInitiativeContext() — was only passing bare initiative metadata - Update docs/agent.md with new tag ordering and shared block table	2026-03-03 12:45:14 +01:00
Lukas May	1043079a08	feat: Persist agents page filter in URL query params, default to questions	2026-03-03 12:42:32 +01:00
Lukas May	2f2ad6eb95	feat: Add remove account button to health page UI	2026-03-03 12:08:48 +01:00
Lukas May	86c6ad8be1	chore: Switch dev.sh to side-by-side split layout (server | web)	2026-03-03 12:04:23 +01:00
Lukas May	2eada071a1	fix: Use npx tsx so tsx resolves from local node_modules	2026-03-03 12:03:05 +01:00
Lukas May	0fad4a42b9	chore: Move dev.sh into workdir/ with correct working directory	2026-03-03 12:02:21 +01:00
Lukas May	8e77503941	chore: Add dev.sh tmux script to start server and frontend together	2026-03-03 11:59:34 +01:00
Lukas May	b11cae998c	refactor: Co-locate server artifacts under apps/server/ Move drizzle/, dist/, and coverage/ into apps/server/ so all server-specific artifacts live alongside the source they belong to. - git mv drizzle/ → apps/server/drizzle/ - drizzle.config.ts: out → ./apps/server/drizzle - tsconfig.json: outDir → ./apps/server/dist, exclude drizzle dir - package.json: main/bin/clean point to apps/server/dist/ - vitest.config.ts: reportsDirectory → ./apps/server/coverage - .gitignore: add coverage/ entry - ensure-schema.ts: update getMigrationsPath() for new layout - docs/database-migrations.md: update drizzle/ references	2026-03-03 11:55:12 +01:00
Lukas May	04c212da92	feat: Implement v2 design system with indigo brand, dark mode, and status tokens Complete frontend design overhaul replacing achromatic shadcn/ui defaults with an indigo-branded (#6366F1), status-aware, dark-mode-enabled token system. Phase 1 — Theme Foundation: - Replace all CSS tokens in index.css with v2 light/dark mode values - Add 24 status tokens (6 statuses × 4 variants), 22 terminal tokens, 7 diff tokens, 5 shadow tokens, 9 transition/animation tokens, 10 z-index tokens, 10-step extended indigo scale - Install Geist Sans/Mono variable fonts (public/fonts/) - Extend tailwind.config.ts with all new token utilities - Add dark mode flash-prevention script in index.html - Add status-pulse and shimmer keyframe animations - Add global focus-visible styles and reduced-motion media query Phase 2 — ThemeProvider + Toggle: - ThemeProvider context with system preference listener - 3-state ThemeToggle (Sun/Monitor/Moon) - Radix tooltip primitive for tooltips - localStorage persistence with 'cw-theme' key Phase 3 — Shared Components + Token Migration: - StatusDot: mapEntityStatus() maps raw statuses to 6 semantic variants - StatusBadge: uses status token bg/fg/border classes - Badge: 6 new status variants + xs size - EmptyState, ErrorState, SaveIndicator shared patterns - CommandPalette: Cmd+K search with fuzzy matching, keyboard nav - Skeleton with shimmer animation + SkeletonCard composite layouts - KeyboardShortcutHint, NavBadge, enhanced Sonner config - Migrate ALL hardcoded Tailwind colors to token classes across AgentOutputViewer, review/*, ProgressBar, AccountCard, InitiativeHeader, DependencyIndicator, PipelineTaskCard, PreviewPanel, ChangeSetBanner, MessageCard, PhaseDetailPanel Phase 4 — App Layout Overhaul: - Single 48px row header with CW logo, nav with NavBadge counts, Cmd+K search button, ThemeToggle, HealthDot - Remove max-w-7xl from header/main; pages control own widths - ConnectionBanner for offline/reconnecting states - BrowserTitleUpdater with running/questions counts - useGlobalKeyboard (1-4 nav, Cmd+K), useConnectionStatus hooks - Per-page width wrappers (initiatives max-w-6xl, settings max-w-4xl) Phase 5 — Page-Level Token Migration: - ReviewSidebar: all hardcoded green/orange/red → status/diff tokens - CommentThread: resolved state → status-success tokens - Settings health: green → status-success-dot	2026-03-03 11:43:09 +01:00
Lukas May	34578d39c6	refactor: Restructure monorepo to apps/server/ and apps/web/ layout Move src/ → apps/server/ and packages/web/ → apps/web/ to adopt standard monorepo conventions (apps/ for runnable apps, packages/ for reusable libraries). Update all config files, shared package imports, test fixtures, and documentation to reflect new paths. Key fixes: - Update workspace config to ["apps/", "packages/"] - Update tsconfig.json rootDir/include for apps/server/ - Add apps/web/** to vitest exclude list - Update drizzle.config.ts schema path - Fix ensure-schema.ts migration path detection (3 levels up in dev, 2 levels up in dist) - Fix tests/integration/cli-server.test.ts import paths - Update packages/shared imports to apps/server/ paths - Update all docs/ files with new paths	2026-03-03 11:22:53 +01:00
Lukas May	8c38d958ce	refactor: Remove full-flow.test.ts in favour of cassette variant The cassette-backed test (full-flow-cassette.test.ts) covers the same discuss→plan→detail→execute pipeline without API cost. The real-agent test added no unique value once cassettes were committed, and the Stage 6 npm-test validation it included was soft (warn, not fail). Also removes the now-unused shouldRunFullFlowTests export and the FULL_FLOW_TESTS=1 entry from CLAUDE.md.	2026-03-03 10:53:41 +01:00
Lukas May	25360e1711	fix: Stabilize full-flow cassette keys and restore output files on replay Three issues discovered and fixed after initial recording: 1. Agent workdir names not normalized — random animal names (e.g. "available-sheep") embedded in workspace paths caused key drift. Added AGENT_WORKDIR_RE to replace agent-workdirs/<name> with agent-workdirs/__AGENT__ in normalizer.ts. 2. Phase/task files missing on replay — plan/detail agents write output to .cw/output/ (phases/, tasks/) which the server reads on completion. The replay worker only emits JSONL; it doesn't re-execute file writes. Extended cassette format with outputFiles field and added capture (walkOutputDir) + restore (restoreOutputFiles) logic to process-manager. 3. Recording timeout too short — fixed CASSETTE_FLOW_TIMEOUT to be mode-aware: 60 min for recording runs, 5 min for replay. Also commit the 4 recorded cassettes (discuss/plan/detail/execute) that make the full-flow cassette test runnable in CI without API costs.	2026-03-03 10:35:13 +01:00
Lukas May	1e374abcd6	docs: Design review pass on all v2 wireframes 13 files reviewed with mission-control design lens. Key additions: - theme: extended indigo scale, 4-level surface hierarchy, 22 terminal tokens, transition/z-index/focus-visible token categories - All screens: keyboard shortcuts, loading/error/empty states hardened - 5 new shared components: StatusDot, SkeletonLoader, Toast, Badge, KeyboardShortcutHint - settings: expanded from 2 to 5 sub-pages (accounts, workspace, danger zone) - review-tab: 3-pane layout, inline comments, file nav, hunk controls - execution-tab: zoom, partial failure state, stale agent detection - dialogs: 2 bugs found (mutation locking, error placement) Total: 4,039 → 9,302 lines (+130% from review pass)	2026-03-02 19:36:26 +09:00
Lukas May	478a7f18e9	docs: Add v2 wireframes and theme specification 14 files in docs/wireframes/v2/ addressing 13 UX gaps from v1: - Theme spec with indigo brand, status tokens, terminal/diff tokens, dark mode, Geist typography, 6px radius, layered shadows - Wireframes for all pages with loading/error/empty states - Shared component specs (SaveIndicator, EmptyState, ErrorState, CommandPalette, ThemeToggle)	2026-03-02 18:13:17 +09:00
Lukas May	41b1d0e986	feat: Add cassette support for full-flow integration test - normalizer.ts: Add NANOID_RE (21-char alphanumeric) → __ID__ as step 2.5, fixing cassette key instability from nanoid agent IDs in prompts - harness.ts: Add FullFlowHarnessOptions.processManagerFactory for injecting CassetteProcessManager without duplicating harness setup - full-flow-cassette.test.ts: New cassette-backed variant of full-flow test; skips automatically when no cassettes exist (fresh clone), runs in ~seconds once cassettes are recorded and committed - CLAUDE.md: Document cassette recording command for the full-flow test	2026-03-02 17:42:43 +09:00
Lukas May	89db580ca4	docs: Add ASCII wireframe mockups for all frontend pages Covers: app layout, initiatives list, initiative detail (4 tabs), agents page, inbox, settings (health + projects), and all dialogs.	2026-03-02 17:28:14 +09:00
Lukas May	988160b2b7	fix: Patch full-flow test timeouts and driveToCompletion polling loop - driveToCompletion() now catches inner waitForAgentAttention timeouts instead of letting them propagate — long-running execute/detail agents (>3 min without transitioning to waiting_for_input) no longer crash the polling loop; the outer deadline handles termination correctly - Switch execute stage from waitForAgentCompletion to driveToCompletion so any clarifying questions get auto-answered - Increase DETAIL_TIMEOUT_MS 8→15 min, PLAN_TIMEOUT_MS 8→12 min, EXECUTE_TIMEOUT_MS 10→20 min — architect agents are variable in practice; these are upper bounds not expectations - Raise FULL_FLOW_TIMEOUT 30→60 min to cover worst-case stacking - Update CLAUDE.md test command with correct --test-timeout=3600000 Verified: full pipeline (discuss→plan→detail→execute) passes in ~499s	2026-03-02 17:15:12 +09:00
Lukas May	76aca71705	refactor: Restructure agent prompts with XML tags Replace ## Heading sections with descriptive XML tags (<role>, <task>, <execution_protocol>, <examples>, etc.) for unambiguous first-order vs second-order delimiter separation per Anthropic best practices. - shared.ts: All constants wrapped in their XML tag - Mode prompts: Consistent tag vocabulary and ordering across all 5 modes - Examples use <examples> > <example label="good/bad"> nesting - workspace.ts: Output wrapped in <workspace> tags - Delete dead src/agent/prompts.ts (zero imports) - Update docs/agent.md with XML tag documentation	2026-03-02 14:15:28 +09:00
Lukas May	55eb6a494b	test: Add full-flow integration test (discuss→plan→detail→execute) Adds a complete multi-agent workflow test gated behind FULL_FLOW_TESTS=1: - src/test/fixtures/todo-api/ — minimal JS project with missing complete() method and failing tests; gives execute agents a concrete, verifiable task - src/test/integration/full-flow/harness.ts — FullFlowHarness wiring all 11 repos + real MultiProviderAgentManager + tRPC caller + driveToCompletion() helper for Q&A loops - src/test/integration/full-flow/report.ts — stage-by-stage console formatters (discuss/plan/detail/execute/git diff/final summary) - src/test/integration/full-flow/full-flow.test.ts — staged integration test that validates breakdown granularity, agent output quality, and that npm test passes in the project worktree after execution Run with: FULL_FLOW_TESTS=1 npm test -- src/test/integration/full-flow/ --test-timeout=1800000	2026-03-02 13:28:23 +09:00
Lukas May	1540039c52	test: Remove redundant and dead tests (-743 lines) Delete 3 files: - completion-detection.test.ts (private method tests, covered by crash-race-condition) - completion-race-condition.test.ts (covered by mutex-completion + crash-race-condition) - real-e2e-crash.test.ts (dead: expect(true).toBe(true), hardcoded paths) Remove individual tests: - crash-race-condition.test.ts #4 (weaker duplicate of #2) - mock-manager.test.ts duplicate "(second test)" for detail_complete - process-manager.test.ts 2 "logs comprehensive" tests with empty assertions - edge-cases.test.ts 2 Q&A tests redundant with recovery-scenarios Update test-inventory.md to reflect removals.	2026-03-02 12:57:27 +09:00
Lukas May	a2ab4c4a84	docs: Add comprehensive test inventory with coverage gaps and redundancy map Audited all 44 test files one by one. Documents what each test verifies, identifies 12 redundant test pairs, 13 coverage gaps (prioritized), fragility assessment, and mock style inconsistencies.	2026-03-02 12:23:39 +09:00
Lukas May	e9ec5143fd	docs: Document cassette testing system in docs/testing.md and CLAUDE.md	2026-03-02 12:22:46 +09:00
Lukas May	ec031211a2	fix: Resolve advanceTimers return type mismatch (Promise<VitestUtils> → Promise<void>)	2026-03-02 12:19:47 +09:00
Lukas May	0ed657b644	feat: Add VCR-style cassette testing system for agent subprocess pipeline Implements cassette recording/replay to test the full agent execution pipeline (ProcessManager → FileTailer → OutputHandler → SignalManager) without real AI API calls. Key components: - `CassetteProcessManager`: extends ProcessManager, intercepts spawnDetached to replay cassettes or record real runs on completion - `replay-worker.mjs`: standalone node script that replays JSONL + signal.json as a subprocess, exercising the complete file-based output pipeline - `CassetteStore`: reads/writes cassette JSON files keyed by SHA256 hash - `normalizer.ts`: strips dynamic content (UUIDs, temp paths, timestamps, session numbers) from prompts for stable cassette keys - `key.ts`: hashes normalized prompt + provider args + worktree file content (worktree hash detects content drift for execute-mode agents) - `createCassetteHarness()`: wraps RealProviderHarness with cassette support, same interface so existing real-provider tests work unchanged Mode control via env vars: (default) → replay: cassette must exist (safe for CI) CW_CASSETTE_RECORD=1 → auto: replay if exists, record if missing CW_CASSETTE_FORCE_RECORD=1 → record: always run real agent, overwrite cassette MultiProviderAgentManager gains an optional `processManagerOverride` constructor parameter for clean dependency injection without changing existing callers. Cassette files live in src/test/cassettes/ and are intended to be committed to git so CI runs without API access.	2026-03-02 12:17:52 +09:00
Lukas May	a1366efe4d	refactor: Standardize fake timer usage across E2E tests - Add withFakeTimers(fn) helper to TestHarness for scoped timer control - Replace all vi.runAllTimersAsync() with harness.advanceTimers() in E2E and harness tests (37 call sites across 5 files) - Keep vi.useFakeTimers() per-test activation pattern (intentional)	2026-03-02 12:08:24 +09:00
Lukas May	dcb855ede1	fix: Repair test harness coverage, excludes, and timer overhead - Add @vitest/coverage-v8 dep so `npm run test:coverage` actually works - Add exclude patterns to vitest config (node_modules, dist, packages) - Replace dynamic import('vitest') in advanceTimers with direct vi import	2026-03-02 12:01:16 +09:00
Lukas May	863117c63a	fix: Detach agents before initiative deletion to prevent FK constraint failure Nulls out agents.initiativeId before deleting the initiative row, ensuring the delete succeeds even on databases where migration 0025 (which adds ON DELETE SET NULL to the FK) hasn't been applied.	2026-02-18 18:35:06 +09:00
Lukas May	6fa025251e	feat: Wire up initiative deletion end-to-end Add deleteInitiative tRPC procedure, wire Delete button in InitiativeCard with confirm dialog (Shift+click bypass), remove unused onDelete prop chain. Fix agents table FK constraints (initiative_id, account_id missing ON DELETE SET NULL) via table recreation migration. Register conversations migration in journal. Expand cascade delete tests to cover pages, projects, change sets, agents (set null), and conversations (set null).	2026-02-18 17:54:53 +09:00
Lukas May	80aa3e42fb	Fix StatusBadge crash when status is undefined	2026-02-18 17:44:38 +09:00
Lukas May	8bece70a61	fix: Wire archive button to updateInitiative mutation The Archive menu item in InitiativeCard had no onClick handler. Added mutation call with confirmation dialog (shift+click to skip).	2026-02-18 17:44:01 +09:00

763 Commits