chore: bootstrap Harness Foundation mission (Phase 9) (#289)

Co-authored-by: Jason Woltje <jason@diversecanvas.com>
Co-committed-by: Jason Woltje <jason@diversecanvas.com>
2026-03-21 20:10:48 +00:00
parent d06866f501
commit 36095ad80f
4 changed files with 556 additions and 140 deletions
--- a/docs/TASKS.md
+++ b/docs/TASKS.md
@@ -1,100 +1,74 @@
-# Tasks — MVP
+# Tasks — Harness Foundation

 > Single-writer: orchestrator only. Workers read but never modify.
 >
 > **`agent` column values:** `codex` | `sonnet` | `haiku` | `glm-5` | `opus` | `—` (auto/default)
 > Pipeline crons pick the cheapest capable model. Override with a specific value when a task genuinely needs it.
-> Examples: `opus` for major architecture decisions, `codex` for pure coding, `haiku` for review/verify gates, `glm-5` for cost-sensitive coding.

-| id     | status | agent   | milestone                                                                                          | description                                                  | pr            | notes |
-| ------ | ------ | ------- | -------------------------------------------------------------------------------------------------- | ------------------------------------------------------------ | ------------- | ----- |
-| P0-001 | done   | Phase 0 | Scaffold monorepo                                                                                  | #60                                                          | #1            |
-| P0-002 | done   | Phase 0 | @mosaic/types — migrate and extend shared types                                                    | #65                                                          | #2            |
-| P0-003 | done   | Phase 0 | @mosaic/db — Drizzle schema and PG connection                                                      | #67                                                          | #3            |
-| P0-004 | done   | Phase 0 | @mosaic/auth — BetterAuth email/password setup                                                     | #68                                                          | #4            |
-| P0-005 | done   | Phase 0 | Docker Compose — PG 17, Valkey 8, SigNoz                                                           | #65                                                          | #5            |
-| P0-006 | done   | Phase 0 | OTEL foundation — OpenTelemetry SDK setup                                                          | #65                                                          | #6            |
-| P0-007 | done   | Phase 0 | CI pipeline — Woodpecker config                                                                    | #69                                                          | #7            |
-| P0-008 | done   | Phase 0 | Project docs — AGENTS.md, CLAUDE.md, README                                                        | #69                                                          | #8            |
-| P0-009 | done   | Phase 0 | Verify Phase 0 — CI green, all packages build                                                      | #70                                                          | #9            |
-| P1-001 | done   | Phase 1 | apps/gateway scaffold — NestJS + Fastify adapter                                                   | #61                                                          | #10           |
-| P1-002 | done   | Phase 1 | Auth middleware — BetterAuth session validation                                                    | #71                                                          | #11           |
-| P1-003 | done   | Phase 1 | @mosaic/brain — migrate from v0, PG backend                                                        | #71                                                          | #12           |
-| P1-004 | done   | Phase 1 | @mosaic/queue — migrate from v0                                                                    | #71                                                          | #13           |
-| P1-005 | done   | Phase 1 | Gateway routes — conversations CRUD + messages                                                     | #72                                                          | #14           |
-| P1-006 | done   | Phase 1 | Gateway routes — tasks, projects, missions CRUD                                                    | #72                                                          | #15           |
-| P1-007 | done   | Phase 1 | WebSocket server — chat streaming                                                                  | #61                                                          | #16           |
-| P1-008 | done   | Phase 1 | Basic agent dispatch — single provider                                                             | #61                                                          | #17           |
-| P1-009 | done   | Phase 1 | Verify Phase 1 — gateway functional, API tested                                                    | #73                                                          | #18           |
-| P2-001 | done   | Phase 2 | @mosaic/agent — Pi SDK integration + agent pool                                                    | #61                                                          | #19           |
-| P2-002 | done   | Phase 2 | Multi-provider support — Anthropic + Ollama                                                        | #74                                                          | #20           |
-| P2-003 | done   | Phase 2 | Agent routing engine — cost/capability matrix                                                      | #75                                                          | #21           |
-| P2-004 | done   | Phase 2 | Tool registration — brain, queue, memory tools                                                     | #76                                                          | #22           |
-| P2-005 | done   | Phase 2 | @mosaic/coord — migrate from v0, gateway integration                                               | #77                                                          | #23           |
-| P2-006 | done   | Phase 2 | Agent session management — tmux + monitoring                                                       | #78                                                          | #24           |
-| P2-007 | done   | Phase 2 | Verify Phase 2 — multi-provider routing works                                                      | #79                                                          | #25           |
-| P3-001 | done   | Phase 3 | apps/web scaffold — Next.js 16 + BetterAuth + Tailwind                                             | #82                                                          | #26           |
-| P3-002 | done   | Phase 3 | Auth pages — login, registration, SSO redirect                                                     | #83                                                          | #27           |
-| P3-003 | done   | Phase 3 | Chat UI — conversations, messages, streaming                                                       | #84                                                          | #28           |
-| P3-004 | done   | Phase 3 | Task management — list view + kanban board                                                         | #86                                                          | #29           |
-| P3-005 | done   | Phase 3 | Project & mission views — dashboard + PRD viewer                                                   | #87                                                          | #30           |
-| P3-006 | done   | Phase 3 | Settings — provider config, profile, integrations                                                  | #88                                                          | #31           |
-| P3-007 | done   | Phase 3 | Admin panel — user management, RBAC                                                                | #89                                                          | #32           |
-| P3-008 | done   | Phase 3 | Verify Phase 3 — web dashboard functional E2E                                                      | —                                                            | #33           |
-| P4-001 | done   | Phase 4 | @mosaic/memory — preference + insight stores                                                       | —                                                            | #34           |
-| P4-002 | done   | Phase 4 | Semantic search — pgvector embeddings + search API                                                 | —                                                            | #35           |
-| P4-003 | done   | Phase 4 | @mosaic/log — log ingest, parsing, tiered storage                                                  | —                                                            | #36           |
-| P4-004 | done   | Phase 4 | Summarization pipeline — Haiku-tier LLM + cron                                                     | —                                                            | #37           |
-| P4-005 | done   | Phase 4 | Memory integration — inject into agent sessions                                                    | —                                                            | #38           |
-| P4-006 | done   | Phase 4 | Skill management — catalog, install, config                                                        | —                                                            | #39           |
-| P4-007 | done   | Phase 4 | Verify Phase 4 — memory + log pipeline working                                                     | —                                                            | #40           |
-| P5-001 | done   | Phase 5 | Plugin host — gateway plugin loading + channel interface                                           | —                                                            | #41           |
-| P5-002 | done   | Phase 5 | @mosaic/discord-plugin — Discord bot + channel plugin                                              | #61                                                          | #42           |
-| P5-003 | done   | Phase 5 | @mosaic/telegram-plugin — Telegraf bot + channel plugin                                            | —                                                            | #43           |
-| P5-004 | done   | Phase 5 | SSO — Authentik OIDC adapter end-to-end                                                            | —                                                            | #44           |
-| P5-005 | done   | Phase 5 | Verify Phase 5 — Discord + Telegram + SSO working                                                  | #99                                                          | #45           |
-| P6-001 | done   | Phase 6 | @mosaic/cli — unified CLI binary + subcommands                                                     | #104                                                         | #46           |
-| P6-002 | done   | Phase 6 | @mosaic/prdy — migrate PRD wizard from v0                                                          | #101                                                         | #47           |
-| P6-003 | done   | Phase 6 | @mosaic/quality-rails — migrate scaffolder from v0                                                 | #100                                                         | #48           |
-| P6-004 | done   | Phase 6 | @mosaic/mosaic — install wizard for v1                                                             | #103                                                         | #49           |
-| P6-005 | done   | Phase 6 | Pi TUI integration — mosaic tui                                                                    | #61                                                          | #50           |
-| P6-006 | done   | Phase 6 | Verify Phase 6 — CLI functional, all subcommands                                                   | —                                                            | #51           |
-| P7-009 | done   | Phase 7 | Web chat — WebSocket integration, streaming, conversation switching                                | #136                                                         | #120 W1 done  |
-| P7-001 | done   | Phase 7 | MCP endpoint hardening — streamable HTTP transport                                                 | #137                                                         | #52 W1 done   |
-| P7-010 | done   | Phase 7 | Web conversation management — list, search, rename, delete, archive                                | #139                                                         | #121 W2 done  |
-| P7-015 | done   | Phase 7 | Agent tool expansion — file ops, git, shell exec, web fetch                                        | #138                                                         | #126 W2 done  |
-| P7-011 | done   | Phase 7 | Web project detail views — missions, tasks, PRDs, dashboards                                       | #140                                                         | #122 W3 done  |
-| P7-016 | done   | Phase 7 | MCP client — gateway connects to external MCP servers as tools                                     | #141                                                         | #127 W3 done  |
-| P7-012 | done   | Phase 7 | Web provider management UI — add, configure, test LLM providers                                    | #142                                                         | #123 W4 done  |
-| P7-017 | done   | Phase 7 | Agent skill invocation — load and execute skills from catalog                                      | #143                                                         | #128 W4 done  |
-| P7-013 | done   | Phase 7 | Web settings persistence — profile, preferences save to DB                                         | #145                                                         | #124 W5 done  |
-| P7-018 | done   | Phase 7 | CLI model/provider switching — --model, --provider, /model in TUI                                  | #144                                                         | #129 W5 done  |
-| P7-014 | done   | Phase 7 | Web admin panel — user CRUD, role assignment, system health                                        | #150                                                         | #125 W6 done  |
-| P7-019 | done   | Phase 7 | CLI session management — list, resume, destroy sessions                                            | #146                                                         | #130 W6 done  |
-| P7-020 | done   | Phase 7 | Coord DB migration — project-scoped missions, multi-tenant RBAC                                    | #149                                                         | #131 W7 done  |
-| FIX-02 | done   | Backlog | TUI agent:end — fix React state updater side-effect                                                | #147                                                         | #133 W8 done  |
-| FIX-03 | done   | Backlog | Agent session — cwd sandbox, system prompt, tool restrictions                                      | #148                                                         | #134 W8 done  |
-| P7-004 | done   | Phase 7 | E2E test suite — Playwright critical paths                                                         | #152                                                         | #55 W9 done   |
-| P7-006 | done   | Phase 7 | Documentation — user guide, admin guide, dev guide                                                 | #151                                                         | #57 W9 done   |
-| P7-007 | done   | Phase 7 | Bare-metal deployment docs + .env.example                                                          | #153                                                         | #58 W9 done   |
-| P7-021 | done   | Phase 7 | Verify Phase 7 — feature-complete platform E2E                                                     | —                                                            | #132 W10 done |
-| P8-005 | done   | Phase 8 | CLI command architecture — DB schema + brain repo + gateway endpoints                              | #158                                                         |               |
-| P8-006 | done   | Phase 8 | CLI command architecture — agent, mission, prdy commands + TUI mods                                | #158                                                         |               |
-| P8-007 | done   | Phase 8 | DB migrations — preferences.mutable + teams + team_members + projects.teamId                       | #175                                                         | #160          |
-| P8-008 | done   | Phase 8 | @mosaic/types — CommandDef, CommandManifest, new socket events                                     | #174                                                         | #161          |
-| P8-009 | done   | Phase 8 | TUI Phase 1 — slash command parsing, local commands, system message rendering, InputBar wiring     | #176                                                         | #162          |
-| P8-010 | done   | Phase 8 | Gateway Phase 2 — CommandRegistryService, CommandExecutorService, socket + REST commands           | #178                                                         | #163          |
-| P8-011 | done   | Phase 8 | Gateway Phase 3 — PreferencesService, /preferences REST, /system Valkey override, prompt injection | #180                                                         | #164          |
-| P8-012 | done   | Phase 8 | Gateway Phase 4 — /agent, /provider (URL+clipboard), /mission, /prdy, /tools commands              | #181                                                         | #165          |
-| P8-013 | done   | Phase 8 | Gateway Phase 5 — MosaicPlugin lifecycle, ReloadService, hot reload, system:reload TUI             | #182                                                         | #166          |
-| P8-014 | done   | Phase 8 | Gateway Phase 6 — SessionGCService (all tiers), /gc command, cron integration                      | #179                                                         | #167          |
-| P8-015 | done   | Phase 8 | Gateway Phase 7 — WorkspaceService, ProjectBootstrapService, teams project ownership               | #183                                                         | #168          |
-| P8-016 | done   | Phase 8 | Security — file/git/shell tool strict path hardening, sandbox escape prevention                    | #177                                                         | #169          |
-| P8-017 | done   | Phase 8 | TUI Phase 8 — autocomplete sidebar, fuzzy match, arg hints, up-arrow history                       | #184                                                         | #170          |
-| P8-018 | done   | Phase 8 | Spin-off plan stubs — Gatekeeper, Task Queue Unification, Chroot Sandboxing                        | —                                                            | #171          |
-| P8-019 | done   | Phase 8 | Verify Platform Architecture — integration + E2E verification                                      | #185                                                         | #172          |
-| P8-001 | done   | codex   | Phase 8                                                                                            | Additional SSO providers — WorkOS + Keycloak                 | #220          | #53   |
-| P8-002 | done   | codex   | Phase 8                                                                                            | Additional LLM providers — Codex, Z.ai, LM Studio, llama.cpp | #212          | #54   |
-| P8-003 | done   | codex   | Phase 8                                                                                            | Performance optimization                                     | #211          | #56   |
-| P8-004 | done   | haiku   | Phase 8                                                                                            | Beta release gate — v0.1.0 tag                               | —             | #59   |
-| FIX-01 | done   | Backlog | Call piSession.dispose() in AgentService.destroySession                                            | #78                                                          | #62           |
+| id     | status      | agent  | milestone          | description                                                                                                     | pr  | notes                     |
+| ------ | ----------- | ------ | ------------------ | --------------------------------------------------------------------------------------------------------------- | --- | ------------------------- |
+| M1-001 | not-started | sonnet | M1: Persistence    | Wire ChatGateway.handleMessage() → ConversationsRepo.addMessage() for user messages                             | —   | #224                      |
+| M1-002 | not-started | sonnet | M1: Persistence    | Wire agent event relay → ConversationsRepo.addMessage() for assistant responses (text, tool calls, thinking)    | —   | #225                      |
+| M1-003 | not-started | sonnet | M1: Persistence    | Store message metadata: model used, provider, token counts, tool call details, timestamps                       | —   | #226                      |
+| M1-004 | not-started | sonnet | M1: Persistence    | On session resume, load message history from DB and inject into Pi session context                              | —   | #227                      |
+| M1-005 | not-started | opus   | M1: Persistence    | Context window management: summarize older messages when history exceeds 80% of model context                   | —   | #228                      |
+| M1-006 | not-started | sonnet | M1: Persistence    | Conversation search: full-text search on messages table via /api/conversations/search                           | —   | #229                      |
+| M1-007 | not-started | sonnet | M1: Persistence    | TUI: /history command to display conversation message count and context usage                                   | —   | #230                      |
+| M1-008 | not-started | haiku  | M1: Persistence    | Verify: send messages → kill TUI → resume with -c → agent references prior context                              | —   | #231                      |
+| M2-001 | not-started | sonnet | M2: Security       | Audit InsightsRepo: add userId filter to searchByEmbedding() vector search                                      | —   | #232                      |
+| M2-002 | not-started | sonnet | M2: Security       | Audit InsightsRepo: add userId filter to findByUser(), decayOldInsights()                                       | —   | #233                      |
+| M2-003 | not-started | sonnet | M2: Security       | Audit PreferencesRepo: verify all queries filter by userId                                                      | —   | #234                      |
+| M2-004 | not-started | sonnet | M2: Security       | Audit agent memory tools: verify `memory_search`, `memory_save_*`, `memory_get_*` scope to session user         | —   | #235                      |
+| M2-005 | not-started | sonnet | M2: Security       | Audit ConversationsRepo: verify ownership check on findById, update, delete, addMessage, findMessages           | —   | #236                      |
+| M2-006 | not-started | sonnet | M2: Security       | Audit AgentsRepo: verify findAccessible() returns only user's agents + system agents                            | —   | #237                      |
+| M2-007 | not-started | sonnet | M2: Security       | Integration test: create two users, populate data, verify cross-user isolation on every query path              | —   | #238 TDD                  |
+| M2-008 | not-started | sonnet | M2: Security       | Audit Valkey keys: verify session keys include userId or are not enumerable across users                        | —   | #239                      |
+| M3-001 | not-started | opus   | M3: Providers      | Refactor ProviderService into IProviderAdapter pattern: register(), listModels(), healthCheck(), createClient() | —   | #240 Verify Pi SDK compat |
+| M3-002 | not-started | sonnet | M3: Providers      | Anthropic adapter: @anthropic-ai/sdk, Claude Sonnet 4.6 + Opus 4.6 + Haiku 4.5, OAuth + API key                 | —   | #241                      |
+| M3-003 | not-started | sonnet | M3: Providers      | OpenAI adapter: openai SDK, Codex gpt-5.4, OAuth + API key                                                      | —   | #242                      |
+| M3-004 | not-started | sonnet | M3: Providers      | OpenRouter adapter: OpenAI-compatible client, API key, dynamic model list from /api/v1/models                   | —   | #243                      |
+| M3-005 | not-started | sonnet | M3: Providers      | Z.ai GLM adapter: GLM-5, API key, research API format                                                           | —   | #244                      |
+| M3-006 | not-started | sonnet | M3: Providers      | Ollama adapter: refactor existing integration into adapter pattern, add embedding model support                 | —   | #245                      |
+| M3-007 | not-started | sonnet | M3: Providers      | Provider health check: periodic probe, configurable interval, status per provider, /api/providers/health        | —   | #246                      |
+| M3-008 | not-started | sonnet | M3: Providers      | Model capability matrix: per-model metadata (tier, context window, tool support, vision, streaming, embedding)  | —   | #247                      |
+| M3-009 | not-started | sonnet | M3: Providers      | Refactor EmbeddingService: provider-agnostic interface, Ollama default (nomic-embed-text or mxbai-embed-large)  | —   | #248 Dim migration        |
+| M3-010 | not-started | sonnet | M3: Providers      | OAuth token storage: persist provider tokens per user in DB (encrypted), refresh flow                           | —   | #249                      |
+| M3-011 | not-started | sonnet | M3: Providers      | Provider config UI support: /api/providers CRUD for user-scoped provider credentials                            | —   | #250                      |
+| M3-012 | not-started | haiku  | M3: Providers      | Verify: each provider connects, lists models, completes chat request, handles errors                            | —   | #251                      |
+| M4-001 | not-started | opus   | M4: Routing        | Define routing rule schema: RoutingRule { name, priority, conditions[], action } stored in DB                   | —   | #252 DB migration         |
+| M4-002 | not-started | opus   | M4: Routing        | Condition types: taskType, complexity, domain, costTier, requiredCapabilities                                   | —   | #253                      |
+| M4-003 | not-started | opus   | M4: Routing        | Action types: routeTo { provider, model, agentConfigId?, systemPromptOverride?, toolAllowlist? }                | —   | #254                      |
+| M4-004 | not-started | sonnet | M4: Routing        | Default routing rules seed data: coding→Opus, Q&A→Sonnet, summarization→GLM-5, research→Codex, offline→Ollama   | —   | #255                      |
+| M4-005 | not-started | opus   | M4: Routing        | Task classification: infer taskType + complexity from user message (regex/keyword first, LLM-assisted later)    | —   | #256                      |
+| M4-006 | not-started | opus   | M4: Routing        | Routing decision pipeline: classify → match rules → check health → fallback chain → return result               | —   | #257                      |
+| M4-007 | not-started | sonnet | M4: Routing        | Routing override: /model forces specific model regardless of routing rules                                      | —   | #258                      |
+| M4-008 | not-started | sonnet | M4: Routing        | Routing transparency: include routing decision in session:info event (model + reason)                           | —   | #259                      |
+| M4-009 | not-started | sonnet | M4: Routing        | Routing rules CRUD: /api/routing/rules — list, create, update, delete, reorder priority                         | —   | #260                      |
+| M4-010 | not-started | sonnet | M4: Routing        | Per-user routing overrides: users customize default rules for their sessions                                    | —   | #261                      |
+| M4-011 | not-started | sonnet | M4: Routing        | Agent specialization: agents declare capabilities in config (domains, preferred models, tool sets)              | —   | #262                      |
+| M4-012 | not-started | sonnet | M4: Routing        | Routing integration: wire into ChatGateway — every message triggers routing before agent dispatch               | —   | #263                      |
+| M4-013 | not-started | haiku  | M4: Routing        | Verify: coding→Opus, summarize→GLM-5, simple→Haiku, override via /model works                                   | —   | #264                      |
+| M5-001 | not-started | sonnet | M5: Sessions       | Wire ChatGateway: on session create, load agent config from DB (system prompt, model, provider, tools, skills)  | —   | #265                      |
+| M5-002 | not-started | sonnet | M5: Sessions       | /model command: end-to-end wiring — TUI → socket → gateway switches provider/model → new messages use it        | —   | #266                      |
+| M5-003 | not-started | sonnet | M5: Sessions       | /agent command: switch agent config mid-session — loads new system prompt, tools, default model                 | —   | #267                      |
+| M5-004 | not-started | sonnet | M5: Sessions       | Session ↔ conversation binding: persist sessionId on conversation record, resume via conversationId             | —   | #268                      |
+| M5-005 | not-started | sonnet | M5: Sessions       | Session info broadcast: on model/agent switch, emit session:info with updated state                             | —   | #269                      |
+| M5-006 | not-started | sonnet | M5: Sessions       | Agent creation from TUI: /agent new command creates agent config via gateway API                                | —   | #270                      |
+| M5-007 | not-started | sonnet | M5: Sessions       | Session metrics: per-session token usage, model switches, duration — persist in DB                              | —   | #271                      |
+| M5-008 | not-started | haiku  | M5: Sessions       | Verify: /model switches model, /agent switches agent, session resume loads config                               | —   | #272                      |
+| M6-001 | not-started | sonnet | M6: Jobs           | Add BullMQ dependency, configure with Valkey connection                                                         | —   | #273 Test compat first    |
+| M6-002 | not-started | sonnet | M6: Jobs           | Create queue service: typed job definitions, worker registration, error handling with exponential backoff       | —   | #274                      |
+| M6-003 | not-started | sonnet | M6: Jobs           | Migrate summarization cron → BullMQ repeatable job                                                              | —   | #275                      |
+| M6-004 | not-started | sonnet | M6: Jobs           | Migrate GC (session cleanup) → BullMQ repeatable job                                                            | —   | #276                      |
+| M6-005 | not-started | sonnet | M6: Jobs           | Migrate tier management (log archival) → BullMQ repeatable job                                                  | —   | #277                      |
+| M6-006 | not-started | sonnet | M6: Jobs           | Admin jobs API: GET /api/admin/jobs — list, status, retry, pause/resume queues                                  | —   | #278                      |
+| M6-007 | not-started | sonnet | M6: Jobs           | Job event logging: emit job start/complete/fail events to agent_logs                                            | —   | #279                      |
+| M6-008 | not-started | haiku  | M6: Jobs           | Verify: jobs execute on schedule, failure retries with backoff, admin endpoint shows history                    | —   | #280                      |
+| M7-001 | not-started | opus   | M7: Channel Design | Define IChannelAdapter interface: lifecycle, message flow, identity mapping                                     | —   | #281 Architecture         |
+| M7-002 | not-started | opus   | M7: Channel Design | Define channel message protocol: canonical format all adapters translate to/from                                | —   | #282 Architecture         |
+| M7-003 | not-started | opus   | M7: Channel Design | Design Matrix integration: appservice, room↔conversation, space↔team, agent ghosts, power levels                | —   | #283 Architecture         |
+| M7-004 | not-started | opus   | M7: Channel Design | Design conversation multiplexing: same conversation from TUI+WebUI+Matrix, real-time sync                       | —   | #284 Architecture         |
+| M7-005 | not-started | opus   | M7: Channel Design | Design remote auth bridging: Matrix/Discord auth → Mosaic identity (token linking, OAuth bridge)                | —   | #285 Architecture         |
+| M7-006 | not-started | opus   | M7: Channel Design | Design agent-to-agent communication via Matrix rooms: room per agent pair, human observation                    | —   | #286 Architecture         |
+| M7-007 | not-started | opus   | M7: Channel Design | Design multi-user isolation in Matrix: space-per-team, room visibility, encryption, admin access                | —   | #287 Architecture         |
+| M7-008 | not-started | haiku  | M7: Channel Design | Publish docs/architecture/channel-protocol.md — reviewed and approved                                           | —   | #288                      |