stack

Author	SHA1	Message	Date
Jason Woltje	eba04fb264	feat(#150 ): Implement OrchestrationLoop class (TDD - GREEN phase) Implement the main orchestration loop that coordinates all components: - Queue processing with priority sorting (issues by number) - Integration with ContextMonitor for tracking agent context usage - Integration with QualityOrchestrator for running quality gates - Integration with ForcedContinuationService for rejection prompts - Metrics tracking (processed_count, success_count, rejection_count) - Graceful start/stop with proper lifecycle management - Error handling at all levels (spawn, context, quality, continuation) The OrchestrationLoop flow: 1. Read issue queue (priority sorted by issue number) 2. Mark issue as in progress 3. Spawn agent (stub implementation for Phase 0) 4. Check context usage via ContextMonitor 5. Run quality gates via QualityOrchestrator 6. On approval: mark complete, increment success count 7. On rejection: generate continuation prompt, increment rejection count 99% test coverage for coordinator.py (183 statements, 2 missed). Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 20:22:00 -06:00
Jason Woltje	5cd2ff6c13	test(#150 ): Add tests for orchestration loop (TDD - RED phase) Add comprehensive test suite for OrchestrationLoop class that integrates: - Queue processing with priority sorting - Agent assignment (50% rule) - Quality gate verification on completion claims - Rejection handling with forced continuation prompts - Context monitoring during agent execution - Lifecycle management (start/stop) - Error handling for all edge cases - Metrics tracking (processed, success, rejection counts) 33 new tests covering all acceptance criteria. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 20:21:51 -06:00
Jason Woltje	ac3f5c1af9	test(#149 ): Add comprehensive rejection loop integration tests Add integration tests validating rejection loop behavior: - Agent claims done with failing tests → rejection + forced continuation - Agent claims done with linting errors → rejection + forced continuation - Agent claims done with low coverage → rejection + forced continuation - Agent claims done with build errors → rejection + forced continuation - All gates passing → completion allowed - Multiple simultaneous failures → comprehensive rejection - Continuation prompts are non-negotiable and directive - Agents cannot bypass quality gates - Remediation steps included in prompts All 9 tests pass. Build gate: passes Lint gate: passes Test gate: passes (100% pass rate) Coverage: quality_orchestrator.py at 85%, forced_continuation.py at 100% Refs #149 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 20:11:15 -06:00
Jason Woltje	28d0e4b1df	fix(#148 ): Fix linting violations in quality orchestrator tests Fixed code review findings: - Removed unused imports (AsyncMock, MagicMock) - Fixed line length violation in test_forced_continuation.py All 15 tests still passing after fixes.	2026-02-01 20:07:19 -06:00
Jason Woltje	324c6b71d8	feat(#148 ): Implement Quality Orchestrator and Forced Continuation services Implements COORD-008 - Build Quality Orchestrator service that intercepts completion claims and enforces quality gates. Quality Orchestrator (quality_orchestrator.py): - Runs all quality gates (build, lint, test, coverage) in parallel using asyncio - Aggregates gate results into VerificationResult model - Determines overall pass/fail status - Handles gate exceptions gracefully - Uses dependency injection for testability - 87% test coverage (exceeds 85% minimum) Forced Continuation Service (forced_continuation.py): - Generates non-negotiable continuation prompts for gate failures - Provides actionable remediation steps for each failed gate - Includes specific error details and coverage gaps - Blocks completion until all gates pass - 100% test coverage Tests: - 6 tests for QualityOrchestrator covering: - All gates passing scenario - Single/multiple/all gates failing scenarios - Parallel gate execution verification - Exception handling - 9 tests for ForcedContinuationService covering: - Individual gate failure prompts (build, lint, test, coverage) - Multiple simultaneous failures - Actionable details inclusion - Error handling for invalid states Quality Gates: ✅ Build: mypy passes (no type errors) ✅ Lint: ruff passes (no violations) ✅ Test: 15/15 tests pass (100% pass rate) ✅ Coverage: 87% quality_orchestrator, 100% forced_continuation (exceeds 85%) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 20:04:26 -06:00
Jason Woltje	38da576b69	fix(#147 ): Fix linting violations in quality gate tests Fixed code review findings: - Removed unused mock_run variables (6 instances) - Fixed line length violations (3 instances) - All ruff checks now pass All 36 tests still passing after fixes. Quality gates: BuildGate, LintGate, TestGate, CoverageGate ready for use.	2026-02-01 18:29:13 -06:00
Jason Woltje	f45dbac7b4	feat(#147 ): Implement core quality gates (TDD - GREEN phase) Implement four quality gates enforcing non-negotiable quality standards: 1. BuildGate: Runs mypy type checking - Detects compilation/type errors - Uses strict mode from pyproject.toml - Returns GateResult with pass/fail status 2. LintGate: Runs ruff linting - Treats warnings as failures (non-negotiable) - Checks code style and quality - Enforces rules from pyproject.toml 3. TestGate: Runs pytest tests - Requires 100% test pass rate (non-negotiable) - Runs without coverage (separate gate) - Detects test failures and missing tests 4. CoverageGate: Measures test coverage - Enforces 85% minimum coverage (non-negotiable) - Extracts coverage from JSON and output - Handles edge cases gracefully All gates implement QualityGate protocol with check() method. All gates return GateResult with passed/message/details. All implementations achieve 100% test coverage. Files created: - src/gates/quality_gate.py: Protocol and result model - src/gates/build_gate.py: Type checking enforcement - src/gates/lint_gate.py: Linting enforcement - src/gates/test_gate.py: Test execution enforcement - src/gates/coverage_gate.py: Coverage enforcement - src/gates/__init__.py: Module exports Related to #147 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-01 18:25:16 -06:00
Jason Woltje	0af93d1ef4	test(#147 ): Add tests for quality gates (TDD - RED phase) Implement comprehensive test suite for four core quality gates: - BuildGate: Tests mypy type checking enforcement - LintGate: Tests ruff linting with warnings as failures - TestGate: Tests pytest execution requiring 100% pass rate - CoverageGate: Tests coverage enforcement with 85% minimum All tests follow TDD methodology - written before implementation. Total: 36 tests covering success, failure, and edge cases. Related to #147 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-01 18:25:02 -06:00
Jason Woltje	9f3c76d43b	test(#146 ): Validate assignment cost optimization Add comprehensive cost optimization test scenarios and validation report. Test Scenarios Added (10 new tests): - Low difficulty assigns to MiniMax/GLM (free agents) - Medium difficulty assigns to GLM when within capacity - High difficulty assigns to Opus (only capable agent) - Oversized issues rejected with actionable error - Boundary conditions at capacity limits - Aggregate cost optimization across all scenarios Results: - All 33 tests passing (23 existing + 10 new) - 100% coverage of agent_assignment.py (36/36 statements) - Cost savings validation: 50%+ in aggregate scenarios - Real-world projection: 70%+ savings with typical workload Documentation: - Created cost-optimization-validation.md with detailed analysis - Documents cost savings for each scenario - Validates all acceptance criteria from COORD-006 Completes Phase 2 (M4.1-Coordinator) testing requirements. Fixes #146 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-01 18:13:53 -06:00
Jason Woltje	10ecbd63f1	test(#161 ): Add comprehensive E2E integration test for coordinator Implements complete end-to-end integration test covering: - Webhook receiver → parser → queue → orchestrator flow - Signature validation in full flow - Dependency blocking and unblocking logic - Multi-issue processing with correct ordering - Error handling (malformed issues, agent failures) - Performance requirement (< 10 seconds) Test suite includes 7 test cases: 1. test_full_flow_webhook_to_orchestrator - Main critical path 2. test_full_flow_with_blocked_dependency - Dependency management 3. test_full_flow_with_multiple_issues - Queue ordering 4. test_webhook_signature_validation_in_flow - Security 5. test_parser_handles_malformed_issue_body - Error handling 6. test_orchestrator_handles_spawn_agent_failure - Resilience 7. test_performance_full_flow_under_10_seconds - Performance All tests pass (182 total including 7 new). Performance verified: Full flow completes in < 1 second. 100% of critical integration path covered. Completes #161 (COORD-005) and validates Phase 0. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-01 18:08:10 -06:00
Jason Woltje	9b1a1c0b8a	feat(#145 ): Build assignment algorithm Implement intelligent agent assignment algorithm that selects the optimal agent for each issue based on context capacity, difficulty, and cost. Algorithm: 1. Filter agents that meet context capacity (50% rule - agent needs 2x context) 2. Filter agents that can handle difficulty level 3. Sort by cost (prefer self-hosted when capable) 4. Return cheapest qualifying agent Features: - NoCapableAgentError raised when no agent can handle requirements - Difficulty mapping: easy/low->LOW, medium->MEDIUM, hard/high->HIGH - Self-hosted preference (GLM, minimax cost=0) - Comprehensive test coverage (100%, 23 tests) Test scenarios: - Assignment for low/medium/high difficulty issues - Context capacity filtering (50% rule enforcement) - Cost optimization logic (prefers self-hosted) - Error handling for impossible assignments - Edge cases (zero context, negative context, invalid difficulty) Quality gates: - All 23 tests passing - 100% code coverage (exceeds 85% requirement) - Lint: passing (ruff) - Type check: passing (mypy) Refs #145 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-01 18:07:58 -06:00
Jason Woltje	88953fc998	feat(#160 ): Implement basic orchestration loop Implements the Coordinator class with main orchestration loop: - Async loop architecture with configurable poll interval - process_queue() method gets next ready issue and spawns agent (stub) - Graceful shutdown handling with stop() method - Error handling that allows loop to continue after failures - Logging for all actions (start, stop, processing, errors) - Integration with QueueManager from #159 - Active agent tracking for future agent management Configuration settings added: - COORDINATOR_POLL_INTERVAL (default: 5.0s) - COORDINATOR_MAX_CONCURRENT_AGENTS (default: 10) - COORDINATOR_ENABLED (default: true) Tests: 27 new tests covering all acceptance criteria Coverage: 92% overall (100% for coordinator.py) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 18:03:12 -06:00
Jason Woltje	f0fd0bed41	feat(#144 ): Implement agent profiles - Add Capability enum (HIGH, MEDIUM, LOW) for agent difficulty levels - Add AgentName enum for all 5 agents (opus, sonnet, haiku, glm, minimax) - Implement AgentProfile data structure with validation - context_limit: max tokens for context window - cost_per_mtok: cost per million tokens (0 for self-hosted) - capabilities: list of difficulty levels the agent handles - best_for: description of optimal use cases - Define profiles for all 5 agents with specifications: - Anthropic models (opus, sonnet, haiku): 200K context, various costs - Self-hosted models (glm, minimax): 128K context, free - Implement get_agent_profile() function for profile lookup - Add comprehensive test suite (37 tests, 100% coverage) - Profile data structure validation - All 5 predefined profiles exist and are correct - Capability enum and AgentName enum tests - Best_for validation and capability matching - Consistency checks across profiles Fixes #144 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-01 18:00:19 -06:00
Jason Woltje	a1b911d836	test(#143 ): Validate 50% rule prevents context exhaustion Following TDD (Red-Green-Refactor): - RED: Created comprehensive test suite with 12 test cases - GREEN: Implemented validation logic that passes all tests - All quality gates passed Test Coverage: - Oversized issue (120K) correctly rejected - Properly sized issue (80K) correctly accepted - Edge case at exactly 50% (100K) correctly accepted - Sequential issues validated individually - All agent types tested (opus, sonnet, haiku, glm, minimax) - Edge cases covered (zero, very small, boundaries) Implementation: - src/validation.py: Pure validation function - tests/test_fifty_percent_rule.py: 12 comprehensive tests - docs/50-percent-rule-validation.md: Validation report - 100% test coverage (14/14 statements) - Type checking: PASS (mypy) - Linting: PASS (ruff) The 50% rule ensures no single issue exceeds 50% of target agent's context limit, preventing context exhaustion while allowing efficient capacity utilization. Fixes #143 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-01 17:56:04 -06:00
Jason Woltje	72321f5fcd	feat(#159 ): Implement queue manager Implements QueueManager with full dependency tracking, persistence, and status management. Key features: - QueueItem dataclass with status, metadata, and ready flag - QueueManager with enqueue, dequeue, get_next_ready, mark_complete - Dependency resolution (blocked_by → not ready) - JSON persistence with auto-save on state changes - Automatic reload on startup - Graceful handling of circular dependencies - Status transitions (pending → in_progress → completed) Test coverage: - 26 comprehensive tests covering all operations - Dependency chain resolution - Persistence and reload scenarios - Edge cases (circular deps, missing items) - 100% code coverage on queue module - 97% total project coverage Quality gates passed: ✓ All tests passing (88 total) ✓ Type checking (mypy) passing ✓ Linting (ruff) passing ✓ Coverage ≥85% (97% achieved) This unblocks #160 (orchestrator needs queue). Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-01 17:55:48 -06:00
Jason Woltje	dad4b68f66	feat(#158 ): Implement issue parser agent Add AI-powered issue metadata parser using Anthropic Sonnet model. - Parse issue markdown to extract: estimated_context, difficulty, assigned_agent, blocks, blocked_by - Implement in-memory caching to avoid duplicate API calls - Graceful fallback to defaults on parse failures - Add comprehensive test suite (9 test cases) - 95% test coverage (exceeds 85% requirement) - Add ANTHROPIC_API_KEY to config - Update documentation and add .env.example Fixes #158 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-01 17:50:35 -06:00
Jason Woltje	d54c65360a	feat(#155 ): Build basic context monitor Implements ContextMonitor class with real-time token usage tracking: - COMPACT_THRESHOLD at 0.80 (80% triggers compaction) - ROTATE_THRESHOLD at 0.95 (95% triggers rotation) - Poll Claude API for context usage - Return appropriate ContextAction based on thresholds - Background monitoring loop (10-second polling) - Log usage over time - Error handling and recovery Added ContextUsage model for tracking agent token consumption. Tests: - 25 test cases covering all functionality - 100% coverage for context_monitor.py and models.py - Mocked API responses for different usage levels - Background monitoring and threshold detection - Error handling verification Quality gates: - Type checking: PASS (mypy) - Linting: PASS (ruff) - Tests: PASS (25/25) - Coverage: 100% for new files, 95.43% overall Fixes #155 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-01 17:49:09 -06:00
Jason Woltje	e23c09f1f2	feat(#157 ): Set up webhook receiver endpoint Implement FastAPI webhook receiver for Gitea issue assignment events with HMAC SHA256 signature verification and event routing. Implementation details: - FastAPI application with /webhook/gitea POST endpoint - HMAC SHA256 signature verification in security.py - Event routing for assigned, unassigned, closed actions - Comprehensive logging for all webhook events - Health check endpoint at /health - Docker containerization with health checks - 91% test coverage (exceeds 85% requirement) TDD workflow followed: - Wrote 16 tests first (RED phase) - Implemented features to pass tests (GREEN phase) - All tests passing with 91% coverage - Type checking with mypy: success - Linting with ruff: success Files created: - apps/coordinator/src/main.py - FastAPI application - apps/coordinator/src/webhook.py - Webhook handlers - apps/coordinator/src/security.py - HMAC verification - apps/coordinator/src/config.py - Configuration management - apps/coordinator/tests/ - Comprehensive test suite - apps/coordinator/Dockerfile - Production container - apps/coordinator/pyproject.toml - Python project config Configuration: - Updated .env.example with GITEA_WEBHOOK_SECRET - Updated docker-compose.yml with coordinator service Testing: - 16 unit and integration tests - Security tests for signature verification - Event handler tests for all supported actions - Health check endpoint tests - All tests passing with 91% coverage This unblocks issue #158 (issue parser). Fixes #157 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-01 17:41:46 -06:00
Jason Woltje	cd727f619f	feat: Add debug output to Dockerfiles and .dockerignore Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details ci/woodpecker/manual/woodpecker Pipeline was successful Details - Add .dockerignore to exclude node_modules, dist, and build artifacts - Add pre/post build directory listings to diagnose dist not found issue - Disable turbo cache temporarily with --force flag - Add --verbosity=2 for more detailed turbo output Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 14:50:13 -06:00
Jason Woltje	442c2f7de2	fix: Dockerfile COPY order - node_modules must come after source All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Docker COPY replaces directory contents, so copying source code after node_modules was wiping the deps. Reordered to: 1. Copy source code first 2. Copy node_modules second (won't be overwritten) Fixes API build failure: "dist not found" Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 13:39:25 -06:00
Jason Woltje	9246f56687	fix(api): Add AuthModule import to modules using AuthGuard All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Modules using AuthGuard in their controllers need to import AuthModule to make AuthService available for dependency injection. Fixed: - ActivityModule - WorkspaceSettingsModule Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 01:48:09 -06:00
Jason Woltje	fb0f6b5b62	fix(docker): Fix module resolution and healthcheck syntax All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Issues fixed: 1. Module not found: Added missing copy of apps/{api,web}/node_modules which contains pnpm symlinks to the root node_modules 2. Healthcheck syntax: Fixed broken quoting from prettier reformatting Changed to CMD-SHELL with proper escaping 3. Removed obsolete version: "3.9" from docker-compose.yml The apps need their own node_modules directories because pnpm uses symlinks that point from apps//node_modules to node_modules/.pnpm/ Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 01:37:30 -06:00
Jason Woltje	aa17b9cb3b	fix(docker): Make port configuration consistent and dynamic All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Fixed the mismatch between environment variables: - docker-compose now passes PORT (what NestJS/Next.js read) instead of API_PORT - API_PORT/WEB_PORT control host mapping, PORT controls container Changes: - docker-compose: Pass PORT=${API_PORT} and PORT=${WEB_PORT} to containers - docker-compose: Dynamic port mapping on both host and container sides - docker-compose: Traefik labels use ${API_PORT}/${WEB_PORT} variables - docker-compose: Healthchecks use PORT env var - Dockerfiles: Removed hardcoded port values - Dockerfiles: Healthchecks read PORT at runtime This allows changing ports via API_PORT/WEB_PORT environment variables and have all components (app, healthcheck, Traefik) use the correct port. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 01:29:15 -06:00
Jason Woltje	e045cb5a45	perf(docker): Add BuildKit cache mounts for faster builds All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Added cache mounts for: - pnpm store: Caches downloaded packages between builds - TurboRepo: Caches build outputs between builds This significantly speeds up subsequent builds: - First build: Full download and compile - Subsequent builds: Only changed packages are re-downloaded/rebuilt Requires Docker BuildKit (default in Docker 23+). Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 01:22:51 -06:00
Jason Woltje	353f04f950	fix(docker): Ensure public directory exists in web builder All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details The production stage was failing because it tried to copy the public directory which doesn't exist in the source. Added mkdir -p to ensure the directory exists (even if empty) before the production stage tries to copy it. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 01:15:34 -06:00
Jason Woltje	0495c48418	fix(docker): Copy node_modules from builder instead of reinstalling All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details pnpm stores the Prisma client in the content-addressable store at node_modules/.pnpm/.../.prisma, not at apps/api/node_modules/.prisma. The production stage was trying to copy from the wrong location. Additionally, running `pnpm install --prod` in production failed because: 1. The husky prepare script runs but husky is a devDependency 2. The Prisma client postinstall can't run without the prisma CLI Fixed by copying the full node_modules from the builder stage, which already has all dependencies properly installed and the Prisma client generated in the correct pnpm store location. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 00:42:34 -06:00
Jason Woltje	7ee08865fd	fix(docker): Use TurboRepo to build workspace dependencies All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details The Docker builds were failing because they ran `pnpm build` directly in the app directories without first building workspace dependencies (@mosaic/shared, @mosaic/ui). CI passed because it runs TurboRepo from the root which respects the dependency graph. Changed both Dockerfiles to use `pnpm turbo build --filter=@mosaic/{app}` which ensures dependencies are built in the correct order: - Web: @mosaic/config → @mosaic/shared → @mosaic/ui → @mosaic/web - API: @mosaic/config → @mosaic/shared → prisma:generate → @mosaic/api Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 00:37:34 -06:00
Jason Woltje	cb0948214e	feat(auth): Configure Authentik OIDC integration with better-auth All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details - Add genericOAuth plugin to auth.config.ts with Authentik provider - Fix LoginButton to use /auth/signin/authentik (not /auth/callback/) - Add production URLs to trustedOrigins - Update .env.example with correct redirect URI documentation Redirect URI for Authentik: https://api.mosaicstack.dev/auth/callback/authentik Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 18:11:32 -06:00
Jason Woltje	f2b25079d9	fix(#27 ): address security issues in intent classification All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details - Add input sanitization to prevent LLM prompt injection (escapes quotes, backslashes, replaces newlines) - Add MaxLength(500) validation to DTO to prevent DoS - Add entity validation to filter malicious LLM responses - Add confidence validation to clamp values to 0.0-1.0 - Make LLM model configurable via INTENT_CLASSIFICATION_MODEL env var - Add 12 new security tests (total: 72 tests, from 60) Security fixes identified by code review: - CVE-mitigated: Prompt injection via unescaped user input - CVE-mitigated: Unvalidated entity data from LLM response - CVE-mitigated: Missing input length validation Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 16:50:32 -06:00
Jason Woltje	d7f04d1148	feat(#27 ): implement intent classification service All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Implement intent classification for natural language queries in the brain module. Features: - Hybrid classification approach: rule-based (fast, <100ms) with optional LLM fallback - 10 intent types: query_tasks, query_events, query_projects, create_task, create_event, update_task, update_event, briefing, search, unknown - Entity extraction: dates, times, priorities, statuses, people - Pattern-based matching with priority system (higher priority = checked first) - Optional LLM classification for ambiguous queries - POST /api/brain/classify endpoint Implementation: - IntentClassificationService with classify(), classifyWithRules(), classifyWithLlm(), extractEntities() - Comprehensive regex patterns for common query types - Entity extraction for dates, times, priorities, statuses, mentions - Type-safe interfaces for IntentType, IntentClassification, ExtractedEntity, IntentPattern - ClassifyIntentDto and IntentClassificationResultDto for API validation - Integrated with existing LlmService (optional dependency) Testing: - 60 comprehensive tests covering all intent types - Edge cases: empty queries, special characters, case sensitivity, multiple whitespace - Entity extraction tests with position tracking - LLM fallback tests with error handling - 100% test coverage - All tests passing (60/60) - TDD approach: tests written first Quality: - No explicit any types - Explicit return types on all functions - No TypeScript errors - Build successful - Follows existing code patterns - Quality Rails compliance: All lint checks pass Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 15:41:10 -06:00
Jason Woltje	3d6159ae15	fix: address code review issues and cleanup QA reports All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Code review fixes: - Add error logging to LlmProviderAdminController.testProvider catch block - Use atomic increment operations in TokenBudgetService.updateUsage to prevent race conditions - Update test expectations for atomic increment pattern Cleanup: - Remove obsolete QA automation reports All 1169 tests passing. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 15:01:18 -06:00
Jason Woltje	903109ea40	docs: Add overlap analysis for non-AI coordinator patterns All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Detailed comparison showing: - Existing doc addresses L-015 (premature completion) - New doc addresses context exhaustion (multi-issue orchestration) - ~20% overlap (both use non-AI coordinator, mechanical gates) - 80% complementary (different problems, different solutions) Recommends merging into comprehensive document (already done). Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 14:47:59 -06:00
Jason Woltje	4b4d21c732	feat(#129 ): add LLM provider admin API endpoints Implement REST API endpoints for managing LLM provider instances. Changes: - Created DTOs for provider CRUD operations (CreateLlmProviderDto, UpdateLlmProviderDto, LlmProviderResponseDto) - Implemented LlmProviderAdminController with full CRUD endpoints: - GET /llm/admin/providers - List all providers - GET /llm/admin/providers/:id - Get provider details - POST /llm/admin/providers - Create new provider - PATCH /llm/admin/providers/:id - Update provider - DELETE /llm/admin/providers/:id - Delete provider - POST /llm/admin/providers/:id/test - Test connection - POST /llm/admin/reload - Reload from database - Updated llm-manager.service.ts to support OpenAI and Claude providers - Added comprehensive test suite with 97.95% coverage - Proper validation, error handling, and type safety All tests pass. Pre-commit hooks pass. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 14:37:55 -06:00
Jason Woltje	772776bfd9	feat(#125 ): add Claude (Anthropic) LLM provider Implement Anthropic Claude provider for Claude Opus, Sonnet, and Haiku models. Implementation details: - Created ClaudeProvider class implementing LlmProviderInterface - Added @anthropic-ai/sdk npm package integration - Implemented chat completion with streaming support - Claude-specific message format (system prompt separate from messages) - Static model list (Claude API doesn't provide list models endpoint) - Embeddings throw error as Claude doesn't support native embeddings - Added OpenTelemetry tracing with @TraceLlmCall decorator - 100% statement, function, and line coverage (79% branch coverage) Tests: - Created comprehensive test suite with 20 tests - All tests follow TDD pattern (written before implementation) - Tests cover initialization, health checks, chat, streaming, and error handling - Mocked Anthropic SDK client for isolated unit testing Quality checks: - All tests pass (1131 total tests across project) - ESLint passes with no errors - TypeScript type checking passes - Follows existing code patterns from OpenAI and Ollama providers Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 14:29:40 -06:00
Jason Woltje	0fdcfa6ed3	feat(#124 ): add OpenAI LLM provider Implement OpenAI provider for GPT-4, GPT-3.5, and other OpenAI models. Implementation includes: - OpenAI SDK integration with API key authentication - Chat completion with streaming support - Embeddings generation - Health checks and model listing - OpenTelemetry tracing - Comprehensive test suite with 97% coverage Follows TDD methodology: - Written tests first (RED phase) - Implemented minimal code to pass tests (GREEN phase) - Code passes typecheck, linter, and all quality gates Test coverage: 97.18% statements, 97.05% lines All 22 tests passing Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 14:21:38 -06:00
Jason Woltje	faf6328e0b	test(#141 ): add Non-AI Coordinator integration tests Comprehensive E2E validation proving coordinator enforces quality gates and prevents premature completion claims. Test scenarios (21 tests): - Rejection Flow: Build/lint/test/coverage gate failures - Acceptance Flow: All gates pass, required-only pass - Continuation Flow: Retry, escalation, attempt tracking - Escalation Flow: Manual review, notifications, history - Configuration: Workspace-specific, defaults, custom gates - Performance: Timeout compliance, memory limits - Complete E2E: Full rejection-continuation-acceptance cycle Fixtures: - mock-agent-outputs.ts: Simulated gate execution results - mock-gate-configs.ts: Various gate configurations Validates integration of: - Quality Orchestrator (#134) - Quality Gate Config (#135) - Completion Verification (#136) - Continuation Prompts (#137) - Rejection Handler (#139) All 21 tests passing Fixes #141 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 14:14:56 -06:00
Jason Woltje	a86d304f07	feat(#139 ): build Gate Rejection Response Handler All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Implement rejection handling for tasks that fail quality gates after all continuation attempts are exhausted. Schema: - Add TaskRejection model for tracking rejections - Store failures, attempts, escalation state Service: - handleRejection: Main entry point for rejection handling - logRejection: Database logging - determineEscalation: Rule-based escalation determination - executeEscalation: Execute escalation actions - sendNotification: Notification dispatch - markForManualReview: Flag tasks for human review - getRejectionHistory: Query rejection history - generateRejectionReport: Markdown report generation Escalation rules: - max-attempts: Trigger after 3+ attempts - time-exceeded: Trigger after 2+ hours - critical-failure: Trigger on security/critical issues Actions: notify, block, reassign, cancel Tests: 16 passing with 80% statement coverage Fixes #139 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 14:01:42 -06:00
Jason Woltje	0387cce116	feat(#137 ): create Forced Continuation Prompt System Implement prompt generation system that produces continuation prompts based on verification failures to force AI agents to complete work. Service: - generatePrompt: Complete prompt from failure context - generateTestFailurePrompt: Test-specific guidance - generateBuildErrorPrompt: Build error resolution - generateCoveragePrompt: Coverage improvement strategy - generateIncompleteWorkPrompt: Completion requirements Templates: - base.template: System/user prompt structure - test-failure.template: Test fix guidance - build-error.template: Compilation error guidance - coverage.template: Coverage improvement strategy - incomplete-work.template: Completion requirements Constraint escalation: - Attempt 1: Normal guidance - Attempt 2: Focus only on failures - Attempt 3: Minimal changes only - Final: Last attempt warning Priority levels: critical/high/normal based on failure severity Tests: 24 passing with 95.31% coverage Fixes #137 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 13:51:46 -06:00
Jason Woltje	72ae92f5a6	feat(#136 ): build Completion Verification Engine Implement verification engine to determine if AI agent work is truly complete by analyzing outputs and detecting deferred work patterns. Strategies: - FileChangeStrategy: Detect TODO/FIXME, placeholders, stubs - TestOutputStrategy: Validate pass rates, coverage (85%), skipped tests - BuildOutputStrategy: Detect TS errors, ESLint errors, build failures Deferred work detection patterns: - "follow-up", "to be added later" - "incremental improvement", "future enhancement" - "TODO: complete", "placeholder implementation" - "stub", "work in progress", "partially implemented" Features: - Confidence scoring (0-100%) - Verdict system: complete/incomplete/needs-review - Actionable suggestions for improvements - Strategy-based extensibility Integration: - Complements Quality Orchestrator (#134) - Uses Quality Gate Config (#135) Tests: 46 passing with 95.27% coverage Fixes #136 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 13:44:23 -06:00
Jason Woltje	4a2909ce1e	feat(#135 ): implement Quality Gate Configuration System Add database-backed quality gate configuration for workspaces with full CRUD operations and default gate seeding. Schema: - Add QualityGate model with workspace relation - Support for custom commands and regex patterns - Enable/disable and ordering support Service: - CRUD operations for quality gates - findEnabled: Get ordered, enabled gates - reorder: Bulk reorder with transaction - seedDefaults: Seed 4 default gates - toOrchestratorFormat: Convert to orchestrator interface Endpoints: - GET /workspaces/:id/quality-gates - List - GET /workspaces/:id/quality-gates/:gateId - Get one - POST /workspaces/:id/quality-gates - Create - PATCH /workspaces/:id/quality-gates/:gateId - Update - DELETE /workspaces/:id/quality-gates/:gateId - Delete - POST /workspaces/:id/quality-gates/reorder - POST /workspaces/:id/quality-gates/seed-defaults Default gates: Build, Lint, Test, Coverage (85%) Tests: 25 passing with 95.16% coverage Fixes #135 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 13:33:04 -06:00
Jason Woltje	a25e9048be	feat(#134 ): design Non-AI Quality Orchestrator service Implement quality orchestration service to enforce standards on AI agent work and prevent premature completion claims. Components: - QualityOrchestratorService: Core validation and gate execution - QualityGate interface: Extensible gate definitions - CompletionClaim/Validation: Track claims and verdicts - OrchestrationConfig: Per-workspace configuration Features: - Validate completions against quality gates (build/lint/test/coverage) - Run gates with command execution and output validation - Support string and RegExp output pattern matching - Smart continuation logic with attempt tracking - Generate actionable feedback for failed gates - Strict/lenient mode for gate enforcement - 5-minute timeout, 10MB output buffer per gate Default gates: - Build Check (required) - Lint Check (required) - Test Suite (required) - Coverage Check (optional, 85% threshold) Tests: 21 passing with 85.98% coverage Fixes #134 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 13:24:46 -06:00
Jason Woltje	0c78923138	feat(#133 ): add workspace-scoped LLM configuration Implement per-workspace LLM provider and personality configuration with proper hierarchy (workspace > user > system fallback). Schema: - Add WorkspaceLlmSettings model with provider/personality FKs - One-to-one relation with Workspace - JSON settings field for extensibility Service: - getSettings: Retrieves/creates workspace settings - updateSettings: Updates with null value support - getEffectiveLlmProvider: Hierarchy-based provider selection - getEffectivePersonality: Hierarchy-based personality selection Endpoints: - GET /workspaces/:id/settings/llm - Get settings - PATCH /workspaces/:id/settings/llm - Update settings - GET /workspaces/:id/settings/llm/effective-provider - GET /workspaces/:id/settings/llm/effective-personality Configuration hierarchy: 1. Workspace-configured provider/personality 2. User-specific provider (for providers) 3. System default fallback Tests: 34 passing with 100% coverage Fixes #133 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 13:15:36 -06:00
Jason Woltje	b8805cee50	feat(#132 ): port MCP (Model Context Protocol) infrastructure Implement MCP Phase 1 infrastructure for agent tool integration with central hub, tool registry, and STDIO transport layers. Components: - McpHubService: Central registry for MCP server lifecycle - StdioTransport: STDIO process communication with JSON-RPC 2.0 - ToolRegistryService: Tool catalog management - McpController: REST API for MCP management Endpoints: - GET/POST /mcp/servers - List/register servers - POST /mcp/servers/:id/start\|stop - Lifecycle control - DELETE /mcp/servers/:id - Unregister - GET /mcp/tools - List tools - POST /mcp/tools/:name/invoke - Invoke tool Features: - Full JSON-RPC 2.0 protocol support - Process lifecycle management - Buffered message parsing - Type-safe with no explicit any types - Proper cleanup on shutdown Tests: 85 passing with 90.9% coverage Fixes #132 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 13:07:58 -06:00
Jason Woltje	51e6ad0792	feat(#131 ): add OpenTelemetry tracing infrastructure Implement comprehensive distributed tracing for HTTP requests and LLM operations using OpenTelemetry with GenAI semantic conventions. Features: - TelemetryService: SDK initialization with OTLP HTTP exporter - TelemetryInterceptor: Automatic HTTP request spans - @TraceLlmCall decorator: LLM operation tracing - GenAI semantic conventions for model/token tracking - Graceful degradation when tracing disabled Instrumented: - All HTTP requests (automatic spans) - OllamaProvider chat/chatStream/embed operations - Token counts, model names, durations Environment: - OTEL_ENABLED (default: true) - OTEL_SERVICE_NAME (default: mosaic-api) - OTEL_EXPORTER_OTLP_ENDPOINT (default: localhost:4318) Tests: 23 passing with full coverage Fixes #131 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 12:55:11 -06:00
Jason Woltje	64cb5c1edd	feat(#130 ): add Personality Prisma schema and backend Implement Personality system backend with database schema, service, controller, and comprehensive tests. Personalities define assistant behavior with system prompts and LLM configuration. Changes: - Update Personality model in schema.prisma with LLM provider relation - Create PersonalitiesService with CRUD and default management - Create PersonalitiesController with REST endpoints - Add DTOs with validation (create/update) - Add entity for type safety - Remove unused PromptFormatterService - Achieve 26 tests with full coverage Endpoints: - GET /personality - List all - GET /personality/default - Get default - GET /personality/by-name/:name - Get by name - GET /personality/:id - Get one - POST /personality - Create - PATCH /personality/:id - Update - DELETE /personality/:id - Delete - POST /personality/:id/set-default - Set default Fixes #130 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 12:44:50 -06:00
Jason Woltje	1f97e6de40	feat(#127 ): refactor LlmService to use provider pattern Refactor LlmService to delegate to LlmManagerService instead of using Ollama directly. This enables multiple provider support and user-specific provider configuration. Changes: - Remove direct Ollama client from LlmService - Delegate all LLM operations to provider via LlmManagerService - Update health status to use provider-agnostic interface - Add PrismaModule to LlmModule for manager service - Maintain backward compatibility with existing API - Achieve 89.74% test coverage Fixes #127 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-31 12:33:56 -06:00
Jason Woltje	be6c15116d	feat(#126 ): create LLM Manager Service Implemented centralized service for managing multiple LLM provider instances. Architecture: - LlmManagerService manages provider lifecycle and selection - Loads provider instances from Prisma database on startup - Maintains in-memory registry of active providers - Factory pattern for provider instantiation Core Features: - Database integration via PrismaService - Provider initialization on module startup (OnModuleInit) - Get provider by ID - Get all active providers - Get system default provider - Get user-specific provider with fallback to system default - Health check all registered providers - Dynamic registration/unregistration (hot reload) - Reload from database without restart Provider Selection Logic: - User-level providers: userId matches, is enabled - System-level providers: userId is NULL, is enabled - Fallback: system default if no user provider found - Graceful error handling with detailed logging Integration: - Added to LlmModule providers and exports - Uses PrismaService for database queries - Factory creates OllamaProvider from config - Extensible for future providers (Claude, OpenAI) Testing: - 31 comprehensive unit tests - 93.05% code coverage (exceeds 85% requirement) - All error scenarios covered - Proper mocking of dependencies Quality Gates: - ✅ All 31 tests passing - ✅ 93.05% coverage - ✅ Linting clean - ✅ Type checking passed - ✅ Code review approved Fixes #126 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-31 12:22:14 -06:00
Jason Woltje	94afeb67e3	feat(#123 ): port Ollama LLM provider Implemented first concrete LLM provider following the provider interface pattern. Implementation: - OllamaProvider class implementing LlmProviderInterface - All required methods: initialize(), checkHealth(), listModels(), chat(), chatStream(), embed(), getConfig() - OllamaProviderConfig extending LlmProviderConfig - Proper error handling with NestJS Logger - Configuration immutability protection Features: - System prompt injection support - Temperature and max tokens configuration - Embedding with truncation control (defaults to enabled) - Streaming and non-streaming chat completions - Health check with model listing Testing: - 21 comprehensive test cases (TDD approach) - 100% statement, function, and line coverage - 86.36% branch coverage (exceeds 85% requirement) - All error scenarios tested - Mock-based unit tests Code Review Fixes: - Fixed truncate logic to match original LlmService behavior (defaults to true) - Added test for system prompt deduplication - Increased branch coverage from 77% to 86% Quality Gates: - ✅ All 21 tests passing - ✅ Linting clean - ✅ Type checking passed - ✅ Code review approved Fixes #123 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-31 12:10:43 -06:00
Jason Woltje	1e35e63444	feat(#128 ): add LlmProviderInstance Prisma schema Added database schema for LLM provider instance configuration to support multi-provider architecture. Schema design: - LlmProviderInstance model with UUID primary key - Fields: providerType, displayName, userId, config, isDefault, isEnabled - JSON config field for flexible provider-specific settings - Nullable userId: NULL = system-level, UUID = user-level - Foreign key to User with CASCADE delete - Added llmProviders relation to User model Indexes: - user_id: Fast user lookup - provider_type: Filter by provider - is_default: Quick default lookup - is_enabled: Enabled/disabled filtering Migration: 20260131115600_add_llm_provider_instance - PostgreSQL table creation with proper types - Foreign key constraint - Performance indexes Prisma client regenerated successfully. Database migration requires manual deployment when DB is available. Fixes #128 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-31 11:57:40 -06:00
Jason Woltje	dc4f6cbb9d	feat(#122 ): create LLM provider interface Implemented abstract LLM provider interface to enable multi-provider support. Key components: - LlmProviderInterface: Abstract contract for all LLM providers - LlmProviderConfig: Base configuration interface - LlmProviderHealthStatus: Standardized health check response - LlmProviderType: Type discriminator for runtime checks Methods defined: - initialize(): Async provider setup - checkHealth(): Health status verification - listModels(): Available model enumeration - chat(): Synchronous completion - chatStream(): Streaming completion (async generator) - embed(): Embedding generation - getConfig(): Configuration access All methods fully documented with JSDoc. 13 tests written and passing. Type checking verified. Fixes #122 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-31 11:38:38 -06:00

1 2 3

149 Commits