stack

Author	SHA1	Message	Date
Jason Woltje	1f86c36cc1	chore: Update tasks.md - Phase 2 complete (3/3) Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details	2026-02-07 16:17:51 -06:00
Jason Woltje	40f7e7e4c0	docs(#354 ): Add comprehensive OpenBao integration guide Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details Complete documentation for OpenBao Transit encryption covering setup, architecture, production hardening, and operations. Sections: - Overview: Why OpenBao, Transit encryption explained - Architecture: Data flow diagrams, fallback behavior - Default Setup: Turnkey auto-init/unseal, file locations - Environment Variables: Configuration options - Transit Keys: Named keys, rotation procedures - Production Hardening: 10-point security checklist - Operations: Health checks, manual procedures, monitoring - Troubleshooting: Common issues and solutions - Disaster Recovery: Backup/restore procedures Key Topics: - Shamir key splitting upgrade (1-of-1 → 3-of-5) - TLS configuration for production - Audit logging enablement - HA storage backends (Raft/Consul) - External auto-unseal with KMS - Rate limiting via reverse proxy - Network isolation best practices - Key rotation procedures - Backup automation Closes #354 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-07 16:16:51 -06:00
Jason Woltje	dd171b287f	feat(#353 ): Create VaultService NestJS module for OpenBao Transit Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details Implements secure credential encryption using OpenBao Transit API with automatic fallback to AES-256-GCM when OpenBao is unavailable. Features: - AppRole authentication with automatic token renewal at 50% TTL - Transit encrypt/decrypt with 4 named keys - Automatic fallback to CryptoService when OpenBao unavailable - Auto-detection of ciphertext format (vault:v1: vs AES) - Request timeout protection (5s default) - Health indicator for monitoring - Backward compatible with existing AES-encrypted data Security: - ERROR-level logging for fallback - Proper error propagation (no silent failures) - Request timeouts prevent hung operations - Secure credential file reading Migrations: - Account encryption middleware uses VaultService - Uses TransitKey.ACCOUNT_TOKENS for OAuth tokens - Backward compatible with existing encrypted data Tests: 56 tests passing (36 VaultService + 20 middleware) Closes #353 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 16:13:05 -06:00
Jason Woltje	d4d1e59885	feat(#357 ): Add OpenBao to Docker Compose with turnkey setup Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details Implements secure credential storage using OpenBao Transit encryption. Features: - Auto-initialization on first run (1-of-1 Shamir key for dev) - Auto-unseal on container restart with verification and retry logic - Transit secrets engine with 4 named encryption keys - AppRole authentication with Transit-only policy - Localhost-only API binding for security - Comprehensive integration test suite (22 tests, all passing) Security: - API bound to 127.0.0.1 (localhost only, no external access) - Unseal verification with 3-attempt retry logic - Sanitized error messages in tests (no secret leakage) - Volume-based secret reading (doesn't require running container) Files: - docker/openbao/config.hcl: Server configuration - docker/openbao/init.sh: Auto-init/unseal script - docker/docker-compose.yml: OpenBao and init services - tests/integration/openbao.test.ts: Full test coverage - .env.example: OpenBao configuration variables Closes #357 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 15:40:24 -06:00
Jason Woltje	9446475ea2	chore: Update tasks.md - Phase 1 complete (3/3) Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details	2026-02-07 13:17:12 -06:00
Jason Woltje	737eb40d18	feat(#352 ): Encrypt existing plaintext Account tokens Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details Implements transparent encryption/decryption of OAuth tokens via Prisma middleware with progressive migration strategy. Core Implementation: - Prisma middleware transparently encrypts tokens on write, decrypts on read - Auto-detects ciphertext format: aes:iv:authTag:encrypted, vault:v1:..., or plaintext - Uses existing CryptoService (AES-256-GCM) for encryption - Progressive encryption: tokens encrypted as they're accessed/refreshed - Zero-downtime migration (schema change only, no bulk data migration) Security Features: - Startup key validation prevents silent data loss if ENCRYPTION_KEY changes - Secure error logging (no stack traces that could leak sensitive data) - Graceful handling of corrupted encrypted data - Idempotent encryption prevents double-encryption - Future-proofed for OpenBao Transit encryption (Phase 2) Token Fields Encrypted: - accessToken (OAuth access tokens) - refreshToken (OAuth refresh tokens) - idToken (OpenID Connect ID tokens) Backward Compatibility: - Existing plaintext tokens readable (encryptionVersion = NULL) - Progressive encryption on next write - BetterAuth integration transparent (middleware layer) Test Coverage: - 20 comprehensive unit tests (89.06% coverage) - Encryption/decryption scenarios - Null/undefined handling - Corrupted data handling - Legacy plaintext compatibility - Future vault format support - All CRUD operations (create, update, updateMany, upsert) Files Created: - apps/api/src/prisma/account-encryption.middleware.ts - apps/api/src/prisma/account-encryption.middleware.spec.ts - apps/api/prisma/migrations/20260207_encrypt_account_tokens/migration.sql Files Modified: - apps/api/src/prisma/prisma.service.ts (register middleware) - apps/api/src/prisma/prisma.module.ts (add CryptoService) - apps/api/src/federation/crypto.service.ts (add key validation) - apps/api/prisma/schema.prisma (add encryptionVersion) - .env.example (document ENCRYPTION_KEY) Fixes #352 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 13:16:43 -06:00
Jason Woltje	89464583a4	chore: Update tasks.md - Issue #350 complete Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details	2026-02-07 12:49:57 -06:00
Jason Woltje	cf9a3dc526	feat(#350 ): Add RLS policies to auth tables with FORCE enforcement Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details Implements Row-Level Security (RLS) policies on accounts and sessions tables with FORCE enforcement. Core Implementation: - Added FORCE ROW LEVEL SECURITY to accounts and sessions tables - Created conditional owner bypass policies (when current_user_id() IS NULL) - Created user-scoped access policies using current_user_id() helper - Documented PostgreSQL superuser limitation with production deployment guide Security Features: - Prevents cross-user data access at database level - Defense-in-depth security layer complementing application logic - Owner bypass allows migrations and BetterAuth operations when no RLS context - Production requires non-superuser application role (documented in migration) Test Coverage: - 22 comprehensive integration tests (9 accounts + 9 sessions + 4 context) - Complete CRUD coverage: CREATE, READ, UPDATE, DELETE (own + others) - Superuser detection with fail-fast error message - Verification that blocked DELETE operations preserve data - 100% test coverage, all tests passing Integration: - Uses RLS context provider from #351 (runWithRlsClient, getRlsClient) - Parameterized queries using set_config() for security - Transaction-scoped session variables with SET LOCAL Files Created: - apps/api/prisma/migrations/20260207_add_auth_rls_policies/migration.sql - apps/api/src/auth/auth-rls.integration.spec.ts Fixes #350 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 12:49:14 -06:00
Jason Woltje	6a1ca5bc10	chore: Update tasks.md - Issue #351 complete All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details	2026-02-07 12:26:33 -06:00
Jason Woltje	93d403807b	feat(#351 ): Implement RLS context interceptor (fix SEC-API-4) Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details Implements Row-Level Security (RLS) context propagation via NestJS interceptor and AsyncLocalStorage. Core Implementation: - RlsContextInterceptor sets PostgreSQL session variables (app.current_user_id, app.current_workspace_id) within transaction boundaries - Uses SET LOCAL for transaction-scoped variables, preventing connection pool leakage - AsyncLocalStorage propagates transaction-scoped Prisma client to services - Graceful handling of unauthenticated routes - 30-second transaction timeout with 10-second max wait Security Features: - Error sanitization prevents information disclosure to clients - TransactionClient type provides compile-time safety, prevents invalid method calls - Defense-in-depth security layer for RLS policy enforcement Quality Rails Compliance: - Fixed 154 lint errors in llm-usage module (package-level enforcement) - Added proper TypeScript typing for Prisma operations - Resolved all type safety violations Test Coverage: - 19 tests (7 provider + 9 interceptor + 3 integration) - 95.75% overall coverage (100% statements on implementation files) - All tests passing, zero lint errors Documentation: - Comprehensive RLS-CONTEXT-USAGE.md with examples and migration guide Files Created: - apps/api/src/common/interceptors/rls-context.interceptor.ts - apps/api/src/common/interceptors/rls-context.interceptor.spec.ts - apps/api/src/common/interceptors/rls-context.integration.spec.ts - apps/api/src/prisma/rls-context.provider.ts - apps/api/src/prisma/rls-context.provider.spec.ts - apps/api/src/prisma/RLS-CONTEXT-USAGE.md Fixes #351 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 12:25:50 -06:00
Jason Woltje	e20aea99b9	test(#344 ): Add comprehensive tests for CI operations service All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details - Add 52 tests achieving 99.3% coverage - Test all public methods: getLatestPipeline, getPipeline, waitForPipeline, getPipelineLogs - Test auto-diagnosis for all failure categories - Test pipeline parsing and status handling - Mock ConfigService and child_process exec - All tests passing with >85% coverage requirement met Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-07 11:27:35 -06:00
Jason Woltje	a69904a47b	docs(#344 ): Add CI verification to orchestrator guide - Document CI configuration requirements - Add CI verification step to execution loop - Document auto-diagnosis categories and patterns - Add CLI integration examples - Add service integration code examples Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-07 11:22:58 -06:00
Jason Woltje	7feb686d73	feat(#344 ): Add CI operations service to orchestrator - Add CIOperationsService for Woodpecker CI integration - Add types for pipeline status, failure diagnosis - Add waitForPipeline with auto-diagnosis on failure - Add getPipelineLogs for log retrieval - Integrate CIModule into orchestrator app Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-07 11:21:38 -06:00
Jason Woltje	51ce32cc76	docs(#346 ): Add credential security architecture design document Comprehensive design document for M7-CredentialSecurity milestone covering hybrid OpenBao Transit + PostgreSQL encryption approach, threat model, UserCredential data model, API design, RLS enforcement strategy, turnkey OpenBao Docker integration, and 5-phase implementation plan. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-07 11:15:58 -06:00
Jason Woltje	ec87c5479b	feat(#344 ): Add Woodpecker CI pipeline monitoring to cli-tools - Add ci-pipeline-status.sh for checking pipeline status - Add ci-pipeline-logs.sh for fetching logs - Add ci-pipeline-wait.sh for waiting on completion - Update package.json bin section - Update README with CI commands and examples Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-07 11:13:43 -06:00
Jason Woltje	bed440dc36	docs(m6): Add Usage Budget Management section Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details Add comprehensive usage budget management design to M6 orchestration architecture. FEATURES: - Real-time usage tracking across agents - Budget allocation per task/milestone/project - Usage projection and burn rate calculation - Throttling decisions to prevent budget exhaustion - Model tier optimization (Haiku/Sonnet/Opus) - Pre-commit usage validation DATA MODEL: - usage_budgets table (allocated/consumed/remaining) - agent_usage_logs table (per-agent tracking) - Valkey keys for real-time state BUDGET CHECKPOINTS: 1. Task assignment - can afford this task? 2. Agent spawn - verify budget headroom 3. Checkpoint intervals - periodic compliance 4. Pre-commit validation - usage efficiency PRIORITY: MVP (M6 Phase 3) for basic tracking, Phase 5 for advanced projection and optimization. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-07 09:55:21 -06:00
jason.woltje	65e56cac5e	Merge pull request 'Integrate M4-LLM error handling into develop' (#349 ) from feature/m4-llm-integration into develop All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Reviewed-on: #349	2026-02-07 02:38:20 +00:00
Jason Woltje	69cc3f8e1e	fix(web): Remove re-throw from loadConversation to prevent unhandled rejections Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details ci/woodpecker/pr/woodpecker Pipeline failed Details - Make loadConversation fully self-contained like sendMessage (handle errors internally via state, onError callback, and structured logging) - Remove duplicate try/catch+log from Chat.tsx imperative handle - Replace re-throw tests with delegation and no-throw tests - Add hook-level loadConversation error path tests (getIdea rejection) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 20:33:52 -06:00
Jason Woltje	f64ca3871d	fix(web): Address review findings for M4-LLM integration Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details ci/woodpecker/pr/woodpecker Pipeline was successful Details - Sanitize user-facing error messages (no raw API/DB errors) - Remove dead try/catch from Chat.tsx handleSendMessage - Add onError callback for persistence errors in useChat - Add console.error logging to loadConversation - Guard minimize/toggleMinimize against closed overlay state - Improve error dedup bucketing for non-DOMException errors - Add tests: non-Error throws, updateConversation failure, minimize/toggleMinimize guards Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 20:25:03 -06:00
Jason Woltje	da1862816f	docs(orchestrator): Add Sprint Completion Protocol + archive M6-Fixes Add sprint archival instructions so completed tasks.md files are retained in docs/tasks/ for post-mortem reference. Includes recovery behavior when an orchestrator finds no active tasks.md. Archive M6-AgentOrchestration-Fixes: 88/90 done, 2 deferred. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 20:13:59 -06:00
Jason Woltje	893a139087	feat(web): Integrate M4-LLM error handling improvements Some checks failed ci/woodpecker/push/woodpecker Pipeline was successful Details ci/woodpecker/pr/woodpecker Pipeline failed Details Port high-value features from work/m4-llm branch into develop's security-hardened codebase: - Separate LLM vs persistence error handling in useChat (shows assistant response even when save fails) - Add structured error context logging with errorType, messagePreview, messageCount fields for debugging - Enforce state invariant in useChatOverlay: cannot be minimized when closed - Add onStorageError callback with user-friendly messages and per-error-type deduplication - Add error logging to Chat imperative handle methods - Create Chat.test.tsx with loadConversation failure mode tests Skipped from work/m4-llm (superseded by develop): - AbortSignal timeout (develop has centralized client timeout) - Custom toast system (duplicates @mosaic/ui) - ErrorBoundary (develop has its own) - WebSocket typed events (develop's ref-based pattern is superior) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 20:04:53 -06:00
jason.woltje	ac796072d8	Merge pull request 'Security Remediation: All Phases Complete (84 fixes)' (#348 ) from fix/security into develop Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details	2026-02-07 01:41:32 +00:00
Jason Woltje	fd73709092	chore(orchestrator): Phase 5 complete - all 17 tasks done + verification Some checks failed ci/woodpecker/push/woodpecker Pipeline was successful Details ci/woodpecker/pr/woodpecker Pipeline failed Details Issue #340: Low Priority - Cleanup + Performance - 26 findings across 7 CQ + 19 SEC-Low, all remediated - 2 findings pre-completed from Phase 4 (CQ-API-7, CQ-ORCH-9) - Test counts: api=2432, web=786, orchestrator=682 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 18:48:58 -06:00
Jason Woltje	3d9edf4141	fix(CQ-WEB-11+12): Fix accessibility labels + SSR window check All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details CQ-WEB-11: Add aria-label attributes to search input, date inputs, and id/htmlFor associations for status and priority filter checkboxes in FilterBar component to improve screen reader accessibility. CQ-WEB-12: Guard all browser-specific API usage in ReactFlowEditor behind typeof window checks. Move isDark detection into useState + useEffect to prevent SSR/hydration mismatches. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 18:45:56 -06:00
Jason Woltje	bfeea743f7	fix(CQ-WEB-10): Add loading/error states to pages with mock data All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Convert tasks, calendar, and dashboard pages from synchronous mock data to async loading pattern with useState/useEffect. Each page now shows a loading state via child components while data loads, and displays a PDA-friendly amber-styled message with a retry button if loading fails. This prepares these pages for real API integration by establishing the async data flow pattern. Child components (TaskList, Calendar, dashboard widgets) already handled isLoading props — now the pages actually use them. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 18:40:21 -06:00
Jason Woltje	952eeb7323	fix(CQ-WEB-9): Cache DOM measurement element in LinkAutocomplete Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details Replace per-keystroke DOM element creation/removal with a persistent off-screen mirror element stored in useRef. The mirror and cursor span are lazily created on first use and reused for all subsequent caret position measurements, eliminating layout thrashing. Cleanup on component unmount removes the element from the DOM. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 18:32:50 -06:00
Jason Woltje	214139f4d5	fix(CQ-WEB-8): Add React.memo to performance-sensitive components All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Wrap 7 list-item/card components with React.memo to prevent unnecessary re-renders when parent components update but props remain unchanged: - TaskItem (task lists) - EventCard (calendar views) - EntryCard (knowledge base) - WorkspaceCard (workspace list) - TeamCard (team list) - DomainItem (domain list) - ConnectionCard (federation connections) All are pure components rendered inside .map() loops that depend solely on their props for rendering output. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 18:28:08 -06:00
Jason Woltje	1005b7969c	fix(SEC-WEB-37): Gate federation mock data behind NODE_ENV check All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Replace exported const mockConnections with getMockConnections() function that returns mock data only when NODE_ENV === "development". In production and test environments, returns an empty array as defense-in-depth alongside the existing ComingSoon page gate (SEC-WEB-4). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 18:22:12 -06:00
Jason Woltje	12fa093f58	fix(SEC-WEB-33+35): Fix Mermaid error display + useWorkspaceId error logging All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details SEC-WEB-33: Replace raw diagram source and detailed error messages in MermaidViewer error UI with a generic "Diagram rendering failed" message. Detailed errors are logged to console.error for debugging only. SEC-WEB-35: Add console.warn in useWorkspaceId when no workspace ID is found in localStorage, making it easier to distinguish "no workspace selected" from silent hook failure. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 18:16:07 -06:00
Jason Woltje	014264c592	fix(SEC-WEB-32+34): Add input maxLength limits + API request timeout All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details SEC-WEB-32: Added maxLength to form inputs (names: 100, descriptions: 500, emails: 254) in WorkspaceSettings, TeamSettings, InviteMember components. SEC-WEB-34: Added AbortController timeout (30s default, configurable) to apiRequest and apiPostFormData in API client. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 18:11:00 -06:00
Jason Woltje	14b547d468	fix(SEC-WEB-30+31+36): Validate JSON.parse/localStorage deserialization All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Add runtime type validation after all JSON.parse calls in the web app to prevent runtime crashes from corrupted or tampered storage data. Creates a shared safeJsonParse utility with type guard functions for each data shape (Message[], ChatOverlayState, LayoutConfigRecord). All four affected callsites now validate parsed data and fall back to safe defaults on mismatch. Files changed: - apps/web/src/lib/utils/safe-json.ts (new utility) - apps/web/src/lib/utils/safe-json.test.ts (25 tests) - apps/web/src/hooks/useChat.ts (deserializeMessages) - apps/web/src/hooks/useChat.test.ts (3 new corruption tests) - apps/web/src/hooks/useChatOverlay.ts (loadState) - apps/web/src/hooks/useChatOverlay.test.ts (3 new corruption tests) - apps/web/src/components/chat/ConversationSidebar.tsx (ideaToConversation) - apps/web/src/lib/hooks/useLayout.ts (layout loading) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 15:46:58 -06:00
Jason Woltje	6d92251fc1	fix(SEC-WEB-27+28): Robust email validation + role cast validation All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details SEC-WEB-27: Replace weak email.includes('@') check with RFC 5322-aligned programmatic validation (isValidEmail). Uses character-level domain label validation to avoid ReDoS vulnerabilities from complex regex patterns. SEC-WEB-28: Replace unsafe 'as WorkspaceMemberRole' type casts with runtime validation (toWorkspaceMemberRole) that checks against known enum values and falls back to MEMBER for invalid inputs. Applied in both InviteMember.tsx and MemberList.tsx. Adds 43 tests covering validation logic, InviteMember component, and MemberList component behavior. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 15:40:05 -06:00
Jason Woltje	65b078c85e	fix(SEC-WEB-26+29): Remove console.log + fix formatTime error handling All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details - Remove debug console.log from workspaces page and teams page - Fix formatTime to return "Invalid date" fallback instead of empty string when date parsing fails (handles both thrown errors and NaN dates) - Export formatTime and add unit tests for error handling cases Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 15:29:32 -06:00
Jason Woltje	dfef71b660	fix(CQ-ORCH-10): Make BullMQ job retention configurable via env vars All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Replace hardcoded BullMQ job retention values (completed: 100 jobs / 1h, failed: 1000 jobs / 24h) with configurable env vars to prevent memory growth under load. Adds QUEUE_COMPLETED_RETENTION_COUNT, QUEUE_COMPLETED_RETENTION_AGE_S, QUEUE_FAILED_RETENTION_COUNT, and QUEUE_FAILED_RETENTION_AGE_S to orchestrator config. Defaults preserve existing behavior. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 15:25:55 -06:00
Jason Woltje	6934d9261c	fix(SEC-ORCH-30): Add unique suffix to container names All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Add crypto.randomBytes(4) hex suffix to container name generation to prevent name collisions when multiple agents spawn simultaneously within the same millisecond. Container names now include both a timestamp and 8 random hex characters for guaranteed uniqueness. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 15:22:12 -06:00
Jason Woltje	3880993b60	fix(SEC-ORCH-28+29): Add Valkey connection timeout + workItems MaxLength Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details SEC-ORCH-28: Add connectTimeout (5000ms default) and commandTimeout (3000ms default) to Valkey/Redis client to prevent indefinite connection hangs. Both are configurable via VALKEY_CONNECT_TIMEOUT_MS and VALKEY_COMMAND_TIMEOUT_MS environment variables. SEC-ORCH-29: Add @ArrayMaxSize(50) and @MaxLength(2000) to workItems in AgentContextDto to prevent memory exhaustion from unbounded input. Also adds @ArrayMaxSize(20) and @MaxLength(200) to skills array. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 15:19:44 -06:00
Jason Woltje	144495ae6b	fix(CQ-API-5): Document throttler in-memory fallback as best-effort All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Add comprehensive JSDoc and inline comments documenting the known race condition in the in-memory fallback path of ThrottlerValkeyStorageService. The non-atomic read-modify-write in incrementMemory() is intentionally left without a mutex because: - It is only the fallback path when Valkey is unavailable - The primary Valkey path uses atomic INCR and is race-free - Adding locking to a rarely-used degraded path adds complexity with minimal benefit Also adds Logger.warn calls when falling back to in-memory mode at runtime (Redis command failures). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 15:15:11 -06:00
Jason Woltje	08d077605a	fix(SEC-API-28): Replace MCP console.error with NestJS Logger All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Replace all console.error calls in MCP services with NestJS Logger instances for consistent structured logging in production. - mcp-hub.service.ts: Add Logger instance, replace console.error in onModuleDestroy cleanup - stdio-transport.ts: Add Logger instance, replace console.error for stderr output (as warn) and JSON parse failures (as error) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 15:11:41 -06:00
Jason Woltje	2e11931ded	fix(SEC-API-27): Scope RLS context to transaction boundary All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details createAuthMiddleware was calling SET LOCAL on the raw PrismaClient outside of any transaction. In PostgreSQL, SET LOCAL without a transaction acts as a session-level SET, which can leak RLS context to subsequent requests sharing the same pooled connection, enabling cross-tenant data access. Wrapped the setCurrentUser call and downstream handler execution inside a $transaction block so SET LOCAL is automatically reverted when the transaction ends (on both success and failure). Added comprehensive test suite for db-context module verifying: - RLS context is set on the transaction client, not the raw client - next() executes inside the transaction boundary - Authentication errors prevent any transaction from starting - Errors in downstream handlers propagate correctly Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 15:07:49 -06:00
Jason Woltje	617df12b52	fix(SEC-API-25+26): Enable strict ValidationPipe + tighten CORS origin All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details - Set forbidNonWhitelisted: true in ValidationPipe to reject requests with unknown DTO properties, preventing mass assignment vulnerabilities - Reject requests with no Origin header in production (SEC-API-26) - Restrict localhost:3001 to development mode only - Update CORS tests to cover production/development origin validation Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 15:02:55 -06:00
Jason Woltje	6c379d099a	chore(orchestrator): Bootstrap Phase 5 tasks for issue #340 Parsed 26 findings (7 CQ + 19 SEC-Low) into 17 tasks + verification. 2 findings already done (CQ-API-7, CQ-ORCH-9). Estimated total: 155K tokens. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 14:59:12 -06:00
Jason Woltje	92c310333c	fix(SEC-REVIEW-4-7): Address remaining MEDIUM security review findings All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details - Graceful container shutdown: detect "not running" containers and skip force-remove escalation, only SIGKILL for genuine stop failures - data: URI stripping: add security audit logging via NestJS Logger when data: URIs are blocked in markdown links and images - Orchestrator bootstrap: replace void bootstrap() with .catch() handler for clear startup failure logging and clean process.exit(1) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 14:51:22 -06:00
Jason Woltje	2bb1dffe97	docs(orchestrator): Note future DB-configurable settings Worker limits and other orchestrator settings will be configurable via the Coordinator service with DB-centric storage. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-06 14:49:57 -06:00
Jason Woltje	36f55558d2	fix(SEC-REVIEW-1): Surface search errors in LinkAutocomplete All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Previously the catch block in searchEntries silently swallowed all non-abort errors, showing "No entries found" when the search actually failed. This misled users into thinking the knowledge base was empty. - Add searchError state variable - Set PDA-friendly error message on non-abort failures - Clear error state on subsequent successful searches - Render error in amber (distinct from gray "No entries found") - Add 3 tests: error display, error clearing, abort exclusion Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 14:42:47 -06:00
Jason Woltje	57441e2e64	fix(SEC-REVIEW-3): Add @MaxLength to SearchQueryDto.q for consistency All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details All other search DTOs (SemanticSearchBodyDto, HybridSearchBodyDto, BrainQueryDto, BrainSearchDto) already enforce @MaxLength(500) on their query fields. SearchQueryDto.q was missed, leaving the full-text knowledge search endpoint accepting arbitrarily long queries. Adds @MaxLength(500) decorator and validation test coverage. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 14:39:08 -06:00
Jason Woltje	433212e00f	test(CQ-ORCH-9): Add SpawnAgentDto validation tests All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Adds 23 dedicated DTO-level validation tests for SpawnAgentDto and AgentContextDto using plainToInstance + validate() from class-validator. Covers: valid payloads, missing/empty taskId, invalid agentType, empty repository/branch, empty workItems, shell injection in branch names, SSRF in repository URLs, file:// protocol blocking, option injection, and invalid gateProfile values. Replaces the 5 controller-level validation tests removed in CQ-ORCH-9 with proper DTO-level equivalents. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 14:31:37 -06:00
Jason Woltje	298a379c42	chore(orchestrator): Add Phase 4 summary to learnings All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Phase 4: 12/12 tasks, 97% variance (estimates consistently low). Closed issue #347. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 14:10:47 -06:00
Jason Woltje	d52423d3ce	chore(orchestrator): Phase 4 complete - all 12 tasks done + verification Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details Phase 4: 12/12 tasks completed, 0 failed, 0 deferred. Test counts: api=2397, web=653, orchestrator=642, shared=17, ui=11. All quality gates passing (lint, typecheck, tests). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 14:10:13 -06:00
Jason Woltje	c9ad3a661a	fix(CQ-ORCH-9): Deduplicate spawn validation logic Some checks failed ci/woodpecker/push/woodpecker Pipeline failed Details Remove duplicate validateSpawnRequest from AgentsController. Validation is now handled exclusively by: 1. ValidationPipe + DTO decorators (HTTP layer, class-validator) 2. AgentSpawnerService.validateSpawnRequest (business logic layer) This eliminates the maintenance burden and divergence risk of having identical validation in two places. Controller tests for the removed duplicate validation are also removed since they are fully covered by the service tests and DTO validation decorators. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 14:09:06 -06:00
Jason Woltje	a0062494b7	fix(CQ-ORCH-7): Graceful Docker container shutdown before force remove All checks were successful ci/woodpecker/push/woodpecker Pipeline was successful Details Replace the always-force container removal (SIGKILL) with a two-phase approach: first attempt graceful stop (SIGTERM with configurable timeout), then remove without force. Falls back to force remove only if the graceful path fails. The graceful stop timeout is configurable via orchestrator.sandbox.gracefulStopTimeoutSeconds (default: 10s). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 14:05:53 -06:00

1 2 3 4 5 ...

537 Commits