feat: MACP Phase 2A — Event Bridge + Notification System (#11)
This commit was merged in pull request #11.
This commit is contained in:
@@ -3,3 +3,4 @@
|
||||
## Orchestrator Matrix
|
||||
|
||||
- [MACP Phase 1](./orchestrator-matrix/macp-phase1.md)
|
||||
- [MACP Phase 2A](./orchestrator-matrix/macp-phase2a.md)
|
||||
|
||||
45
docs/DEVELOPER-GUIDE/orchestrator-matrix/macp-phase2a.md
Normal file
45
docs/DEVELOPER-GUIDE/orchestrator-matrix/macp-phase2a.md
Normal file
@@ -0,0 +1,45 @@
|
||||
# MACP Phase 2A
|
||||
|
||||
MACP Phase 2A adds the repo-local event bridge that makes orchestrator lifecycle events consumable by external systems.
|
||||
|
||||
## What Changed
|
||||
|
||||
1. `tools/orchestrator-matrix/events/event_watcher.py` polls `.mosaic/orchestrator/events.ndjson`, parses appended NDJSON events, dispatches callbacks, and persists a byte-offset cursor in `.mosaic/orchestrator/event_cursor.json`.
|
||||
2. `tools/orchestrator-matrix/events/webhook_adapter.py` forwards selected MACP events to a configured webhook endpoint with bounded retries and optional bearer auth.
|
||||
3. `tools/orchestrator-matrix/events/discord_formatter.py` renders task lifecycle events into concise Discord-friendly status lines.
|
||||
4. `bin/mosaic-macp` adds `watch` mode for one-shot or continuous event processing.
|
||||
|
||||
## Watcher Behavior
|
||||
|
||||
1. File watching is polling-based and stdlib-only for portability.
|
||||
2. The watcher resets its cursor if the events file is truncated.
|
||||
3. Corrupt JSON lines are logged to stderr and skipped.
|
||||
4. A trailing partial line is left unread until the newline arrives, preventing half-written events from being consumed.
|
||||
|
||||
## Webhook Configuration
|
||||
|
||||
Configure `.mosaic/orchestrator/config.json` under `macp.webhook`:
|
||||
|
||||
```json
|
||||
{
|
||||
"macp": {
|
||||
"webhook": {
|
||||
"enabled": false,
|
||||
"url": "http://localhost:8080/macp/events",
|
||||
"auth_token": "",
|
||||
"timeout_seconds": 10,
|
||||
"retry_count": 2,
|
||||
"event_filter": ["task.completed", "task.failed", "task.escalated"]
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
## CLI
|
||||
|
||||
```bash
|
||||
mosaic macp watch --once
|
||||
mosaic macp watch --webhook
|
||||
```
|
||||
|
||||
`--once` performs a single poll and exits. `--webhook` enables delivery via the configured `macp.webhook` block while still printing Discord-formatted event lines to stdout.
|
||||
86
docs/MACP-BRIEF-TEMPLATE.md
Normal file
86
docs/MACP-BRIEF-TEMPLATE.md
Normal file
@@ -0,0 +1,86 @@
|
||||
# MACP Task Brief Template
|
||||
|
||||
**Use this template for all MACP task briefs.** Workers that receive briefs not following this structure should flag it as an issue.
|
||||
|
||||
---
|
||||
|
||||
```markdown
|
||||
# <Title>
|
||||
|
||||
**Branch:** `feat/<branch-name>`
|
||||
**Repo worktree:** `~/src/<repo>-worktrees/<task-slug>`
|
||||
|
||||
---
|
||||
|
||||
## Objective
|
||||
|
||||
<1-2 sentences: what is being built and why>
|
||||
|
||||
---
|
||||
|
||||
## Task 1: <Component Name>
|
||||
|
||||
<Description of what to build>
|
||||
|
||||
### Requirements:
|
||||
- <Specific, testable requirements>
|
||||
|
||||
### Key Functions/APIs:
|
||||
<Code signatures or interface definitions>
|
||||
|
||||
### Constraints:
|
||||
- <Language, dependencies, patterns to follow>
|
||||
|
||||
---
|
||||
|
||||
## Task 2: <Component Name>
|
||||
<Same structure as Task 1>
|
||||
|
||||
---
|
||||
|
||||
## Tests (MANDATORY)
|
||||
|
||||
**Every brief MUST include a Tests section. Workers MUST write tests before or alongside implementation. Tests MUST pass before committing.**
|
||||
|
||||
### Test file: `tests/test_<module>.py`
|
||||
|
||||
### Test cases:
|
||||
1. `test_<name>` — <what it verifies>
|
||||
2. `test_<name>` — <what it verifies>
|
||||
...
|
||||
|
||||
### Test runner:
|
||||
```bash
|
||||
python3 -m unittest discover -s tests -p 'test_*.py' -v
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
## Verification
|
||||
|
||||
1. All tests pass: `<test command>`
|
||||
2. Python syntax: `python3 -c "import <module>"`
|
||||
3. <Any additional verification steps>
|
||||
|
||||
## Ground Rules
|
||||
- Python 3.10+ stdlib only (no pip dependencies)
|
||||
- Commit message: `feat: <what changed>` (conventional commits)
|
||||
- Push to `feat/<branch>` branch when done
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
## Brief Sizing Rules
|
||||
|
||||
| Brief Type | Max Items | Rationale |
|
||||
|------------|-----------|-----------|
|
||||
| **Build** (new code) | 2-3 | High cognitive load per item |
|
||||
| **Fix** (surgical changes) | 5-7 | Low cognitive load, exact file/line/fix |
|
||||
| **Review** | 1 | Naturally focused |
|
||||
| **Test** (add tests) | 3-4 | Medium load, but well-scoped |
|
||||
|
||||
The key metric is **cognitive load per item**, not item count.
|
||||
- Build = construction (high load)
|
||||
- Fix = scalpel (low load)
|
||||
- Review = naturally focused
|
||||
- Test = moderate (reading existing code + writing test logic)
|
||||
105
docs/PRD.md
105
docs/PRD.md
@@ -1,4 +1,4 @@
|
||||
# PRD: MACP Phase 1 Core Protocol Implementation
|
||||
# PRD: MACP Phase 2A Event Bridge + Notification System
|
||||
|
||||
## Metadata
|
||||
|
||||
@@ -9,90 +9,91 @@
|
||||
|
||||
## Problem Statement
|
||||
|
||||
The current orchestrator-matrix rail can queue shell-based worker tasks, but it does not yet expose a standardized protocol for dispatch selection, worktree-aware execution, structured results, or manual MACP queue operations. MACP Phase 1 extends the existing rail so orchestrators can delegate to multiple runtimes through a consistent task model while preserving current behavior for legacy tasks.
|
||||
MACP Phase 1 writes structured lifecycle events to `.mosaic/orchestrator/events.ndjson`, but no repo-local bridge consumes those events for external systems. Phase 2A adds a portable watcher, webhook delivery, and Discord-friendly formatting so MACP event streams can drive OpenClaw integrations and human-facing notifications.
|
||||
|
||||
## Objectives
|
||||
|
||||
1. Extend the existing orchestrator-matrix protocol and controller to support MACP-aware task dispatch and status tracking.
|
||||
2. Add a dispatcher layer that manages worktree lifecycle, runtime command generation, and standardized results.
|
||||
3. Provide a CLI entrypoint for manual MACP submission, status inspection, queue draining, and history review.
|
||||
1. Add a synchronous event watcher that tails `events.ndjson` using stdlib-only file polling and persists cursor state across restarts.
|
||||
2. Add a webhook adapter that can forward selected MACP events to a configured HTTP endpoint with bounded retries.
|
||||
3. Add a Discord formatter that turns task lifecycle events into concise human-readable strings.
|
||||
4. Extend the `mosaic macp` CLI with a `watch` command for one-shot or continuous event bridge execution.
|
||||
|
||||
## Scope
|
||||
|
||||
### In Scope
|
||||
|
||||
1. Extend the orchestrator task and event schemas and add a result schema.
|
||||
2. Add a Python dispatcher module under `tools/orchestrator-matrix/dispatcher/`.
|
||||
3. Update the controller to use the dispatcher for MACP-aware tasks while preserving legacy execution paths.
|
||||
4. Update orchestrator config templates, task markdown sync logic, and CLI routing/scripts for MACP commands.
|
||||
5. Add verification for backward compatibility, schema validity, imports, and basic MACP execution flow.
|
||||
1. New `tools/orchestrator-matrix/events/` package with watcher, webhook adapter, and Discord formatter modules.
|
||||
2. Cursor persistence at `.mosaic/orchestrator/event_cursor.json`.
|
||||
3. `mosaic macp watch [--webhook] [--once]` CLI support using `.mosaic/orchestrator/config.json`.
|
||||
4. Stdlib-only verification of watcher polling, webhook delivery, Discord formatting, CLI watch behavior, and cursor persistence.
|
||||
5. Developer documentation and sitemap updates covering the Phase 2A event bridge.
|
||||
6. A repo-local unittest suite under `tests/` that covers watcher polling/cursor behavior, webhook delivery logic, and Discord formatting.
|
||||
|
||||
### Out of Scope
|
||||
|
||||
1. Rewriting the orchestrator controller architecture.
|
||||
2. Changing Matrix transport behavior beyond schema compatibility.
|
||||
3. Implementing real OpenClaw `sessions_spawn` execution beyond producing the config payload/command for callers.
|
||||
4. Adding non-stdlib Python dependencies or npm-based tooling.
|
||||
1. Adding Discord transport or webhook server hosting inside this repository.
|
||||
2. Replacing the existing Matrix transport bridge.
|
||||
3. Introducing async, threads, or third-party Python packages.
|
||||
4. Changing event emission behavior in the controller beyond consuming the existing event stream.
|
||||
|
||||
## User/Stakeholder Requirements
|
||||
|
||||
1. MACP must evolve the current orchestrator-matrix implementation rather than replace it.
|
||||
2. Legacy task queues without `dispatch` fields must continue to run exactly as before.
|
||||
3. MACP-aware tasks must support dispatch modes `yolo`, `acp`, and `exec`.
|
||||
4. Results must be written in a structured JSON format suitable for audit and orchestration follow-up.
|
||||
5. A manual `mosaic macp` CLI must expose submit, status, drain, and history flows.
|
||||
1. External systems must be able to consume MACP events without reading the NDJSON file directly.
|
||||
2. The watcher must remain portable across environments, so file polling is required instead of platform-specific file watching.
|
||||
3. Restarting the watcher must not replay previously consumed events.
|
||||
4. Webhook delivery failures must be logged and isolated so the watcher loop continues running.
|
||||
5. Discord formatting must stay concise and useful for task lifecycle visibility.
|
||||
|
||||
## Functional Requirements
|
||||
|
||||
1. Task schema must include MACP dispatch, worktree, result, retry, branch, brief, issue/PR, and dependency fields.
|
||||
2. Event schema must recognize `task.gated`, `task.escalated`, and `task.retry.scheduled`, plus a `dispatcher` source.
|
||||
3. Dispatcher functions must set up worktrees, build commands, execute tasks, collect results, and clean up worktrees.
|
||||
4. Controller `run_single_task()` must route MACP-aware tasks through the dispatcher and emit the correct lifecycle events/status transitions.
|
||||
5. `tasks_md_sync.py` must map optional MACP table columns only when those headers are present in `docs/TASKS.md`; absent MACP headers must not inject MACP fields into legacy tasks.
|
||||
6. `bin/mosaic` must route `mosaic macp ...` to a new `bin/mosaic-macp` script.
|
||||
1. `EventWatcher` must watch `.mosaic/orchestrator/events.ndjson`, parse appended JSON lines, and invoke registered callbacks for matching event types.
|
||||
2. `EventWatcher.poll_once()` must tolerate a missing events file, truncated/corrupt lines, and cursor positions that are stale after file truncation.
|
||||
3. Cursor writes must be atomic and stored at `.mosaic/orchestrator/event_cursor.json`.
|
||||
4. `send_webhook(event, config)` must POST JSON to the configured URL using `urllib.request`, optionally adding a bearer token, respecting timeout, and retrying with exponential backoff.
|
||||
5. `create_webhook_callback(config)` must return a callback that swallows/logs failures instead of raising into the watcher loop.
|
||||
6. `format_event(event)` must support `task.completed`, `task.failed`, `task.escalated`, `task.gated`, and `task.started`, including useful task metadata when present.
|
||||
7. `format_summary(events)` must produce a short batch summary suitable for notification digests.
|
||||
8. `bin/mosaic-macp` must expose `watch`, optionally enabling webhook delivery from config, and support one-shot polling with `--once`.
|
||||
|
||||
## Non-Functional Requirements
|
||||
|
||||
1. Security: no secrets embedded in generated commands, config, or results.
|
||||
2. Performance: controller remains deterministic and synchronous with no async or thread-based orchestration.
|
||||
3. Reliability: worktree creation/cleanup failures must be surfaced predictably and produce structured task failure/escalation states.
|
||||
4. Observability: lifecycle events, logs, and result JSON must clearly show task outcome, attempts, gates, and errors.
|
||||
1. Security: no secrets embedded in code or logs; auth token only sent via header when configured.
|
||||
2. Performance: each webhook attempt must be bounded by `timeout_seconds`; no event-processing path may hang indefinitely.
|
||||
3. Reliability: corrupt input lines and callback delivery failures must be logged to stderr and skipped without crashing the watcher.
|
||||
4. Portability: Python 3.10+ stdlib only; no OS-specific file watcher APIs.
|
||||
5. Observability: warnings and failures must be clear enough to diagnose cursor, parsing, and webhook problems.
|
||||
|
||||
## Acceptance Criteria
|
||||
|
||||
1. Existing legacy tasks without `dispatch` still run through the old shell path with unchanged behavior.
|
||||
2. MACP-aware `exec` tasks run through the dispatcher and produce result JSON with gate outcomes.
|
||||
3. New schemas validate task/event/result payload expectations for MACP fields and statuses.
|
||||
4. `mosaic macp submit`, `status`, and `history` work from a bootstrapped repo state, and `drain` delegates to the existing orchestrator runner.
|
||||
5. Python imports for the updated controller, dispatcher, and sync code complete without errors on Python 3.10+.
|
||||
1. `EventWatcher.poll_once()` reads newly appended events, returns parsed dicts, invokes registered callbacks, and skips already-consumed events after restart.
|
||||
2. Webhook delivery posts matching events to a local test endpoint, supports bearer auth configuration, and retries boundedly on failure.
|
||||
3. Discord formatter returns expected concise strings for the required task lifecycle event types and a usable batch summary.
|
||||
4. `mosaic macp watch --once` processes events from a bootstrapped repo state without error and honors `--webhook`.
|
||||
5. Cursor persistence prevents replay on a second run and resets safely when the events file is truncated.
|
||||
6. `python3 -m unittest discover -s tests -p 'test_*.py' -v` passes with stdlib-only tests for the Phase 2A event bridge modules.
|
||||
|
||||
## Constraints and Dependencies
|
||||
|
||||
1. Python implementation must use stdlib only and support Python 3.10+.
|
||||
2. Shell tooling must remain bash-based and fit the existing Mosaic CLI style.
|
||||
3. Dispatch fallback rules must use `exec` when `dispatch` is absent and config/default runtime when `runtime` is absent.
|
||||
4. Worktree convention must derive from the repository name and task metadata unless explicitly overridden by task fields.
|
||||
2. Shell CLI behavior must remain bash-based and consistent with the existing Mosaic command style.
|
||||
3. The watcher consumes the event schema already emitted by Phase 1 controller logic.
|
||||
4. Webhook configuration lives under `.mosaic/orchestrator/config.json` at `macp.webhook`.
|
||||
|
||||
## Risks and Open Questions
|
||||
|
||||
1. Risk: yolo command execution requires a PTY, so the dispatcher needs a safe wrapper that still behaves under `subprocess`.
|
||||
2. Risk: worktree cleanup could remove a path unexpectedly if task metadata is malformed.
|
||||
3. Risk: old queue consumers may assume only the original task statuses and event types.
|
||||
4. Open Question: whether `task.gated` should be emitted by the dispatcher or controller once worker execution ends and quality gates begin.
|
||||
1. Risk: partial writes may leave an incomplete trailing JSON line that must not advance the cursor incorrectly.
|
||||
2. Risk: synchronous webhook retries can slow one poll cycle if the endpoint is unavailable; timeout and retry behavior must remain bounded.
|
||||
3. Risk: event payloads may omit optional metadata fields, so formatter output must degrade cleanly.
|
||||
4. ASSUMPTION: the watcher should advance past corrupt lines after logging them so a single bad line does not permanently stall downstream consumption.
|
||||
5. ASSUMPTION: CLI `watch` should default to no-op callback processing when no delivery option is enabled, while still updating the cursor and reporting processed count.
|
||||
|
||||
## Testing and Verification Expectations
|
||||
|
||||
1. Baseline checks: Python import validation, targeted script execution checks, JSON syntax/schema validation, and any repo-local validation applicable to changed code paths.
|
||||
2. Situational testing: legacy orchestrator run with old-style tasks, MACP `exec` flow including result file generation, CLI submit/status/history behavior, and worktree lifecycle validation.
|
||||
3. Evidence format: command-level results captured in the scratchpad and summarized in the final delivery report.
|
||||
1. Baseline checks: Python bytecode compilation/import validation for new modules and shell syntax validation for `bin/mosaic-macp`.
|
||||
2. Situational tests: temporary orchestrator state exercising watcher polling, callback filtering, webhook POST capture/mocking, formatter sanitization, CLI one-shot watch execution, and cursor persistence across repeated runs.
|
||||
3. Evidence format: command-level results recorded in the scratchpad and summarized against acceptance criteria.
|
||||
|
||||
## Milestone / Delivery Intent
|
||||
|
||||
1. Target milestone/version: 0.0.x bootstrap enhancement
|
||||
2. Definition of done: code merged to `main`, CI terminal green, issue `#8` closed, and verification evidence recorded against all acceptance criteria.
|
||||
|
||||
## Assumptions
|
||||
|
||||
1. ASSUMPTION: A single issue can track the full Phase 1 implementation because the user requested one bounded feature delivery rather than separate independent tickets.
|
||||
2. ASSUMPTION: For `acp` dispatch in Phase 1, the controller must escalate the task immediately with a clear reason instead of pretending work ran before OpenClaw integration exists.
|
||||
3. ASSUMPTION: `task.gated` should be emitted by the controller as the transition into quality-gate execution, which keeps gate-state ownership in one place alongside the existing gate loop.
|
||||
1. Target milestone/version: Phase 2A observability bridge
|
||||
2. Definition of done: code merged to `main`, CI terminal green, issue `#10` closed, and verification evidence recorded against all acceptance criteria.
|
||||
|
||||
@@ -4,4 +4,6 @@
|
||||
- [Tasks](./TASKS.md)
|
||||
- [Developer Guide](./DEVELOPER-GUIDE/README.md)
|
||||
- [Task Briefs](./tasks/MACP-PHASE1-brief.md)
|
||||
- [Task Briefs](./tasks/MACP-PHASE2A-brief.md)
|
||||
- [Scratchpads](./scratchpads/macp-phase1.md)
|
||||
- [Scratchpads](./scratchpads/macp-phase2a.md)
|
||||
|
||||
@@ -14,4 +14,5 @@ Canonical tracking for active work. Keep this file current.
|
||||
|
||||
| id | status | description | issue | repo | branch | depends_on | blocks | agent | started_at | completed_at | estimate | used | notes |
|
||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
||||
| MACP-PHASE1 | in-progress | Implement MACP Phase 1 across orchestrator schemas, dispatcher, controller, CLI, config, and task sync while preserving legacy queue behavior. | #8 | bootstrap | feat/macp-phase1 | | | Jarvis | 2026-03-27T23:00:00Z | | medium | in-progress | Review-fix pass started 2026-03-28T00:38:01Z to address backward-compatibility, ACP safety, result timing, and worktree/brief security findings on top of the blocked PR-create state. Prior blocker remains: `~/.config/mosaic/tools/git/pr-create.sh -t 'feat: implement MACP phase 1 core protocol' -b ... -B main -H feat/macp-phase1` failed with `Remote repository required: Specify ID via --repo or execute from a local git repo.` |
|
||||
| MACP-PHASE2A | in-progress | Build the MACP event bridge with event watcher, webhook adapter, Discord formatter, CLI watch wiring, docs updates, and verification evidence. | #10 | bootstrap | feat/macp-phase2a | | | Jarvis | 2026-03-28T02:02:38Z | | medium | in-progress | Issue created via `~/.config/mosaic/tools/git/issue-create.sh` fallback after `tea` reported `Remote repository required: Specify ID via --repo or execute from a local git repo.` |
|
||||
| MACP-PHASE2A-TESTS | in-progress | Add comprehensive stdlib unittest coverage for the Phase 2A event bridge modules and runner scaffolding. | #10 | bootstrap | feat/macp-phase2a | MACP-PHASE2A | | Jarvis | 2026-03-28T02:17:40Z | | small | in-progress | User-requested follow-on task from `docs/tasks/MACP-PHASE2A-tests.md`; verification target is `python3 -m unittest discover -s tests -p 'test_*.py' -v`. |
|
||||
|
||||
74
docs/scratchpads/macp-phase2a.md
Normal file
74
docs/scratchpads/macp-phase2a.md
Normal file
@@ -0,0 +1,74 @@
|
||||
# MACP Phase 2A Scratchpad
|
||||
|
||||
## Session Start
|
||||
|
||||
- Session date: 2026-03-27 / 2026-03-28 America/Chicago
|
||||
- Branch: `feat/macp-phase2a`
|
||||
- Issue: `#10`
|
||||
- Objective: build the MACP event bridge and notification system from `docs/tasks/MACP-PHASE2A-brief.md`
|
||||
|
||||
## Budget
|
||||
|
||||
- Budget mode: inferred working budget
|
||||
- Estimate: medium
|
||||
- Token strategy: keep context narrow to event-bridge files, verify with targeted temp-repo tests, avoid unnecessary parallel deep dives
|
||||
|
||||
## Requirements Notes
|
||||
|
||||
- PRD updated to Phase 2A before coding
|
||||
- TDD requirement: not mandatory for this feature work; targeted verification is sufficient because this is new observability functionality rather than a bug fix or auth/data-mutation change
|
||||
- Documentation gate applies because developer-facing behavior and CLI surface change
|
||||
|
||||
## Assumptions
|
||||
|
||||
1. ASSUMPTION: corrupt or partial lines should be logged and skipped while still advancing the cursor past the offending line, preventing permanent replay loops.
|
||||
2. ASSUMPTION: `mosaic macp watch` may run without webhook delivery enabled and should still process events plus persist cursor state.
|
||||
3. ASSUMPTION: Discord formatting remains a pure formatting layer; no outbound Discord transport is part of Phase 2A.
|
||||
|
||||
## Plan
|
||||
|
||||
1. Update PRD/TASKS and create the Phase 2A issue/scratchpad.
|
||||
2. Implement watcher, webhook adapter, formatter, and CLI wiring.
|
||||
3. Update developer docs and sitemap.
|
||||
4. Run baseline and situational verification.
|
||||
5. Run independent code review, remediate findings, then commit/push/PR/merge/CI/issue-close.
|
||||
|
||||
## Progress Log
|
||||
|
||||
- 2026-03-28T02:02:38Z: Created provider issue `#10` for Phase 2A using Mosaic wrapper with Gitea API fallback.
|
||||
- 2026-03-28T02:02:38Z: Replaced stale Phase 1 PRD/TASKS planning state with Phase 2A scope and tracking.
|
||||
- 2026-03-28T02:17:40Z: Resumed Phase 2A for the test-suite follow-on task; loaded Mosaic intake, runtime, resume protocol, shared memory, and issue state before implementation.
|
||||
- 2026-03-28T02:17:40Z: Updated PRD/TASKS to include the stdlib unittest coverage requirement and the `MACP-PHASE2A-TESTS` tracking row.
|
||||
- 2026-03-28T02:23:08Z: Added repo-local unittest coverage for watcher, webhook adapter, and Discord formatter plus `tests/run_tests.sh`.
|
||||
- 2026-03-28T02:23:08Z: Test-driven remediation exposed and fixed two formatter sanitization bugs (`re.sub` replacement escaping and ANSI escape stripping order).
|
||||
- 2026-03-28T02:23:08Z: Tightened webhook callback config semantics so `enabled` and `event_filter` are enforced directly by `create_webhook_callback`; tightened literal-IP SSRF blocking to match requested tests.
|
||||
|
||||
## Verification Plan
|
||||
|
||||
| Acceptance Criterion | Verification Method | Evidence |
|
||||
|---|---|---|
|
||||
| AC-1 watcher polls new events and respects cursor | Temp events file + repeated `poll_once()` / CLI runs | pending |
|
||||
| AC-2 webhook delivery retries and succeeds/fails cleanly | Local stdlib echo server capture | pending |
|
||||
| AC-3 Discord formatting covers required event types | Targeted Python formatter check | pending |
|
||||
| AC-4 `mosaic macp watch --once` runs cleanly | CLI one-shot execution in temp repo | pending |
|
||||
| AC-5 cursor persistence handles repeat run and truncation | Temp repo repeated runs with truncated file scenario | pending |
|
||||
| AC-6 unittest suite passes for Phase 2A modules | `python3 -m unittest discover -s tests -p 'test_*.py' -v` | pass |
|
||||
|
||||
## Tests Run
|
||||
|
||||
- `bash -n tests/run_tests.sh` — pass
|
||||
- `python3 -m py_compile tests/__init__.py tests/conftest.py tests/test_event_watcher.py tests/test_webhook_adapter.py tests/test_discord_formatter.py tools/orchestrator-matrix/events/webhook_adapter.py tools/orchestrator-matrix/events/discord_formatter.py` — pass
|
||||
- `./tests/run_tests.sh` — pass (24 tests)
|
||||
- `python3 -m unittest discover -s tests -p 'test_*.py' -v` — pass (24 tests)
|
||||
- `python3 -m pytest tests/` — environment limitation: `pytest` module is not installed in this worktree runtime, so compatibility was inferred from stdlib-only `unittest` test structure rather than executed here
|
||||
|
||||
## Review Notes
|
||||
|
||||
- Manual review of the final delta found no remaining correctness issues after the formatter sanitization fixes and webhook config enforcement updates.
|
||||
- `~/.config/mosaic/tools/codex/codex-security-review.sh --uncommitted` — no findings, risk level `none`
|
||||
- `~/.config/mosaic/tools/codex/codex-code-review.sh --uncommitted` did not return a terminal summary in this runtime; relied on manual review plus passing tests for the final gate in this session.
|
||||
|
||||
## Risks / Blockers
|
||||
|
||||
- Potential git wrapper friction in worktrees for PR creation/merge steps; if it recurs, capture exact failing command and stop per Mosaic contract.
|
||||
- `pytest` is not installed in the current runtime, so the suite’s pytest compatibility was not executed end-to-end here.
|
||||
152
docs/tasks/MACP-PHASE2A-brief.md
Normal file
152
docs/tasks/MACP-PHASE2A-brief.md
Normal file
@@ -0,0 +1,152 @@
|
||||
# MACP Phase 2A — Event Bridge + Notification System
|
||||
|
||||
**Branch:** `feat/macp-phase2a`
|
||||
**Repo worktree:** `~/src/mosaic-bootstrap-worktrees/macp-phase2a`
|
||||
|
||||
---
|
||||
|
||||
## Objective
|
||||
|
||||
Build the event bridge that makes MACP events consumable by external systems (OpenClaw, Discord, webhooks). This is the observability layer — the controller already writes events to `events.ndjson`, but nothing reads them yet.
|
||||
|
||||
---
|
||||
|
||||
## Task 1: Event File Watcher (`tools/orchestrator-matrix/events/event_watcher.py`)
|
||||
|
||||
New Python module that tails `events.ndjson` and fires callbacks on new events.
|
||||
|
||||
### Requirements:
|
||||
- Watch `.mosaic/orchestrator/events.ndjson` for new lines (use file polling, not inotify — keeps it portable)
|
||||
- Parse each new line as JSON
|
||||
- Call registered callback functions with the parsed event
|
||||
- Support filtering by event type (e.g., only `task.completed` and `task.failed`)
|
||||
- Maintain a cursor (last read position) so restarts don't replay old events
|
||||
- Cursor stored in `.mosaic/orchestrator/event_cursor.json`
|
||||
|
||||
### Key Functions:
|
||||
```python
|
||||
class EventWatcher:
|
||||
def __init__(self, events_path: Path, cursor_path: Path, poll_interval: float = 2.0):
|
||||
...
|
||||
|
||||
def on(self, event_types: list[str], callback: Callable[[dict], None]) -> None:
|
||||
"""Register a callback for specific event types."""
|
||||
|
||||
def poll_once(self) -> list[dict]:
|
||||
"""Read new events since last cursor position. Returns list of new events."""
|
||||
|
||||
def run(self, max_iterations: int = 0) -> None:
|
||||
"""Polling loop. max_iterations=0 means infinite."""
|
||||
```
|
||||
|
||||
### Constraints:
|
||||
- Python 3.10+ stdlib only (no pip dependencies)
|
||||
- Must handle truncated/corrupt lines gracefully (skip, log warning)
|
||||
- File might not exist yet — handle gracefully
|
||||
- Thread-safe cursor updates (atomic write via temp file rename)
|
||||
|
||||
---
|
||||
|
||||
## Task 2: Webhook Adapter (`tools/orchestrator-matrix/events/webhook_adapter.py`)
|
||||
|
||||
POST events to a configurable URL. This is how the OC plugin will consume MACP events.
|
||||
|
||||
### Requirements:
|
||||
- Accept an event dict, POST it as JSON to a configured URL
|
||||
- Support optional `Authorization` header (bearer token)
|
||||
- Configurable from `.mosaic/orchestrator/config.json` under `macp.webhook`:
|
||||
```json
|
||||
{
|
||||
"macp": {
|
||||
"webhook": {
|
||||
"enabled": false,
|
||||
"url": "http://localhost:8080/macp/events",
|
||||
"auth_token": "",
|
||||
"timeout_seconds": 10,
|
||||
"retry_count": 2,
|
||||
"event_filter": ["task.completed", "task.failed", "task.escalated"]
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
- Retry with exponential backoff on failure (configurable count)
|
||||
- Log failures but don't crash the watcher
|
||||
- Return success/failure status
|
||||
|
||||
### Key Functions:
|
||||
```python
|
||||
def send_webhook(event: dict, config: dict) -> bool:
|
||||
"""POST event to webhook URL. Returns True on success."""
|
||||
|
||||
def create_webhook_callback(config: dict) -> Callable[[dict], None]:
|
||||
"""Factory that creates a watcher callback from config."""
|
||||
```
|
||||
|
||||
### Constraints:
|
||||
- Use `urllib.request` only (no `requests` library)
|
||||
- Must not block the event watcher for more than `timeout_seconds` per event
|
||||
- Log to stderr on failure
|
||||
|
||||
---
|
||||
|
||||
## Task 3: Discord Notification Formatter (`tools/orchestrator-matrix/events/discord_formatter.py`)
|
||||
|
||||
Format MACP events into human-readable Discord messages.
|
||||
|
||||
### Requirements:
|
||||
- Format functions for each event type:
|
||||
- `task.completed` → "✅ **Task TASK-001 completed** — Implement user auth (attempt 1/1, 45s)"
|
||||
- `task.failed` → "❌ **Task TASK-001 failed** — Build error: exit code 1 (attempt 2/3)"
|
||||
- `task.escalated` → "🚨 **Task TASK-001 escalated** — Gate failures after 3 attempts. Human review needed."
|
||||
- `task.gated` → "🔍 **Task TASK-001 gated** — Quality gates running..."
|
||||
- `task.started` → "⚙️ **Task TASK-001 started** — Worker: codex, dispatch: yolo"
|
||||
- Include task metadata: runtime, dispatch type, attempt count, duration (if available)
|
||||
- Keep messages concise — Discord has character limits
|
||||
- Return plain strings (the caller decides where to send them)
|
||||
|
||||
### Key Functions:
|
||||
```python
|
||||
def format_event(event: dict) -> str | None:
|
||||
"""Format an MACP event for Discord. Returns None for unformattable events."""
|
||||
|
||||
def format_summary(events: list[dict]) -> str:
|
||||
"""Format a batch summary (e.g., daily digest)."""
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
## Wiring: CLI Integration
|
||||
|
||||
Add to `bin/mosaic-macp`:
|
||||
```bash
|
||||
mosaic macp watch [--webhook] [--once]
|
||||
```
|
||||
- `watch`: Start the event watcher with configured callbacks
|
||||
- `--webhook`: Enable webhook delivery (reads config from `.mosaic/orchestrator/config.json`)
|
||||
- `--once`: Poll once and exit (useful for cron)
|
||||
|
||||
---
|
||||
|
||||
## Verification
|
||||
|
||||
1. Create a test `events.ndjson` with sample events, run `event_watcher.poll_once()`, verify all events returned
|
||||
2. Run watcher with webhook pointing to a local echo server, verify POST payload
|
||||
3. Format each event type through `discord_formatter`, verify output strings
|
||||
4. `mosaic macp watch --once` processes events without error
|
||||
5. Cursor persistence: run twice, second run returns no events
|
||||
|
||||
## File Map
|
||||
```
|
||||
tools/orchestrator-matrix/
|
||||
├── events/
|
||||
│ ├── __init__.py ← NEW
|
||||
│ ├── event_watcher.py ← NEW
|
||||
│ ├── webhook_adapter.py ← NEW
|
||||
│ └── discord_formatter.py ← NEW
|
||||
```
|
||||
|
||||
## Ground Rules
|
||||
- Python 3.10+ stdlib only
|
||||
- No async/threads — synchronous polling
|
||||
- Commit: `feat: add MACP event bridge — watcher, webhook, Discord formatter`
|
||||
- Push to `feat/macp-phase2a`
|
||||
81
docs/tasks/MACP-PHASE2A-tests.md
Normal file
81
docs/tasks/MACP-PHASE2A-tests.md
Normal file
@@ -0,0 +1,81 @@
|
||||
# MACP Phase 2A — Test Suite
|
||||
|
||||
**Branch:** `feat/macp-phase2a` (commit on top of existing)
|
||||
**Repo worktree:** `~/src/mosaic-bootstrap-worktrees/macp-phase2a`
|
||||
|
||||
---
|
||||
|
||||
## Objective
|
||||
|
||||
Write a comprehensive test suite for the Phase 2A event bridge code using Python `unittest` (stdlib only). Tests must be runnable with `python3 -m pytest tests/` or `python3 -m unittest discover tests/`.
|
||||
|
||||
---
|
||||
|
||||
## Task 1: Test infrastructure (`tests/conftest.py` + `tests/run_tests.sh`)
|
||||
|
||||
Create `tests/` directory at repo root with:
|
||||
- `conftest.py` — shared fixtures: temp directories, sample events, sample config
|
||||
- `run_tests.sh` — simple runner: `python3 -m unittest discover -s tests -p 'test_*.py' -v`
|
||||
- `__init__.py` — empty, makes tests a package
|
||||
|
||||
Sample events fixture should include one of each type: `task.assigned`, `task.started`, `task.completed`, `task.failed`, `task.escalated`, `task.gated`, `task.retry.scheduled`
|
||||
|
||||
---
|
||||
|
||||
## Task 2: Event watcher tests (`tests/test_event_watcher.py`)
|
||||
|
||||
Test the `EventWatcher` class from `tools/orchestrator-matrix/events/event_watcher.py`.
|
||||
|
||||
### Test cases:
|
||||
1. `test_poll_empty_file` — No events file exists → returns empty list
|
||||
2. `test_poll_new_events` — Write 3 events to ndjson, poll → returns all 3
|
||||
3. `test_cursor_persistence` — Poll once (reads 3), poll again → returns 0 (cursor saved)
|
||||
4. `test_cursor_survives_restart` — Poll, create new watcher instance, poll → no duplicates
|
||||
5. `test_corrupt_line_skipped` — Insert a corrupt JSON line between valid events → valid events returned, corrupt skipped
|
||||
6. `test_callback_filtering` — Register callback for `task.completed` only → only completed events trigger it
|
||||
7. `test_callback_receives_events` — Register callback, poll → callback called with correct event dicts
|
||||
8. `test_file_grows_between_polls` — Poll (gets 2), append 3 more, poll → gets 3
|
||||
|
||||
---
|
||||
|
||||
## Task 3: Webhook adapter tests (`tests/test_webhook_adapter.py`)
|
||||
|
||||
Test `send_webhook` and `create_webhook_callback` from `tools/orchestrator-matrix/events/webhook_adapter.py`.
|
||||
|
||||
### Test cases:
|
||||
1. `test_send_webhook_success` — Mock HTTP response 200 → returns True
|
||||
2. `test_send_webhook_failure` — Mock HTTP response 500 → returns False
|
||||
3. `test_send_webhook_timeout` — Mock timeout → returns False, no crash
|
||||
4. `test_send_webhook_retry` — Mock 500 then 200 → retries and succeeds
|
||||
5. `test_event_filter` — Config with filter `["task.completed"]` → callback ignores `task.started`
|
||||
6. `test_webhook_disabled` — Config with `enabled: false` → no HTTP call made
|
||||
7. `test_ssrf_blocked` — URL with private IP (127.0.0.1, 10.x) → blocked, returns False
|
||||
|
||||
Use `unittest.mock.patch` to mock `urllib.request.urlopen`.
|
||||
|
||||
---
|
||||
|
||||
## Task 4: Discord formatter tests (`tests/test_discord_formatter.py`)
|
||||
|
||||
Test `format_event` and `format_summary` from `tools/orchestrator-matrix/events/discord_formatter.py`.
|
||||
|
||||
### Test cases:
|
||||
1. `test_format_completed` — Completed event → contains "✅" and task ID
|
||||
2. `test_format_failed` — Failed event → contains "❌" and error message
|
||||
3. `test_format_escalated` — Escalated event → contains "🚨" and escalation reason
|
||||
4. `test_format_gated` — Gated event → contains "🔍"
|
||||
5. `test_format_started` — Started event → contains "⚙️" and runtime info
|
||||
6. `test_format_unknown_type` — Unknown event type → returns None
|
||||
7. `test_sanitize_control_chars` — Event with control characters in message → stripped in output
|
||||
8. `test_sanitize_mentions` — Event with `@everyone` in message → neutralized in output
|
||||
9. `test_format_summary` — List of mixed events → summary with counts
|
||||
|
||||
---
|
||||
|
||||
## Verification
|
||||
|
||||
After writing tests:
|
||||
1. `cd ~/src/mosaic-bootstrap-worktrees/macp-phase2a && python3 -m unittest discover -s tests -p 'test_*.py' -v` — ALL tests must pass
|
||||
2. Fix any failures before committing
|
||||
|
||||
Commit: `test: add comprehensive test suite for Phase 2A event bridge`
|
||||
Reference in New Issue
Block a user