Track orchestrator agent task completions #372

New Issue

jason.woltje · 2026-02-15T05:28:40Z

jason.woltje commented

2026-02-15 05:28:40 +00:00

Summary

When the orchestrator dispatches a task to a coding harness (Claude Code, Codex CLI, OpenCode) and the task completes, emit a TaskCompletionEvent capturing the full execution context.

Context

The orchestrator manages agent task lifecycle: dispatch → monitor → collect results. This is the ideal point to capture end-to-end task metrics that the individual LLM call tracking (separate issue) cannot see — total duration across multiple LLM calls, retry behavior, quality gate enforcement results, and context window management.

Requirements

Event Fields to Capture

Field	Source
`task_type`	From task definition (implementation, debugging, etc.)
`complexity`	From task definition or auto-assessed
`harness`	Which CLI tool ran (claude_code, opencode, etc.)
`model` / `provider`	From harness config
`task_duration_ms`	Wall-clock: dispatch → completion
`estimated_*_tokens`	From prediction API (pre-task)
`actual_*_tokens`	From harness output/logs
`estimated_cost_usd_micros`	From prediction API
`actual_cost_usd_micros`	Computed from actual tokens
`quality_gate_passed`	Orchestrator's quality coordinator result
`quality_gates_run`	Which gates enforced (build, lint, test, typecheck, security)
`quality_gates_failed`	Which gates failed
`context_compactions`	From harness output
`context_rotations`	From harness output
`context_utilization_final`	From harness output (0.0-1.0)
`outcome`	success, failure, partial, timeout
`retry_count`	How many retries before final outcome
`language`	Primary language of the task
`repo_size_category`	tiny, small, medium, large, huge

Integration Points

Task completion handler in the coordinator
Retry logic (increment retry_count per attempt)
Quality gate enforcement (record gate results)
Timeout handler (outcome=timeout)
Killswitch activation (outcome=failure with context)

Acceptance Criteria

Every completed agent task emits a TaskCompletionEvent
Failed tasks tracked with appropriate outcome
Retried tasks increment retry_count
Quality gate results accurately recorded
Context window metrics captured from harness output
Token/cost estimates populated from predictions when available
Unit tests for event construction
No impact on task execution performance (non-blocking track)

## Summary When the orchestrator dispatches a task to a coding harness (Claude Code, Codex CLI, OpenCode) and the task completes, emit a `TaskCompletionEvent` capturing the full execution context. ## Context The orchestrator manages agent task lifecycle: dispatch → monitor → collect results. This is the ideal point to capture end-to-end task metrics that the individual LLM call tracking (separate issue) cannot see — total duration across multiple LLM calls, retry behavior, quality gate enforcement results, and context window management. ## Requirements ### Event Fields to Capture | Field | Source | |-------|--------| | `task_type` | From task definition (implementation, debugging, etc.) | | `complexity` | From task definition or auto-assessed | | `harness` | Which CLI tool ran (claude_code, opencode, etc.) | | `model` / `provider` | From harness config | | `task_duration_ms` | Wall-clock: dispatch → completion | | `estimated_*_tokens` | From prediction API (pre-task) | | `actual_*_tokens` | From harness output/logs | | `estimated_cost_usd_micros` | From prediction API | | `actual_cost_usd_micros` | Computed from actual tokens | | `quality_gate_passed` | Orchestrator's quality coordinator result | | `quality_gates_run` | Which gates enforced (build, lint, test, typecheck, security) | | `quality_gates_failed` | Which gates failed | | `context_compactions` | From harness output | | `context_rotations` | From harness output | | `context_utilization_final` | From harness output (0.0-1.0) | | `outcome` | success, failure, partial, timeout | | `retry_count` | How many retries before final outcome | | `language` | Primary language of the task | | `repo_size_category` | tiny, small, medium, large, huge | ### Integration Points - Task completion handler in the coordinator - Retry logic (increment retry_count per attempt) - Quality gate enforcement (record gate results) - Timeout handler (outcome=timeout) - Killswitch activation (outcome=failure with context) ## Acceptance Criteria - [ ] Every completed agent task emits a TaskCompletionEvent - [ ] Failed tasks tracked with appropriate outcome - [ ] Retried tasks increment retry_count - [ ] Quality gate results accurately recorded - [ ] Context window metrics captured from harness output - [ ] Token/cost estimates populated from predictions when available - [ ] Unit tests for event construction - [ ] No impact on task execution performance (non-blocking track)

jason.woltje added the ai label 2026-02-15 05:28:40 +00:00

jason.woltje added this to the M10-Telemetry (0.0.10) milestone 2026-02-15 05:31:19 +00:00

jason.woltje referenced this issue from a commit

2026-02-15 07:52:59 +00:00

feat(#372): track orchestrator agent task completions via telemetry

jason.woltje closed this issue

2026-02-15 08:04:34 +00:00

jason.woltje commented

2026-02-15 08:05:01 +00:00

Completed in commit 36e6cdd on feature/m10-telemetry. Added _emit_task_telemetry to both Coordinator and OrchestrationLoop with agent-to-telemetry field mapping, non-blocking fire-and-forget. Tests passing.

Completed in commit 36e6cdd on feature/m10-telemetry. Added _emit_task_telemetry to both Coordinator and OrchestrationLoop with agent-to-telemetry field mapping, non-blocking fire-and-forget. Tests passing.

jason.woltje referenced this issue

2026-02-15 08:05:45 +00:00

feat: M10-Telemetry — Mosaic Telemetry integration #407

jason.woltje referenced this issue from a commit

2026-02-15 08:10:33 +00:00

feat(#372): track orchestrator agent task completions via telemetry

Sign in to join this conversation.

Branches Tags

main

fix/ci-glibc-image

fix/dockerfile-npmrc

fix/matrix-native-binary

fix/kaniko-cache

fix/base-image-kaniko-v2

fix/base-image-kaniko

feat/custom-base-image

ci/pnpm-cache

fix/interceptor-tests

fix/kanban-tests

feat/wire-chat

feat/usage-widget

fix/security-hardening

fix/project-domain-v2

feat/kanban-add-task

fix/project-domain-attach

fix/logs-page-clean

fix/workspace-members

fix/ci-lint-632

fix/file-manager-tags

fix/csrf-debug-log

fix/controller-type-imports

fix/system-admin-env

fix/gateway-cors-trusted-origins

feat/project-detail-page

fix/fleet-provider-form-dto-v2

fix/ms22-audit

fix/orchestrator-widgets

fix/fleet-provider-form-dto

fix/csrf-bearer-bypass

fix/ms22-missing-authmodule-imports

fix/container-lifecycle-config-module

fix/swarm-compose-ms22-vars

chore/ms22-p1-complete

feat/ms22-p1h-settings-ui

feat/ms22-p1f-onboarding-ui

feat/ms22-p1i-chat-proxy

feat/ms22-p1k-idle-reaper

feat/ms22-p1j-docker

feat/ms22-p1e-onboarding-api

feat/ms22-p1g-settings-api

feat/ms22-p1d-container-mgr

feat/ms22-p1c-config-api

chore/ms22-prd-tracking

feat/ms22-p1a-schema

feat/ms22-p1b-crypto

chore/ms22-p1-tasks

docs/ms22-architecture

feat/ms22-openclaw-docker

feat/ms22-openclaw-gateway-module

chore/ms21-complete

chore/ms21-final-tasks-done

fix/ms21-ui-001-qa

test/ms21-ui-tests

chore/ms21-tasks-sync

chore/ms22-phase0-complete

feat/ms22-ingest-clean

feat/ms21-ui-users-members

feat/ms22-task-agent

chore/tasks-final

chore/tasks-update

feat/ms21-session-invalidation

feat/ms21-rbac-settings

feat/ms21-teams-page

feat/ms21-users-page

feat/ms19-terminal-persistence

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: mosaic/stack#372