Prediction integration for cost estimation #373

Closed
opened 2026-02-15 05:29:05 +00:00 by jason.woltje · 1 comment
Owner

Summary

Integrate the Mosaic Telemetry prediction API to provide pre-task cost and token estimates. Before executing expensive LLM operations or dispatching agent tasks, query predictions to inform budgeting decisions and display estimates to users.

Context

The telemetry system maintains a prediction model trained on historical task completion data. Given a (task_type, model, provider, complexity) tuple, it returns statistical distributions for tokens, cost, and duration. This enables:

  • Budget guards: Warn before expensive operations
  • Model selection: Choose cost-effective models for simple tasks
  • User transparency: Show estimated cost before confirming

Requirements

Prediction Query Flow

```typescript
// Before task execution
const prediction = await telemetry.getPrediction({
  taskType: 'implementation',
  model: 'claude-opus-4-6',
  provider: 'anthropic',
  complexity: 'high',
});

if (prediction) {
  const estimatedCost = prediction.costUsdMicros.median;
  const estimatedTokens = prediction.inputTokens.median + prediction.outputTokens.median;
  // Use for budget checks, display to user, populate estimated_* fields in events
}
```

Prediction Response Fields

  • input_tokens — Distribution (p10, p25, median, p75, p90)
  • output_tokens — Distribution
  • cost_usd_micros — Cost distribution by percentile
  • duration_ms — Duration distribution
  • correction_factors — Input/output multipliers for adjustment
  • quality — Historical gate_pass_rate and success_rate
  • metadata — sample_size, confidence (none/low/medium/high), fallback_level
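The fields above might be modeled as follows. This is a sketch: the interface names and camelCased accessors are assumptions based on the query-flow example, not the actual API types.

```typescript
// Illustrative model of the prediction response. Field names follow the
// camelCased accessors used in the query-flow example; the exact API shape
// is an assumption.
interface Distribution {
  p10: number;
  p25: number;
  median: number;
  p75: number;
  p90: number;
}

interface Prediction {
  inputTokens: Distribution;
  outputTokens: Distribution;
  costUsdMicros: Distribution;               // cost in USD micros, by percentile
  durationMs: Distribution;
  correctionFactors: { input: number; output: number };
  quality: { gatePassRate: number; successRate: number };
  metadata: {
    sampleSize: number;
    confidence: 'none' | 'low' | 'medium' | 'high';
    fallbackLevel: number;                   // -1 when no data is available
  };
}

// Median-based total-token estimate, as computed in the query-flow example.
function estimatedTotalTokens(p: Prediction): number {
  return p.inputTokens.median + p.outputTokens.median;
}
```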

Integration Points

  1. Pre-LLM call estimation — Query prediction before chat/embed calls, populate estimated_* fields
  2. Orchestrator task budgeting — Before dispatching agent task, check if predicted cost exceeds budget
  3. Frontend display — Show cost estimate in UI before user confirms expensive actions
  4. Model selection hints — When multiple models available, use predictions to suggest cheapest viable option
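For integration point 2, the orchestrator's budget gate could look like this minimal sketch. `checkBudget` is a hypothetical helper, not an existing API; it treats a missing prediction as a pass, per the graceful-degradation rules below.

```typescript
// Sketch: decide whether to dispatch a task given the predicted median cost.
// `predictedCostUsdMicros` is null when no prediction is available; in that
// case we never block (graceful degradation).
function checkBudget(
  predictedCostUsdMicros: number | null,
  budgetUsdMicros: number,
): { ok: boolean; reason?: string } {
  if (predictedCostUsdMicros === null) {
    return { ok: true }; // no estimate: proceed without blocking
  }
  if (predictedCostUsdMicros > budgetUsdMicros) {
    return {
      ok: false,
      reason: `predicted cost ${predictedCostUsdMicros} exceeds budget ${budgetUsdMicros}`,
    };
  }
  return { ok: true };
}
```

A stricter variant might compare the p90 cost instead of the median to make the guard more conservative.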

Cache Strategy

  • Predictions cached in-memory (default TTL: 6 hours)
  • Refresh on app startup for common (task_type, model, provider) combinations
  • Refresh periodically in the background
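A minimal sketch of the in-memory TTL cache described above. `PredictionCache` is a hypothetical name and the real implementation may differ; the key is built from the (task_type, model, provider) tuple used for lookups.

```typescript
// Minimal in-memory TTL cache keyed by the prediction lookup tuple.
// Default TTL is 6 hours, per the cache strategy above. `now` is injectable
// for testing; expired entries simply miss and are overwritten on set().
class PredictionCache<T> {
  private entries = new Map<string, { value: T; expiresAt: number }>();

  constructor(private ttlMs: number = 6 * 60 * 60 * 1000) {}

  private key(parts: string[]): string {
    return parts.join('|');
  }

  get(parts: string[], now: number = Date.now()): T | undefined {
    const entry = this.entries.get(this.key(parts));
    if (!entry || entry.expiresAt <= now) return undefined;
    return entry.value;
  }

  set(parts: string[], value: T, now: number = Date.now()): void {
    this.entries.set(this.key(parts), { value, expiresAt: now + this.ttlMs });
  }
}
```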

Graceful Degradation

  • If prediction unavailable (confidence=none, fallback_level=-1): proceed without estimate
  • Never block task execution on prediction failure
  • Log missing predictions for coverage analysis
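The degradation rules above can be sketched as a wrapper around the prediction query. Names here are illustrative: the wrapper treats query errors, `confidence=none`, and `fallback_level=-1` uniformly as "no estimate" so callers never block.

```typescript
// Sketch: wrap the prediction query so failures and no-data results degrade
// to null ("proceed without estimate") instead of blocking task execution.
interface EstimateMeta {
  confidence: 'none' | 'low' | 'medium' | 'high';
  fallbackLevel: number;
}

async function safeGetPrediction<T extends EstimateMeta>(
  query: () => Promise<T | null>,
  log: (msg: string) => void = console.warn,
): Promise<T | null> {
  try {
    const p = await query();
    if (!p || p.confidence === 'none' || p.fallbackLevel === -1) {
      // Log the miss for coverage analysis, then proceed without an estimate.
      log('prediction unavailable; proceeding without estimate');
      return null;
    }
    return p;
  } catch (err) {
    // Never block task execution on a prediction failure.
    log(`prediction query failed: ${err}`);
    return null;
  }
}
```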

Acceptance Criteria

  • Predictions queried before LLM calls (populate estimated_* fields)
  • Orchestrator checks predicted cost against task budget
  • Cache populated on startup for common combinations
  • Graceful handling when no prediction data exists
  • Frontend can display cost estimate (API endpoint or WebSocket event)
  • Unit tests for prediction integration
  • Confidence level exposed (so UI can show "estimate confidence: high/low")
jason.woltje added the ai label 2026-02-15 05:29:05 +00:00
jason.woltje added this to the M10-Telemetry (0.0.10) milestone 2026-02-15 05:31:19 +00:00
jason.woltje (Author, Owner) commented:

Completed in commit d5bf501 on feature/m10-telemetry. PredictionService with 6hr TTL cache, startup refresh, GET /api/telemetry/estimate endpoint. Tests passing.

Reference: mosaic/stack#373