Implement Token Budget Tracker #138

New Issue

jason.woltje · 2026-01-30T23:43:14Z

jason.woltje commented

2026-01-30 23:43:14 +00:00

Track token usage and prevent premature done claims with significant budget remaining.

Objective: Detect when agents claim done with substantial token budget unused, indicating premature stopping.

Problem: Agents claim done after fixing P0 issues, leaving work incomplete despite budget remaining.

Token Budget System:

Track tokens used vs allocated per task
Flag suspicious patterns: done claimed with >20% budget remaining
Correlate with gate failures: done + budget remaining + gates failing = forced continue
Allow early completion only if gates pass AND work demonstrably complete

Budget Allocation:

Per-task budget based on estimated complexity
Track input tokens, output tokens, total cost
Compare against similar completed tasks
Learn optimal budget utilization over time

Anti-Gaming Detection:

Agent cannot waste tokens to hit threshold
Must correlate token usage with actual work progress
Gate results are primary signal, budget is secondary

Integration with Orchestrator:

Orchestrator checks budget before accepting done
Suspicious pattern + gate failures = reject done
Budget exhausted + gates failing = alert user, request more budget

Related: L-015, #134 (orchestrator), #136 (gates), #137 (forced continuation)

Acceptance Criteria:

Token usage tracked per agent session
Budget utilization calculated
Suspicious patterns detected
Integration with orchestrator decision logic
Does not prevent legitimate early completion
Alerts on budget exhaustion before work complete

Track token usage and prevent premature done claims with significant budget remaining. Objective: Detect when agents claim done with substantial token budget unused, indicating premature stopping. Problem: Agents claim done after fixing P0 issues, leaving work incomplete despite budget remaining. Token Budget System: - Track tokens used vs allocated per task - Flag suspicious patterns: done claimed with >20% budget remaining - Correlate with gate failures: done + budget remaining + gates failing = forced continue - Allow early completion only if gates pass AND work demonstrably complete Budget Allocation: - Per-task budget based on estimated complexity - Track input tokens, output tokens, total cost - Compare against similar completed tasks - Learn optimal budget utilization over time Anti-Gaming Detection: - Agent cannot waste tokens to hit threshold - Must correlate token usage with actual work progress - Gate results are primary signal, budget is secondary Integration with Orchestrator: - Orchestrator checks budget before accepting done - Suspicious pattern + gate failures = reject done - Budget exhausted + gates failing = alert user, request more budget Related: L-015, #134 (orchestrator), #136 (gates), #137 (forced continuation) Acceptance Criteria: - Token usage tracked per agent session - Budget utilization calculated - Suspicious patterns detected - Integration with orchestrator decision logic - Does not prevent legitimate early completion - Alerts on budget exhaustion before work complete

jason.woltje added the api api p1 labels 2026-01-30 23:43:14 +00:00

jason.woltje added this to the M4-LLM (0.0.4) milestone 2026-01-30 23:45:34 +00:00

jason.woltje closed this issue

2026-01-31 20:50:24 +00:00

Sign in to join this conversation.

Branches Tags

main

fix/ci-glibc-image

fix/dockerfile-npmrc

fix/matrix-native-binary

fix/kaniko-cache

fix/base-image-kaniko-v2

fix/base-image-kaniko

feat/custom-base-image

ci/pnpm-cache

fix/interceptor-tests

fix/kanban-tests

feat/wire-chat

feat/usage-widget

fix/security-hardening

fix/project-domain-v2

feat/kanban-add-task

fix/project-domain-attach

fix/logs-page-clean

fix/workspace-members

fix/ci-lint-632

fix/file-manager-tags

fix/csrf-debug-log

fix/controller-type-imports

fix/system-admin-env

fix/gateway-cors-trusted-origins

feat/project-detail-page

fix/fleet-provider-form-dto-v2

fix/ms22-audit

fix/orchestrator-widgets

fix/fleet-provider-form-dto

fix/csrf-bearer-bypass

fix/ms22-missing-authmodule-imports

fix/container-lifecycle-config-module

fix/swarm-compose-ms22-vars

chore/ms22-p1-complete

feat/ms22-p1h-settings-ui

feat/ms22-p1f-onboarding-ui

feat/ms22-p1i-chat-proxy

feat/ms22-p1k-idle-reaper

feat/ms22-p1j-docker

feat/ms22-p1e-onboarding-api

feat/ms22-p1g-settings-api

feat/ms22-p1d-container-mgr

feat/ms22-p1c-config-api

chore/ms22-prd-tracking

feat/ms22-p1a-schema

feat/ms22-p1b-crypto

chore/ms22-p1-tasks

docs/ms22-architecture

feat/ms22-openclaw-docker

feat/ms22-openclaw-gateway-module

chore/ms21-complete

chore/ms21-final-tasks-done

fix/ms21-ui-001-qa

test/ms21-ui-tests

chore/ms21-tasks-sync

chore/ms22-phase0-complete

feat/ms22-ingest-clean

feat/ms21-ui-users-members

feat/ms22-task-agent

chore/tasks-final

chore/tasks-update

feat/ms21-session-invalidation

feat/ms21-rbac-settings

feat/ms21-teams-page

feat/ms21-users-page

feat/ms19-terminal-persistence

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: mosaic/stack#138