Streaming AI responses via Matrix message edits #383

Closed
opened 2026-02-15 07:01:39 +00:00 by jason.woltje · 1 comment
Owner

Summary

Implement streaming AI chat responses in Matrix rooms using incremental message edits. This fills the gap left by the unimplemented streamChatMessage in the REST API.

Context

The current LLM chat endpoint (/api/llm/chat) is request-response only, and streamChatMessage in apps/web/src/api/chat.ts is marked as not implemented. The Matrix protocol natively supports message edits (the m.replace relation), making them a natural transport for streaming LLM output.

Implementation

Flow

  1. User sends message in Matrix room (or thread)
  2. Bot sends initial response: "Thinking..." (or typing indicator)
  3. As LLM tokens stream in, bot edits the response message with accumulated text
  4. Final edit includes complete response + token usage metadata
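The edit events in steps 3–4 would follow the standard Matrix m.replace shape. A minimal sketch of building such an event's content (buildEdit is a hypothetical helper, not an existing service method; the field names come from the Matrix spec):

```typescript
// Content of a Matrix edit event (m.replace relation), as sent for each
// streaming update. Clients that don't understand edits render `body`,
// conventionally prefixed with "* ".
interface EditContent {
  msgtype: "m.text";
  body: string;
  "m.new_content": { msgtype: "m.text"; body: string };
  "m.relates_to": { rel_type: "m.replace"; event_id: string };
}

// Hypothetical helper: build the edit content for one streaming update,
// replacing the bot's original "Thinking..." message.
function buildEdit(originalEventId: string, accumulated: string): EditContent {
  return {
    msgtype: "m.text",
    body: `* ${accumulated}`, // fallback rendering for non-edit-aware clients
    "m.new_content": { msgtype: "m.text", body: accumulated },
    "m.relates_to": { rel_type: "m.replace", event_id: originalEventId },
  };
}
```

Each subsequent edit targets the same original event_id, so the room shows a single message whose body grows as tokens arrive.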

Chunking Strategy

  • Buffer tokens and edit every ~500ms (not per-token — too many API calls)
  • Use Matrix typing indicator (m.typing) while generating
  • Final message replaces the streaming content with clean formatted output
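The buffering described above can be sketched as a small loop over an AsyncIterable of tokens; `flush` stands in for the Matrix edit call, and the function names are illustrative, not part of the existing codebase:

```typescript
// Accumulate streamed tokens and flush at most once per intervalMs,
// with a final flush so the last edit always carries the complete text.
async function streamWithBuffer(
  tokens: AsyncIterable<string>,
  flush: (text: string) => Promise<void>,
  intervalMs = 500,
): Promise<string> {
  let accumulated = "";
  let lastSent = "";
  let lastFlushAt = Date.now();
  for await (const token of tokens) {
    accumulated += token;
    if (Date.now() - lastFlushAt >= intervalMs) {
      await flush(accumulated);
      lastSent = accumulated;
      lastFlushAt = Date.now();
    }
  }
  if (accumulated !== lastSent) {
    await flush(accumulated); // final edit with the full response
  }
  return accumulated;
}
```

Skipping the final flush when nothing changed avoids sending a redundant duplicate edit after the stream ends.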

LLM Integration

  • Call the existing LLM service with streaming enabled
  • This requires the LLM providers (Claude, OpenAI, Ollama) to support streaming responses
  • If a provider doesn't support streaming, fall back to request-response (send complete message)
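One way to express the fallback is a provider interface with an optional streaming method; if it is absent, the complete reply is emitted as a single chunk. The interface below is an assumption for illustration, not the project's actual IChatProvider:

```typescript
// Assumed provider shape: every provider can do request-response chat;
// streaming-capable providers additionally expose streamChat.
interface ChatProvider {
  chat(prompt: string): Promise<string>;
  streamChat?(prompt: string): AsyncIterable<string>;
}

// Uniform entry point: callers always consume an AsyncIterable, whether
// the provider streams token-by-token or answers in one shot.
async function* respond(
  provider: ChatProvider,
  prompt: string,
): AsyncIterable<string> {
  if (provider.streamChat) {
    yield* provider.streamChat(prompt);
  } else {
    // Fallback: request-response, delivered as a single chunk.
    yield await provider.chat(prompt);
  }
}
```

Downstream code (buffering, edits, typing indicator) then never needs to know which kind of provider it is talking to.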

Typing Indicator

  • Send m.typing event when processing starts
  • Clear typing indicator when response is complete or errored
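The "complete or errored" requirement is naturally a try/finally wrapper. A minimal sketch, where `setTyping` is an assumed abstraction over the homeserver typing call (matrix-js-sdk's client.sendTyping, for instance):

```typescript
// Run `work` with the typing indicator on, guaranteeing it is cleared
// whether the work resolves or throws.
async function withTyping<T>(
  setTyping: (typing: boolean) => Promise<void>,
  work: () => Promise<T>,
): Promise<T> {
  await setTyping(true);
  try {
    return await work();
  } finally {
    await setTyping(false); // cleared on success and on error
  }
}
```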

Acceptance Criteria

  • LLM responses stream incrementally in Matrix rooms
  • Message edits used (not multiple messages)
  • Typing indicator shown during generation
  • Graceful fallback for non-streaming providers
  • Rate-limited edits (at most one edit per 500ms)
  • Final message is clean and complete
  • Token usage shown in final message (optional reaction or footer)

Refs

  • Current chat API: apps/api/src/llm/
  • Unimplemented streaming: apps/web/src/api/chat.ts (search for streamChatMessage)
  • Matrix message editing: m.replace relation type
  • EPIC: #377
  • Depends on: #378, #381
jason.woltje added the ai label 2026-02-15 07:01:39 +00:00
jason.woltje added this to the M12-MatrixBridge (0.0.12) milestone 2026-02-15 07:01:51 +00:00
Author
Owner

Completed in commit 93cd314 on branch feature/m12-matrix-bridge.

  • MatrixStreamingService with editMessage (m.replace), setTypingIndicator, streamResponse
  • Rate-limited edits at 500ms intervals
  • LLM-agnostic AsyncIterable interface
  • Thread support via MSC3440
  • Graceful error handling with typing cleanup
  • Optional editMessage added to IChatProvider interface
  • 20 tests pass, 132 total bridge tests pass

Reference: mosaic/stack#383