feat(P4-004): summarization pipeline — LLM + cron scheduling

Add SummarizationService that reads hot agent logs (>24h), groups by session, calls a cheap LLM (gpt-4o-mini default, configurable via SUMMARIZATION_MODEL) to extract key insights, stores them with embeddings in the insights table, and transitions processed logs to warm tier. Add CronService with node-cron for scheduled execution (summarization every 6h, tier management daily at 3am). Tier management promotes warm→cold (30d) and purges cold logs (90d). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-13 08:52:52 -05:00
parent 666d2bc36d
commit 1d4916fe97
6 changed files with 247 additions and 2 deletions
--- a/docs/TASKS.md
+++ b/docs/TASKS.md
@@ -40,7 +40,7 @@
 | P4-001 | in-progress | Phase 4   | @mosaic/memory — preference + insight stores                  | —   | #34   |
 | P4-002 | in-progress | Phase 4   | Semantic search — pgvector embeddings + search API            | —   | #35   |
 | P4-003 | in-progress | Phase 4   | @mosaic/log — log ingest, parsing, tiered storage             | —   | #36   |
-| P4-004 | not-started | Phase 4   | Summarization pipeline — Haiku-tier LLM + cron                | —   | #37   |
+| P4-004 | in-progress | Phase 4   | Summarization pipeline — Haiku-tier LLM + cron                | —   | #37   |
 | P4-005 | not-started | Phase 4   | Memory integration — inject into agent sessions               | —   | #38   |
 | P4-006 | not-started | Phase 4   | Skill management — catalog, install, config                   | —   | #39   |
 | P4-007 | not-started | Phase 4   | Verify Phase 4 — memory + log pipeline working                | —   | #40   |