It's just a plan, or more of an understanding brainfart
# AMCS → OpenClaw Alternative: Gap Analysis & Roadmap
## Context
AMCS is a **passive** MCP memory server. OpenClaw's key differentiator is that it's an **always-on autonomous agent** — it proactively acts, monitors, and learns without human prompting. AMCS has the data model and search foundation; it's missing the execution engine and channel integrations that make OpenClaw compelling.
OpenClaw's 3 pillars AMCS lacks:
1. **Autonomous heartbeat** — scheduled jobs that run without user prompts
2. **Channel integrations** — 25+ messaging platforms (Telegram, Slack, Discord, email, etc.)
3. **Self-improving memory** — knowledge graph distillation, daily notes, living summary (MEMORY.md)
---
## Phase 1: Autonomous Heartbeat Engine (Critical — unlocks everything else)
### 1a. Add `Complete()` to AI Provider
The current `Provider` interface in `internal/ai/provider.go` only supports `Summarize(ctx, systemPrompt, userPrompt)`. An autonomous agent needs a stateful multi-turn call with tool awareness.
**Extend the interface:**
```go
// internal/ai/provider.go
type CompletionRole string
const (
	RoleSystem    CompletionRole = "system"
	RoleUser      CompletionRole = "user"
	RoleAssistant CompletionRole = "assistant"
)

type CompletionMessage struct {
	Role    CompletionRole `json:"role"`
	Content string         `json:"content"`
}

type CompletionResult struct {
	Content    string `json:"content"`
	StopReason string `json:"stop_reason"` // "stop" | "length" | "error"
	Model      string `json:"model"`
}

type Provider interface {
	Embed(ctx context.Context, input string) ([]float32, error)
	ExtractMetadata(ctx context.Context, input string) (thoughttypes.ThoughtMetadata, error)
	Summarize(ctx context.Context, systemPrompt, userPrompt string) (string, error)
	Complete(ctx context.Context, messages []CompletionMessage) (CompletionResult, error)
	Name() string
	EmbeddingModel() string
}
```
**Implement in `internal/ai/compat/client.go`:**
`Complete` is a simplification of the existing `extractMetadataWithModel` path — same OpenAI-compatible `/chat/completions` endpoint, same auth headers, no JSON schema coercion. Add a `chatCompletionsRequest` type (reuse or extend the existing unexported struct) and a `Complete` method on `*Client` that:
1. Builds the request body from `[]CompletionMessage`
2. POSTs to `c.baseURL + "/chat/completions"` with `c.metadataModel`
3. Reads the first choice's `message.content`
4. Returns `CompletionResult{Content, StopReason, Model}`
Error handling mirrors the metadata path: on HTTP 429/503 mark the model unhealthy (`c.modelHealth`), return a wrapped error. No fallback model chain needed for agent calls — callers should retry on next heartbeat tick.
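A sketch of what `Complete` could look like under those constraints. The wire-struct names (`chatCompletionsRequest`, `chatCompletionsResponse`) and the standalone function signature are assumptions for self-containment — the real method hangs off `*Client` and reuses its unexported types. Parsing is split into a pure `parseCompletion` so step 3 is testable without HTTP:

```go
package main

import (
	"bytes"
	"context"
	"encoding/json"
	"fmt"
	"io"
	"net/http"
)

// Assumed wire shapes for the OpenAI-compatible endpoint.
type CompletionMessage struct {
	Role    string `json:"role"`
	Content string `json:"content"`
}

type CompletionResult struct {
	Content    string
	StopReason string
	Model      string
}

type chatCompletionsRequest struct {
	Model    string              `json:"model"`
	Messages []CompletionMessage `json:"messages"`
}

type chatCompletionsResponse struct {
	Model   string `json:"model"`
	Choices []struct {
		Message      CompletionMessage `json:"message"`
		FinishReason string            `json:"finish_reason"`
	} `json:"choices"`
}

// parseCompletion maps the first choice of a /chat/completions body
// onto CompletionResult (step 3 above).
func parseCompletion(body []byte) (CompletionResult, error) {
	var resp chatCompletionsResponse
	if err := json.Unmarshal(body, &resp); err != nil {
		return CompletionResult{}, fmt.Errorf("decode completion: %w", err)
	}
	if len(resp.Choices) == 0 {
		return CompletionResult{}, fmt.Errorf("completion returned no choices")
	}
	return CompletionResult{
		Content:    resp.Choices[0].Message.Content,
		StopReason: resp.Choices[0].FinishReason,
		Model:      resp.Model,
	}, nil
}

// Complete builds the request (step 1), POSTs it (step 2), and parses the
// first choice (steps 3-4). baseURL/apiKey/model stand in for Client fields.
func Complete(ctx context.Context, client *http.Client, baseURL, apiKey, model string, messages []CompletionMessage) (CompletionResult, error) {
	payload, err := json.Marshal(chatCompletionsRequest{Model: model, Messages: messages})
	if err != nil {
		return CompletionResult{}, err
	}
	req, err := http.NewRequestWithContext(ctx, http.MethodPost, baseURL+"/chat/completions", bytes.NewReader(payload))
	if err != nil {
		return CompletionResult{}, err
	}
	req.Header.Set("Content-Type", "application/json")
	req.Header.Set("Authorization", "Bearer "+apiKey)
	resp, err := client.Do(req)
	if err != nil {
		return CompletionResult{}, err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return CompletionResult{}, fmt.Errorf("chat/completions: status %d", resp.StatusCode)
	}
	body, err := io.ReadAll(resp.Body)
	if err != nil {
		return CompletionResult{}, err
	}
	return parseCompletion(body)
}
```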
---
### 1b. Heartbeat Engine Package
**New package: `internal/agent/`**
#### `internal/agent/job.go`
```go
package agent
import (
	"context"
	"time"
)

// Job is a single scheduled unit of autonomous work.
type Job interface {
	Name() string
	Interval() time.Duration
	Run(ctx context.Context) error
}
```
#### `internal/agent/engine.go`
The engine manages a set of jobs and fires each on its own ticker. It mirrors the pattern already used for `runBackfillPass` and `runMetadataRetryPass` in `internal/app/app.go`, but generalises it.
```go
package agent
import (
	"context"
	"log/slog"
	"sync"
	"time"
)

type Engine struct {
	jobs   []Job
	store  JobStore // persists agent_job_runs rows
	logger *slog.Logger
}

func NewEngine(store JobStore, logger *slog.Logger, jobs ...Job) *Engine {
	return &Engine{jobs: jobs, store: store, logger: logger}
}

// Run starts all job tickers and blocks until ctx is cancelled.
func (e *Engine) Run(ctx context.Context) {
	var wg sync.WaitGroup
	for _, job := range e.jobs {
		wg.Add(1)
		go func(j Job) {
			defer wg.Done()
			e.runLoop(ctx, j)
		}(job)
	}
	wg.Wait()
}

func (e *Engine) runLoop(ctx context.Context, j Job) {
	ticker := time.NewTicker(j.Interval())
	defer ticker.Stop()
	for {
		select {
		case <-ctx.Done():
			return
		case <-ticker.C:
			e.runOnce(ctx, j)
		}
	}
}

func (e *Engine) runOnce(ctx context.Context, j Job) {
	runID, err := e.store.StartRun(ctx, j.Name())
	if err != nil {
		// When the store signals a duplicate run (see below), this should be
		// downgraded to a silent skip; anything else is a persistence failure.
		e.logger.Error("agent: failed to start job run record",
			slog.String("job", j.Name()), slog.String("error", err.Error()))
		return
	}
	// A panicking job must not take down the engine or the other jobs.
	defer func() {
		if r := recover(); r != nil {
			e.logger.Error("agent: job panicked",
				slog.String("job", j.Name()), slog.Any("panic", r))
			_ = e.store.FinishRun(ctx, runID, "failed", "", "panic (see logs)")
		}
	}()
	if err := j.Run(ctx); err != nil {
		e.logger.Error("agent: job failed",
			slog.String("job", j.Name()), slog.String("error", err.Error()))
		_ = e.store.FinishRun(ctx, runID, "failed", "", err.Error())
		return
	}
	_ = e.store.FinishRun(ctx, runID, "ok", "", "")
	e.logger.Info("agent: job complete", slog.String("job", j.Name()))
}
```
**Deduplication / double-run prevention:** `StartRun` should check for an existing `running` row younger than `2 * j.Interval()` and return `ErrAlreadyRunning` — the caller skips that tick.
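One way `StartRun` could enforce this atomically (Postgres sketch; passing the window as `$2` seconds is an assumption — the real store may bind the interval differently). Zero rows returned means another run is live and the caller should treat it as `ErrAlreadyRunning`:

```sql
-- Insert a new run only if no live run exists inside the dedup window.
INSERT INTO agent_job_runs (job_name)
SELECT $1
WHERE NOT EXISTS (
    SELECT 1 FROM agent_job_runs
    WHERE job_name = $1
      AND status = 'running'
      AND started_at > now() - make_interval(secs => $2)
)
RETURNING id;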
#### `internal/agent/distill.go`
```go
// DistillJob clusters semantically related thoughts and promotes
// durable insights into knowledge nodes.
type DistillJob struct {
	store     store.ThoughtQuerier
	provider  ai.Provider
	cfg       AgentDistillConfig
	projectID *uuid.UUID // nil = all projects
}

func (j *DistillJob) Name() string            { return "distill" }
func (j *DistillJob) Interval() time.Duration { return j.cfg.Interval }

func (j *DistillJob) Run(ctx context.Context) error {
	// 1. Fetch recent thoughts not yet distilled (metadata.distilled != true)
	//    using store.ListThoughts with filter Days = cfg.MinAgeHours/24
	// 2. Group into semantic clusters via SearchSimilarThoughts
	// 3. For each cluster > MinClusterSize:
	//    a. Call provider.Summarize with insight extraction prompt
	//    b. InsertThought with type="insight", metadata.knowledge_node=true
	//    c. InsertLink from each cluster member to the insight, relation="distilled_from"
	//    d. UpdateThought on each source to set metadata.distilled=true
	// 4. Partial failures are logged but do not abort the run.
	return nil
}
```
Prompt used in step 3a:
```
System: You extract durable knowledge from a cluster of related notes.
Return a single paragraph (2-5 sentences) capturing the core insight.
Do not reference the notes themselves. Write in third person.
User: [concatenated thought content, newest first, max 4000 tokens]
```
#### `internal/agent/daily_notes.go`
Runs at a configured hour each day. Inside the loop, the job compares `time.Now().Hour()` against `cfg.Hour` and skips if it has already run today — determined by querying `agent_job_runs` for a successful `daily_notes` run with `started_at >= today midnight`.
Collects:
- Thoughts created today (`store.ListThoughts` with `Days=1`)
- CRM interactions logged today
- Calendar activities for today
- Maintenance logs from today
Formats into a structured markdown string and calls `store.InsertThought` with `type=daily_note`.
#### `internal/agent/living_summary.go`
Regenerates `MEMORY.md` from the last N daily notes + all knowledge nodes. Calls `provider.Summarize` and upserts the result via `store.UpsertFile` using a fixed name `MEMORY.md` scoped to the project (or global if no project).
---
### 1c. Config Structs
Add to `internal/config/config.go`:
```go
type Config struct {
	// ... existing fields ...
	Agent    AgentConfig    `yaml:"agent"`
	Channels ChannelsConfig `yaml:"channels"`
	Shell    ShellConfig    `yaml:"shell"`
}

type AgentConfig struct {
	Enabled       bool                  `yaml:"enabled"`
	Distill       AgentDistillConfig    `yaml:"distill"`
	DailyNotes    AgentDailyNotesConfig `yaml:"daily_notes"`
	LivingSummary AgentLivingSummary    `yaml:"living_summary"`
	Archival      AgentArchivalConfig   `yaml:"archival"`
	Model         string                `yaml:"model"` // override for agent calls; falls back to AI.Metadata.Model
}

type AgentDistillConfig struct {
	Enabled        bool          `yaml:"enabled"`
	Interval       time.Duration `yaml:"interval"`         // default: 24h
	BatchSize      int           `yaml:"batch_size"`       // thoughts per run; default: 50
	MinClusterSize int           `yaml:"min_cluster_size"` // default: 3
	MinAgeHours    int           `yaml:"min_age_hours"`    // ignore thoughts younger than this; default: 6
}

type AgentDailyNotesConfig struct {
	Enabled bool `yaml:"enabled"`
	Hour    int  `yaml:"hour"` // 0-23 UTC; default: 23
}

type AgentLivingSummary struct {
	Enabled  bool          `yaml:"enabled"`
	Interval time.Duration `yaml:"interval"` // default: 24h
	MaxDays  int           `yaml:"max_days"` // daily notes lookback; default: 30
}

type AgentArchivalConfig struct {
	Enabled          bool          `yaml:"enabled"`
	Interval         time.Duration `yaml:"interval"`                // default: 168h (weekly)
	ArchiveOlderThan int           `yaml:"archive_older_than_days"` // default: 90
}
```
**Full YAML reference (`configs/dev.yaml` additions):**
```yaml
agent:
  enabled: false
  model: "" # leave blank to reuse ai.metadata.model
  distill:
    enabled: false
    interval: 24h
    batch_size: 50
    min_cluster_size: 3
    min_age_hours: 6
  daily_notes:
    enabled: false
    hour: 23 # UTC hour to generate (0-23)
  living_summary:
    enabled: false
    interval: 24h
    max_days: 30
  archival:
    enabled: false
    interval: 168h
    archive_older_than_days: 90
```
---
### 1d. Wire into `internal/app/app.go`
After the existing `MetadataRetry` goroutine block:
```go
if cfg.Agent.Enabled {
	jobStore := store.NewJobStore(db)
	var jobs []agent.Job
	if cfg.Agent.Distill.Enabled {
		jobs = append(jobs, agent.NewDistillJob(db, provider, cfg.Agent.Distill, nil))
	}
	if cfg.Agent.DailyNotes.Enabled {
		jobs = append(jobs, agent.NewDailyNotesJob(db, provider, cfg.Agent.DailyNotes))
	}
	if cfg.Agent.LivingSummary.Enabled {
		jobs = append(jobs, agent.NewLivingSummaryJob(db, provider, cfg.Agent.LivingSummary))
	}
	if cfg.Agent.Archival.Enabled {
		jobs = append(jobs, agent.NewArchivalJob(db, cfg.Agent.Archival))
	}
	engine := agent.NewEngine(jobStore, logger, jobs...)
	go engine.Run(ctx)
}
```
---
### 1e. New MCP Tools — `internal/tools/agent.go`
```go
// list_agent_jobs
// Returns all registered jobs with: name, interval, last_run (status, started_at, finished_at), next_run estimate.
// trigger_agent_job
// Input: { "job": "distill" }
// Fires the job immediately in a goroutine; returns a run_id for polling.
// get_agent_job_history
// Input: { "job": "distill", "limit": 20 }
// Returns rows from agent_job_runs ordered by started_at DESC.
```
Register in `internal/app/app.go` routes by adding `Agent tools.AgentTool` to `mcpserver.ToolSet` and wiring `tools.NewAgentTool(engine)`.
---
### 1f. Migration — `migrations/021_agent_jobs.sql`
```sql
CREATE TABLE agent_job_runs (
    id          uuid PRIMARY KEY DEFAULT gen_random_uuid(),
    job_name    text NOT NULL,
    started_at  timestamptz NOT NULL DEFAULT now(),
    finished_at timestamptz,
    status      text NOT NULL DEFAULT 'running', -- running | ok | failed | skipped
    output      text,
    error       text,
    metadata    jsonb NOT NULL DEFAULT '{}'
);

CREATE INDEX idx_agent_job_runs_lookup
    ON agent_job_runs (job_name, started_at DESC);
```
**`JobStore` interface (`internal/store/agent.go`):**
```go
type JobStore interface {
	StartRun(ctx context.Context, jobName string) (uuid.UUID, error)
	FinishRun(ctx context.Context, id uuid.UUID, status, output, errMsg string) error
	LastRun(ctx context.Context, jobName string) (*AgentJobRun, error)
	ListRuns(ctx context.Context, jobName string, limit int) ([]AgentJobRun, error)
}
```
---
## Phase 2: Knowledge Graph Distillation
Builds on Phase 1's distillation job. `thought_links` already exists with typed `relation` — the missing piece is a way to mark and query promoted knowledge nodes.
### 2a. Extend `ThoughtMetadata`
In `internal/types/thought.go`, add two fields to `ThoughtMetadata`:
```go
type ThoughtMetadata struct {
	// ... existing fields ...
	KnowledgeNode   bool `json:"knowledge_node,omitempty"`   // true = promoted insight
	KnowledgeWeight int  `json:"knowledge_weight,omitempty"` // number of source thoughts that fed this node
	Distilled       bool `json:"distilled,omitempty"`        // true = this thought has been processed by distill job
}
```
These are stored in the existing `metadata jsonb` column — no schema migration needed.
### 2b. Store Addition
In `internal/store/thoughts.go` add:
```go
// ListKnowledgeNodes returns thoughts where metadata->>'knowledge_node' = 'true',
// ordered by knowledge_weight DESC, then created_at DESC.
func (db *DB) ListKnowledgeNodes(ctx context.Context, projectID *uuid.UUID, limit int) ([]types.Thought, error)
```
SQL:
```sql
SELECT id, content, metadata, project_id, archived_at, created_at, updated_at
FROM thoughts
WHERE (metadata->>'knowledge_node')::boolean = true
  AND ($1::uuid IS NULL OR project_id = $1)
  AND archived_at IS NULL
ORDER BY (metadata->>'knowledge_weight')::int DESC, created_at DESC
LIMIT $2
```
### 2c. New MCP Tools — `internal/tools/knowledge.go`
```go
// get_knowledge_graph
// Input: { "project_id": "uuid|null", "limit": 50 }
// Returns: { nodes: [Thought], edges: [ThoughtLink] }
// Fetches ListKnowledgeNodes + their outgoing/incoming links via store.GetThoughtLinks.
// distill_now
// Input: { "project_id": "uuid|null", "batch_size": 20 }
// Triggers the distillation job synchronously (for on-demand use); returns { insights_created: N }
```
---
## Phase 3: Channel Integrations — Telegram First
### 3a. Channel Adapter Interface — `internal/channels/channel.go`
```go
package channels
import (
	"context"
	"time"
)

type Attachment struct {
	Name      string
	MediaType string
	Data      []byte
}

type InboundMessage struct {
	ChannelID   string // e.g. telegram chat ID as string
	SenderID    string // e.g. telegram user ID as string
	SenderName  string // display name
	Text        string
	Attachments []Attachment
	Timestamp   time.Time
	Raw         any // original platform message for debug/logging
}

type Channel interface {
	Name() string
	Start(ctx context.Context, handler func(InboundMessage)) error
	Send(ctx context.Context, channelID string, text string) error
}
```
### 3b. Telegram Implementation — `internal/channels/telegram/bot.go`
Uses `net/http` only (no external Telegram SDK). Long-polling loop:
```go
type Bot struct {
	token      string
	allowedIDs map[int64]struct{} // empty = all allowed
	baseURL    string             // https://api.telegram.org/bot{token}
	client     *http.Client
	offset     int64
	logger     *slog.Logger
}

func (b *Bot) Name() string { return "telegram" }

func (b *Bot) Start(ctx context.Context, handler func(channels.InboundMessage)) error {
	for {
		updates, err := b.getUpdates(ctx, b.offset, 30 /* timeout seconds */)
		if err != nil {
			if ctx.Err() != nil {
				return nil
			}
			// transient error: log and back off 5s
			time.Sleep(5 * time.Second)
			continue
		}
		for _, u := range updates {
			b.offset = u.UpdateID + 1
			if u.Message == nil {
				continue
			}
			if !b.isAllowed(u.Message.Chat.ID) {
				continue
			}
			handler(b.toInbound(u.Message))
		}
	}
}

func (b *Bot) Send(ctx context.Context, channelID string, text string) error {
	// POST /sendMessage with chat_id and text
	// Splits messages > 4096 chars automatically
	return nil
}
```
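The splitting behaviour mentioned in `Send` can live in a pure helper (hypothetical name `splitMessage`; note that Telegram's 4096 limit counts characters, so this splits on runes rather than bytes — an assumption about how the real client should behave):

```go
package main

// splitMessage breaks text into chunks of at most limit runes, preferring
// to break just after the last newline inside the window so paragraphs
// survive intact. Concatenating the chunks reproduces the input exactly.
func splitMessage(text string, limit int) []string {
	runes := []rune(text)
	var parts []string
	for len(runes) > limit {
		cut := limit
		// Look backwards for a newline to break on within the window.
		for i := limit - 1; i > 0; i-- {
			if runes[i] == '\n' {
				cut = i + 1 // keep the newline with the earlier chunk
				break
			}
		}
		parts = append(parts, string(runes[:cut]))
		runes = runes[cut:]
	}
	if len(runes) > 0 {
		parts = append(parts, string(runes))
	}
	return parts
}
```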
**Error handling:**
- HTTP 401 (bad token): return fatal error, engine stops channel
- HTTP 429 (rate limit): respect `retry_after` from response body, sleep, retry
- HTTP 5xx: exponential backoff (5s → 10s → 30s → 60s), max 3 retries then sleep 5 min
### 3c. Channel Router — `internal/channels/router.go`
```go
type Router struct {
	store    store.ContactQuerier
	thoughts store.ThoughtInserter
	provider ai.Provider
	channels map[string]channels.Channel
	cfg      config.ChannelsConfig
	logger   *slog.Logger
}

func (r *Router) Handle(msg channels.InboundMessage) {
	// 1. Resolve sender → CRM contact (by channel_identifiers->>'telegram' = senderID)
	//    If not found: create a new professional_contact with the sender name + channel identifier
	// 2. Capture message as thought:
	//      content = msg.Text
	//      metadata.source = "telegram"
	//      metadata.type = "observation"
	//      metadata.people = [senderName]
	//      metadata (extra, stored in JSONB): channel="telegram", channel_id=msg.ChannelID, sender_id=msg.SenderID
	// 3. If cfg.Telegram.Respond:
	//    a. Load recent context via store.SearchSimilarThoughts(msg.Text, limit=10)
	//    b. Build []CompletionMessage with system context + recent thoughts + user message
	//    c. Call provider.Complete(ctx, messages)
	//    d. Capture response as thought (type="assistant_response", source="telegram")
	//    e. Send reply via channel.Send(ctx, msg.ChannelID, result.Content)
	//    f. Save chat history via store.InsertChatHistory
}
```
**Agent response system prompt (step 3b):**
```
You are a personal assistant with access to the user's memory.
Relevant context from memory:
{joined recent thought content}
Respond concisely. If you cannot answer from memory, say so.
```
### 3d. Config — full YAML reference
```yaml
channels:
  telegram:
    enabled: false
    bot_token: ""
    allowed_chat_ids: []       # empty = all chats allowed
    capture_all: true          # save every inbound message as a thought
    respond: true              # send LLM reply back to sender
    response_model: ""         # blank = uses agent.model or ai.metadata.model
    poll_timeout_seconds: 30   # Telegram long-poll timeout (max 60)
    max_message_length: 4096   # split replies longer than this
  discord:
    enabled: false
    bot_token: ""
    guild_ids: []              # empty = all guilds
    capture_all: true
    respond: true
  slack:
    enabled: false
    bot_token: ""
    app_token: ""              # for socket mode
    capture_all: true
    respond: true
  email:
    enabled: false
    imap_host: ""
    imap_port: 993
    smtp_host: ""
    smtp_port: 587
    username: ""
    password: ""
    poll_interval: 5m
    capture_all: true
    folders: ["INBOX"]
```
### 3e. Schema Migration — `migrations/022_channel_contacts.sql`
```sql
-- Store per-channel identity handles on CRM contacts
ALTER TABLE professional_contacts
    ADD COLUMN IF NOT EXISTS channel_identifiers jsonb NOT NULL DEFAULT '{}';
-- e.g. {"telegram": "123456789", "discord": "user#1234", "slack": "U01234567"}

CREATE INDEX idx_contacts_telegram_id
    ON professional_contacts ((channel_identifiers->>'telegram'))
    WHERE channel_identifiers->>'telegram' IS NOT NULL;
```
### 3f. New MCP Tools — `internal/tools/channels.go`
```go
// send_channel_message
// Input: { "channel": "telegram", "channel_id": "123456789", "text": "Hello" }
// Sends a message on the named channel. Returns { sent: true, channel: "telegram" }
// list_channel_conversations
// Input: { "channel": "telegram", "limit": 20, "days": 7 }
// Lists chat histories filtered by channel metadata. Wraps store.ListChatHistories.
// get_channel_status
// Returns: [{ channel: "telegram", connected: true, uptime_seconds: 3600 }, ...]
```
### 3g. Future Channel Adapters
Each is a new subdirectory implementing `channels.Channel`. No router or MCP tool changes needed.
| Channel | Package | Approach |
|---------|---------|----------|
| Discord | `internal/channels/discord/` | Gateway WebSocket (discord.com/api/gateway); or use `discordgo` lib |
| Slack | `internal/channels/slack/` | Socket Mode WebSocket (no public URL needed) |
| Email (IMAP) | `internal/channels/email/` | IMAP IDLE or poll; SMTP for send |
| Signal | `internal/channels/signal/` | Wrap `signal-cli` JSON-RPC subprocess |
| WhatsApp | `internal/channels/whatsapp/` | Meta Cloud API webhook (requires public URL) |
---
## Phase 4: Shell / Computer Access
### 4a. Shell Tool — `internal/tools/shell.go`
```go
type ShellInput struct {
	Command    string `json:"command"`
	WorkingDir string `json:"working_dir,omitempty"` // override default; must be within allowed prefix
	Timeout    string `json:"timeout,omitempty"`     // e.g. "30s"; overrides config default
	CaptureAs  string `json:"capture_as,omitempty"`  // thought type for stored output; default "shell_output"
	SaveOutput bool   `json:"save_output"`           // store stdout/stderr as a thought
}

type ShellOutput struct {
	Stdout    string     `json:"stdout"`
	Stderr    string     `json:"stderr"`
	ExitCode  int        `json:"exit_code"`
	ThoughtID *uuid.UUID `json:"thought_id,omitempty"` // set if save_output=true
}
```
**Execution model:**
1. Validate `command` against `cfg.Shell.AllowedCommands` (if non-empty) and `cfg.Shell.BlockedCommands`
2. `exec.CommandContext(ctx, "sh", "-c", command)` with `Dir` set to working dir
3. Capture stdout + stderr into `bytes.Buffer`
4. On timeout: kill process group (`syscall.Kill(-cmd.Process.Pid, syscall.SIGKILL)`), return exit code -1
5. If `SaveOutput`: call `store.InsertThought` with content = truncated stdout (max 8KB) + stderr summary
**Security controls:**
```yaml
shell:
  enabled: false
  working_dir: "/tmp/amcs-agent"  # all commands run here unless overridden
  allowed_working_dirs:           # if set, working_dir overrides must be within one of these
    - "/tmp/amcs-agent"
    - "/home/user/projects"
  timeout: 30s
  max_output_bytes: 65536         # truncate captured output beyond this
  allowed_commands: []            # empty = all; non-empty = exact binary name allowlist
  blocked_commands:               # checked before allowed_commands
    - "rm"
    - "sudo"
    - "su"
    - "curl"
    - "wget"
  save_output_by_default: false
```
The tool is registered with `mcp.Tool.Annotations` `Destructive: true` so MCP clients prompt for confirmation.
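The step-1 validation reduces to comparing the command's first token against the two lists (hypothetical `validateCommand`; a hardened version would also need to handle shell metacharacters like `;` and `|`, since the string runs under `sh -c`):

```go
package main

import (
	"fmt"
	"strings"
)

// validateCommand checks the leading binary name against the blocklist
// first, then the allowlist (an empty allowlist means everything passes).
func validateCommand(command string, allowed, blocked []string) error {
	fields := strings.Fields(command)
	if len(fields) == 0 {
		return fmt.Errorf("empty command")
	}
	bin := fields[0]
	for _, b := range blocked {
		if bin == b {
			return fmt.Errorf("command %q is blocked", bin)
		}
	}
	if len(allowed) == 0 {
		return nil
	}
	for _, a := range allowed {
		if bin == a {
			return nil
		}
	}
	return fmt.Errorf("command %q is not in the allowlist", bin)
}
```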
### 4b. File Bridge Tools
Also in `internal/tools/shell.go`:
```go
// read_file_from_path
// Input: { "path": "/abs/path/file.txt", "link_to_thought": "uuid|null" }
// Reads file from server filesystem → stores as AMCS file via store.InsertFile
// Returns: { file_id: "uuid", size_bytes: N, media_type: "text/plain" }
// write_file_to_path
// Input: { "file_id": "uuid", "path": "/abs/path/output.txt" }
// Loads AMCS file → writes to filesystem path
// Path must be within cfg.Shell.AllowedWorkingDirs if set
```
---
## Phase 5: Self-Improving Memory
### 5a. Skill Discovery Job — `internal/agent/skill_discovery.go`
Runs weekly. Algorithm:
1. Load last 30 days of `chat_histories` via `store.ListChatHistories(days=30)`
2. Extract assistant message patterns with `provider.Complete`:
```
System: Identify reusable behavioural patterns or preferences visible in these conversations.
Return a JSON array of { "name": "...", "description": "...", "tags": [...] }.
Only include patterns that would be useful across future sessions.
User: [last N assistant + user messages, newest first]
```
3. For each discovered pattern, call `store.InsertSkill` with tag `auto-discovered` and the current date
4. Link to all projects via `store.LinkSkillToProject`
Deduplication: before inserting, call `store.SearchSkills(pattern.name)` — if similarity > 0.9, skip.
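Step 2's JSON output is worth parsing defensively, since models often wrap arrays in prose or markdown fences. A sketch (the helper and struct names are assumptions):

```go
package main

import (
	"encoding/json"
	"fmt"
	"strings"
)

type DiscoveredPattern struct {
	Name        string   `json:"name"`
	Description string   `json:"description"`
	Tags        []string `json:"tags"`
}

// parsePatterns extracts the first JSON array from an LLM reply, tolerating
// surrounding prose or ``` fences by slicing between the outermost brackets.
func parsePatterns(reply string) ([]DiscoveredPattern, error) {
	start := strings.Index(reply, "[")
	end := strings.LastIndex(reply, "]")
	if start < 0 || end <= start {
		return nil, fmt.Errorf("no JSON array in reply")
	}
	var patterns []DiscoveredPattern
	if err := json.Unmarshal([]byte(reply[start:end+1]), &patterns); err != nil {
		return nil, fmt.Errorf("parse patterns: %w", err)
	}
	return patterns, nil
}
```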
### 5b. Thought Archival Job — `internal/agent/archival.go`
```go
func (j *ArchivalJob) Run(ctx context.Context) error {
	// 1. ListThoughts older than cfg.ArchiveOlderThanDays with no knowledge_node link
	//    SQL: thoughts where created_at < now() - interval '$N days'
	//      AND metadata->>'knowledge_node' IS DISTINCT FROM 'true'
	//      AND archived_at IS NULL
	//      AND id NOT IN (SELECT thought_id FROM thought_links WHERE relation = 'distilled_from')
	// 2. For each batch: store.ArchiveThought(ctx, id)
	// 3. Log the archived count.
	return nil
}
```
Uses the existing `ArchiveThought` store method — no new SQL needed.
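The step-1 selection might read, in full (sketch; `IS DISTINCT FROM` keeps rows where the key is absent entirely, and the `$1`/`$2` bindings for days and batch size are assumptions):

```sql
SELECT id
FROM thoughts
WHERE created_at < now() - make_interval(days => $1)
  AND metadata->>'knowledge_node' IS DISTINCT FROM 'true'
  AND archived_at IS NULL
  AND id NOT IN (
      SELECT thought_id FROM thought_links WHERE relation = 'distilled_from'
  )
LIMIT $2;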
---
## End-to-End Agent Loop Flow
```
Telegram message arrives
    channels/telegram/bot.go (long-poll goroutine)
        │ InboundMessage{}
    channels/router.go Handle()
        ├── Resolve sender → CRM contact (store.SearchContacts by channel_identifiers)
        ├── store.InsertThought (source="telegram", type="observation")
        ├── store.SearchSimilarThoughts (semantic context retrieval)
        ├── ai.Provider.Complete (build messages → LLM call)
        ├── store.InsertThought (source="telegram", type="assistant_response")
        ├── store.InsertChatHistory (full turn saved)
        └── channels.Channel.Send (reply dispatched to Telegram)

Meanwhile, every 24h:
    agent/engine.go ticker fires DistillJob
        ├── store.ListThoughts (recent, not yet distilled)
        ├── store.SearchSimilarThoughts (cluster by semantic similarity)
        ├── ai.Provider.Summarize (insight extraction prompt)
        ├── store.InsertThought (type="insight", knowledge_node=true)
        └── store.InsertLink (relation="distilled_from" for each source)

After distill:
    agent/living_summary.go
        ├── store.ListKnowledgeNodes
        ├── store.ListThoughts (type="daily_note", last 30 days)
        ├── ai.Provider.Summarize (MEMORY.md regeneration)
        └── store.UpsertFile (name="MEMORY.md", linked to project)
```
---
## Error Handling & Retry Strategy
| Scenario | Handling |
|----------|----------|
| LLM returns 429 | Mark model unhealthy in `modelHealth` map (existing pattern), return error, engine logs and skips tick |
| LLM returns 5xx | Same as 429 |
| Telegram 429 | Read `retry_after` from response, sleep exact duration, retry immediately |
| Telegram 5xx | Exponential backoff: 5s → 10s → 30s → 60s, reset after success |
| Telegram disconnects | Long-poll timeout naturally retries; context cancel exits cleanly |
| Agent job panics | `engine.runOnce` wraps in `recover()`, logs stack trace, marks run `failed` |
| Agent double-run | `store.StartRun` checks for `running` row < `2 * interval` old → returns `ErrAlreadyRunning`, tick skipped silently |
| Shell command timeout | `exec.CommandContext` kills process group via SIGKILL, returns exit_code=-1 and partial output |
| Distillation partial failure | Each cluster processed independently; failure of one cluster logged and skipped, others continue |
---
## Critical Files
| File | Change |
|------|--------|
| `internal/ai/provider.go` | Add `Complete()`, `CompletionMessage`, `CompletionResult` |
| `internal/ai/compat/client.go` | Implement `Complete()` on `*Client` |
| `internal/config/config.go` | Add `AgentConfig`, `ChannelsConfig`, `ShellConfig` |
| `internal/types/thought.go` | Add `KnowledgeNode`, `KnowledgeWeight`, `Distilled` to `ThoughtMetadata` |
| `internal/store/thoughts.go` | Add `ListKnowledgeNodes()` |
| `internal/store/agent.go` | New: `JobStore` interface + implementation |
| `internal/app/app.go` | Wire agent engine + channel router goroutines |
| `internal/mcpserver/server.go` | Add `Agent`, `Knowledge`, `Channels`, `Shell` to `ToolSet` |
| `internal/agent/` | New package: engine, job, distill, daily_notes, living_summary, archival, skill_discovery |
| `internal/channels/` | New package: channel interface, router, telegram/ |
| `internal/tools/agent.go` | New: list_agent_jobs, trigger_agent_job, get_agent_job_history |
| `internal/tools/knowledge.go` | New: get_knowledge_graph, distill_now |
| `internal/tools/channels.go` | New: send_channel_message, list_channel_conversations, get_channel_status |
| `internal/tools/shell.go` | New: run_shell_command, read_file_from_path, write_file_to_path |
| `migrations/021_agent_jobs.sql` | New table: agent_job_runs |
| `migrations/022_channel_contacts.sql` | ALTER professional_contacts: add channel_identifiers jsonb |
---
## Sequence / Parallelism
```
Phase 1 (Heartbeat Engine) ──► Phase 2 (Knowledge Graph)
                           └──► Phase 5 (Self-Improving)

Phase 3 (Telegram) ──► Phase 3g (Discord / Slack / Email)

Phase 4 (Shell)    [fully independent — no dependencies on other phases]
```
**Minimum viable OpenClaw competitor = Phase 1 + Phase 3** (autonomous scheduling + Telegram channel).
---
## Verification
| Phase | Test |
|-------|------|
| 1 — Heartbeat | Set `distill.interval: 1m` in dev config. Capture 5+ related thoughts. Wait 1 min. Query `thought_links` for `relation=distilled_from` rows. Check `agent_job_runs` has a `status=ok` row. |
| 1 — Daily notes | Set `daily_notes.hour` to current UTC hour. Restart server. Within 1 min, `list_thoughts` should return a `type=daily_note` entry for today. |
| 2 — Knowledge graph | Call `get_knowledge_graph` MCP tool. Verify `nodes` array contains `type=insight` thoughts with `knowledge_node=true`. Verify edges list `distilled_from` links. |
| 3 — Telegram inbound | Send a message to the configured bot. Call `search_thoughts` with the message text — should appear with `source=telegram`. |
| 3 — Telegram response | Send a question to the bot. Verify a reply arrives in Telegram. Call `list_chat_histories` — should contain the turn. |
| 4 — Shell | Call `run_shell_command` with `{"command": "echo hello", "save_output": true}`. Verify `stdout=hello\n`, `exit_code=0`, and a new thought with `type=shell_output`. |
| 4 — Blocked command | Call `run_shell_command` with `{"command": "sudo whoami"}`. Verify error returned without execution. |
| 5 — Skill discovery | Run `trigger_agent_job` with `{"job": "skill_discovery"}`. Verify new rows in `agent_skills` with tag `auto-discovered`. |
| Full loop | Send Telegram message → agent responds → distill job runs → knowledge node created from conversation → MEMORY.md regenerated with new insight. |