4.6 KiB
From Claw Code to a Personal AI Assistant (Life OS)
This document turns the current “developer CLI agent” direction into a concrete path toward a personal AI assistant: a multi-channel interface (chat/voice), personal memory (RAG for life), tool/action integrations (MCP + plugins), proactivity (OmX-style loops), and long-lived identity (sessions + profile).
It is intentionally pragmatic: each section has MVP scope, next step, and evolution.
1) Interface: out of the terminal
Goal
Make claw usable without opening an IDE or terminal — from a phone, from chat, and eventually by voice.
MVP
- Chat bridge: a small service that relays messages from Discord (primary) or Telegram to
claw/claw-analog.- Treat the chat thread as the “front-end”, and
clawas the execution runtime. - Map a channel/thread to a session id (resume/append).
- Treat the chat thread as the “front-end”, and
- Basic UX: slash-like commands in chat:
/prompt …,/resume latest,/status,/cost,/help- “safe mode” defaults (read-only) unless elevated explicitly.
Next step
- Voice:
- Speech-to-text input (e.g. Whisper-class STT) into the same chat bridge.
- Text-to-speech output for hands-free feedback.
Evolution
- Multi-modal: attachments (images/PDF) routed into ingest/personal memory.
- Presence and notifications: summaries pushed back into chat.
2) Memory: from “RAG for code” to “RAG for life”
Goal
Let the assistant answer personal questions and make decisions using your long-term context, not only the current repo.
MVP
- Extend ingestion inputs beyond git workspaces:
- Notes (Markdown), exported chats, simple text logs.
- PDFs (initially text extraction outside Rust is OK; later: built-in pipeline).
- Keep a clear separation:
- Work RAG (code/workspaces)
- Personal RAG (notes, plans, history)
Next step
- Evolve
retrieve_contextinto a multi-source retrieval tool:- “where to search” selector (work/personal/both)
- metadata filters (source, date ranges, tags)
Evolution
- Incremental ingestion + event-based updates (watch folders, chat events).
- Better stores (ANN/Qdrant/etc) when scale demands it.
3) Hands: tools, MCP, plugins
Goal
The assistant is valuable because it can do things, not only talk.
MVP
- Wire in external systems via MCP servers:
- Calendar, notes (Notion), email, task trackers, smart home (as available).
- Establish a convention for “personal skills”:
- a dedicated directory (e.g.
.claw/skills/) for user-specific automations - small, composable tools (digest, budgeting, reminders) rather than monoliths
- a dedicated directory (e.g.
Next step
- “Tool discovery” UX: list available MCP/tools/skills directly from chat.
- Permission boundaries per tool category (read vs write, destructive actions require explicit confirmation).
Evolution
- Plugin marketplace flows for reusing “skills”.
- Audit logging and replay of actions.
4) Proactivity: OmX-style loops
Goal
Move from reactive “answer me” to proactive “notice + prepare + propose + execute”.
MVP
- A scheduled runner that periodically:
- checks inbox/notifications
- extracts actionable tasks
- drafts responses
- posts a short digest to chat
Next step
- Multi-agent patterns (Architect/Executor/Reviewer) for higher reliability:
- executor proposes actions
- reviewer validates safety and correctness
- only then does the bridge run the write/action tool
Evolution
- Event-driven triggers (webhooks) instead of only cron.
- “Autopilot” modes with bounded scopes (time, tools, spend limits).
5) Long-lived identity: sessions + profile
Goal
Make the assistant feel continuous and personalized across days/weeks.
MVP
- Default to resuming the latest session (
--resume latest-style behavior). - Use a short, user-owned profile/system-prompt for tone and preferences.
Next step
- Separate:
- “personality” (style, preferences)
- “memory” (facts, history)
- “policies” (permissions, safety rules)
Evolution
- Multiple personas (work/personal) with explicit switching.
- Transparent memory controls (“forget this”, “store this”).
Suggested milestone sequence
- Discord bridge + session mapping (no new AI capabilities; just distribution).
- Personal ingest source #1 (notes folder) + retrieval selector (personal/work).
- One MCP integration (calendar or notes) + a single “daily digest” skill.
- Scheduled digest loop (cron) with bounded permissions.
- Voice input/output on top of the same bridge.