mirror of https://github.com/instructkr/claude-code.git synced 2026-05-26 07:26:46 +00:00

Files

gismo212 52572d5883 docs: personal assistant roadmap

Adds personal assistant roadmap and concept documentation.

2026-05-25 11:24:55 +09:00

4.6 KiB

Raw Blame History

From Claw Code to a Personal AI Assistant (Life OS)

This document turns the current “developer CLI agent” direction into a concrete path toward a personal AI assistant: a multi-channel interface (chat/voice), personal memory (RAG for life), tool/action integrations (MCP + plugins), proactivity (OmX-style loops), and long-lived identity (sessions + profile).

It is intentionally pragmatic: each section has MVP scope, next step, and evolution.

1) Interface: out of the terminal

Goal

Make claw usable without opening an IDE or terminal — from a phone, from chat, and eventually by voice.

MVP

Chat bridge: a small service that relays messages from Discord (primary) or Telegram to claw / claw-analog.
- Treat the chat thread as the “front-end”, and claw as the execution runtime.
- Map a channel/thread to a session id (resume/append).
Basic UX: slash-like commands in chat:
- /prompt …, /resume latest, /status, /cost, /help
- “safe mode” defaults (read-only) unless elevated explicitly.

Next step

Voice:
- Speech-to-text input (e.g. Whisper-class STT) into the same chat bridge.
- Text-to-speech output for hands-free feedback.

Evolution

Multi-modal: attachments (images/PDF) routed into ingest/personal memory.
Presence and notifications: summaries pushed back into chat.

2) Memory: from “RAG for code” to “RAG for life”

Goal

Let the assistant answer personal questions and make decisions using your long-term context, not only the current repo.

MVP

Extend ingestion inputs beyond git workspaces:
- Notes (Markdown), exported chats, simple text logs.
- PDFs (initially text extraction outside Rust is OK; later: built-in pipeline).
Keep a clear separation:
- Work RAG (code/workspaces)
- Personal RAG (notes, plans, history)

Next step

Evolve retrieve_context into a multi-source retrieval tool:
- “where to search” selector (work/personal/both)
- metadata filters (source, date ranges, tags)

Evolution

Incremental ingestion + event-based updates (watch folders, chat events).
Better stores (ANN/Qdrant/etc) when scale demands it.

3) Hands: tools, MCP, plugins

Goal

The assistant is valuable because it can do things, not only talk.

MVP

Wire in external systems via MCP servers:
- Calendar, notes (Notion), email, task trackers, smart home (as available).
Establish a convention for “personal skills”:
- a dedicated directory (e.g. .claw/skills/) for user-specific automations
- small, composable tools (digest, budgeting, reminders) rather than monoliths

Next step

“Tool discovery” UX: list available MCP/tools/skills directly from chat.
Permission boundaries per tool category (read vs write, destructive actions require explicit confirmation).

Evolution

Plugin marketplace flows for reusing “skills”.
Audit logging and replay of actions.

4) Proactivity: OmX-style loops

Goal

Move from reactive “answer me” to proactive “notice + prepare + propose + execute”.

MVP

A scheduled runner that periodically:
- checks inbox/notifications
- extracts actionable tasks
- drafts responses
- posts a short digest to chat

Next step

Multi-agent patterns (Architect/Executor/Reviewer) for higher reliability:
- executor proposes actions
- reviewer validates safety and correctness
- only then does the bridge run the write/action tool

Evolution

Event-driven triggers (webhooks) instead of only cron.
“Autopilot” modes with bounded scopes (time, tools, spend limits).

5) Long-lived identity: sessions + profile

Goal

Make the assistant feel continuous and personalized across days/weeks.

MVP

Default to resuming the latest session (--resume latest-style behavior).
Use a short, user-owned profile/system-prompt for tone and preferences.

Next step

Separate:
- “personality” (style, preferences)
- “memory” (facts, history)
- “policies” (permissions, safety rules)

Evolution

Multiple personas (work/personal) with explicit switching.
Transparent memory controls (“forget this”, “store this”).

Suggested milestone sequence

Discord bridge + session mapping (no new AI capabilities; just distribution).
Personal ingest source #1 (notes folder) + retrieval selector (personal/work).
One MCP integration (calendar or notes) + a single “daily digest” skill.
Scheduled digest loop (cron) with bounded permissions.
Voice input/output on top of the same bridge.

4.6 KiB Raw Blame History

From Claw Code to a Personal AI Assistant (Life OS)

1) Interface: out of the terminal

Goal

MVP

Next step

Evolution

2) Memory: from “RAG for code” to “RAG for life”

Goal

MVP

Next step

Evolution

3) Hands: tools, MCP, plugins

Goal

MVP

Next step

Evolution

4) Proactivity: OmX-style loops

Goal

MVP

Next step

Evolution

5) Long-lived identity: sessions + profile

Goal

MVP

Next step

Evolution

Suggested milestone sequence

4.6 KiB

Raw Blame History