claude-code

mirror of https://github.com/instructkr/claude-code.git synced 2026-05-15 02:16:44 +00:00

Author	SHA1	Message	Date
bellman	d2b5f5d498	require provenance for green contracts Promote merge-ready green contracts from a level-only check to explicit provenance requirements for test commands, base freshness, recovery-attempt context, and known blocking flakes. This preserves simple level contracts while giving policy code a single satisfied-contract signal to require before merge decisions.\n\nConstraint: Task scope was limited to green_contract.rs, policy_engine.rs if needed, and narrow tests; stale_* and recovery_recipes.rs were not edited.\nRejected: Adding more boolean fields to GreenContract \| clippy flagged the shape and a requirement list is more explicit.\nConfidence: high\nScope-risk: narrow\nDirective: Treat raw test level as insufficient for merge readiness unless green contract evidence is satisfied.\nTested: cargo check --manifest-path rust/Cargo.toml -p runtime; cargo test --manifest-path rust/Cargo.toml -p runtime; cargo clippy --manifest-path rust/Cargo.toml -p runtime -- -D warnings; focused green_contract, policy_engine, and integration tests.\nNot-tested: full workspace cargo test due pre-existing rusty-claude-cli session_lifecycle_prefers_running_process_over_idle_shell failure observed before this slice.	2026-05-14 18:33:51 +09:00
bellman	607f071ca8	harden branch recovery reporting Ensure branch-recovery verification surfaces compile cleanly under focused lint by preserving trusted-root fallback without clippy noise. Constraint: G005 worker task requires verified branch/test awareness and recovery reporting evidence without mutating .omx/ultragoal. Rejected: ignoring focused clippy failure \| would leave modified tools surface with avoidable lint noise. Confidence: high Scope-risk: narrow Directive: Keep recovery surfaces machine-readable; do not collapse test hangs back into generic timeouts. Tested: cargo test -p runtime; cargo test -p tools targeted branch/hung/preflight tests; cargo check -p runtime -p tools; cargo clippy -p runtime --all-targets -- -D warnings; cargo clippy -p tools --lib --no-deps -- -D warnings. Not-tested: full cargo test -p tools remains red on pre-existing permission-enforcer expectation failures unrelated to this change.	2026-05-14 18:33:48 +09:00
bellman	d3f8ff9916	omx(team): auto-checkpoint worker-1 [1]	2026-05-14 18:28:21 +09:00
bellman	5c40d4e778	omx(team): auto-checkpoint worker-3 [4]	2026-05-14 18:26:55 +09:00
bellman	5625ba597b	omx(team): auto-checkpoint worker-1 [1]	2026-05-14 18:26:49 +09:00
bellman	6a37442ee1	omx(team): auto-checkpoint worker-2 [3]	2026-05-14 18:24:51 +09:00
bellman	0bca524c8c	omx(team): auto-checkpoint worker-1 [1]	2026-05-14 18:22:37 +09:00
bellman	1fbde9f47f	omx(team): auto-checkpoint worker-1 [1]	2026-05-14 18:21:26 +09:00
bellman	0b0d55d7ec	omx(team): auto-checkpoint worker-1 [1]	2026-05-14 18:11:53 +09:00
bellman	7214573f35	Keep approval token contracts in their own runtime module Constraint: G004 task 3 now owns approval-token contracts through rust/crates/runtime/src/approval_tokens.rs, while auto-integration left a duplicate unused copy in permissions.rs.\nRejected: suppressing dead-code warnings \| the duplicate implementation was obsolete after the dedicated module landed.\nConfidence: high\nScope-risk: narrow\nDirective: Keep permission-mode authorization in permissions.rs and approval-token policy handoff in approval_tokens.rs.\nTested: cargo fmt --manifest-path rust/Cargo.toml --all -- --check; cargo check --manifest-path rust/Cargo.toml -p runtime; cargo test --manifest-path rust/Cargo.toml -p runtime approval_token -- --nocapture; cargo test --manifest-path rust/Cargo.toml -p runtime --test g004_conformance -- --nocapture\nNot-tested: full workspace test suite; G004 tasks 2/4/5 remain non-terminal.\n\nCo-authored-by: OmX <omx@oh-my-codex.dev>	2026-05-14 18:11:20 +09:00
bellman	dcf11f8190	harden report contract projection identity Add a runtime report schema v1 contract so downstream consumers can negotiate structured fields, verify canonical report identity, and audit projection redactions without reverse-engineering prose.\n\nConstraint: Task 2 scope was limited to report schema/projection/redaction modules/docs/tests and prohibited .omx/ultragoal mutation.\nRejected: Wiring into broader CLI report emitters \| kept this slice focused on the reusable contract and deterministic fixtures.\nConfidence: high\nScope-risk: narrow\nDirective: Future report emitters should build canonical payloads through CanonicalReportV1 before projecting audience-specific views.\nTested: cargo test -p runtime report_schema -- --nocapture; cargo test -p runtime lane_events -- --nocapture; cargo check -p runtime\nNot-tested: cargo clippy -p runtime --all-targets -- -D warnings remains blocked by pre-existing non-task warnings in compact.rs, file_ops.rs, policy_engine.rs, sandbox.rs.	2026-05-14 18:09:36 +09:00
bellman	e1641aa010	Prove G004 contract bundles are machine-checkable Constraint: Task 6 needed a regression harness without overwriting Task 1-4 implementation files.\nRejected: Editing lane_events/report-schema/approval-token owners directly \| would create shared-file conflicts with active lanes.\nConfidence: high\nScope-risk: narrow\nDirective: Keep this harness as a consumer-facing conformance layer; extend fixtures after Task 2/3 land schema/token producers.\nTested: cd rust && cargo test -p runtime --test g004_conformance -- --nocapture; cd rust && cargo check -p runtime; cd rust && cargo fmt --check; git diff --check\nNot-tested: cargo clippy -p runtime --tests -- -D warnings fails on pre-existing runtime lint debt outside changed files.	2026-05-14 18:07:11 +09:00
bellman	5cebdd999d	omx(team): auto-checkpoint worker-2 [3]	2026-05-14 18:07:05 +09:00
bellman	bf533d77a7	task: approval token chain Add a runtime approval-token ledger so policy-blocked actions can require scoped owner grants, consume one-time tokens, reject replay, and retain delegation traceability.\n\nConstraint: Task 3 scope is the G004 approval-token chain for runtime event/report contract families.\nRejected: Extending the existing permission prompt path directly \| the token contract can be tested independently without changing live tool authorization behavior.\nConfidence: high\nScope-risk: narrow\nDirective: Keep approval grants scoped to policy/action/repo/branch before wiring them into external execution paths.\nTested: cargo check --manifest-path rust/Cargo.toml --workspace; cargo test --manifest-path rust/crates/runtime/Cargo.toml; cargo test --manifest-path rust/crates/runtime/Cargo.toml approval_token -- --nocapture\nNot-tested: cargo clippy --manifest-path rust/crates/runtime/Cargo.toml --all-targets -- -D warnings is blocked by pre-existing warnings in compact.rs, file_ops.rs, policy_engine.rs, and sandbox.rs.	2026-05-14 18:07:03 +09:00
bellman	e34209ff7f	omx(team): auto-checkpoint worker-2 [3]	2026-05-14 18:07:00 +09:00
bellman	ff37d395bb	Stabilize G004 contract integration after worker merges Constraint: G004 worker integrations introduced unparseable approval-token tests and a conformance path bug that blocked leader verification.\nRejected: waiting for another auto-integration cycle \| local leader verification had exact parse and fixture failures to repair safely.\nConfidence: high\nScope-risk: moderate\nDirective: Keep approval-token regression tests in cfg(test) modules or integration tests, never inside type definitions.\nTested: cargo fmt --manifest-path rust/Cargo.toml --all -- --check; cargo check --manifest-path rust/Cargo.toml -p runtime; cargo test --manifest-path rust/Cargo.toml -p runtime approval_token -- --nocapture; cargo test --manifest-path rust/Cargo.toml -p runtime --test g004_conformance -- --nocapture; python3 .github/scripts/check_doc_source_of_truth.py\nNot-tested: full workspace test suite; remaining G004 tasks 1-5 still non-terminal.\n\nCo-authored-by: OmX <omx@oh-my-codex.dev>	2026-05-14 18:06:14 +09:00
bellman	f8d744bb37	omx(team): auto-checkpoint worker-1 [1]	2026-05-14 18:05:26 +09:00
bellman	c8c936ede1	omx(team): auto-checkpoint worker-3 [6]	2026-05-14 18:00:23 +09:00
bellman	57b3e3258b	omx(team): auto-checkpoint worker-2 [3]	2026-05-14 18:00:19 +09:00
bellman	06e545325d	omx(team): auto-checkpoint worker-1 [1]	2026-05-14 18:00:16 +09:00
bellman	f4e08d0ecf	omx(team): auto-checkpoint worker-2 [3]	2026-05-14 17:58:46 +09:00
bellman	16d6525de4	omx(team): auto-checkpoint worker-2 [3]	2026-05-14 17:57:59 +09:00
bellman	aec291caab	omx(team): auto-checkpoint worker-4 [unknown]	2026-05-14 17:51:53 +09:00
bellman	43b182882a	Lock doctor JSON boot preflight contract Constraint: G003 boot/session work adds a structured doctor boot-preflight check that must be visible in JSON output. Rejected: reducing the doctor check count back to six \| boot preflight is an explicit G003 acceptance surface. Confidence: high Scope-risk: narrow Directive: Keep doctor/status JSON contract tests aligned with boot_preflight schema fields when extending preflight diagnostics. Tested: git diff --check; cargo fmt --manifest-path rust/Cargo.toml --all -- --check; cargo test --manifest-path rust/Cargo.toml -p runtime trusted_roots -- --nocapture; cargo test --manifest-path rust/Cargo.toml -p runtime startup -- --nocapture; cargo test --manifest-path rust/Cargo.toml -p runtime worker_boot -- --nocapture; cargo test --manifest-path rust/Cargo.toml -p tools path_scope -- --nocapture; cargo test --manifest-path rust/Cargo.toml -p rusty-claude-cli --test output_format_contract -- --nocapture; cargo check --manifest-path rust/Cargo.toml --workspace Not-tested: full cargo test --workspace remains deferred during active G003 team reconciliation. Co-authored-by: OmX <omx@oh-my-codex.dev>	2026-05-14 17:51:47 +09:00
bellman	307b23d27f	omx(team): auto-checkpoint worker-4 [unknown]	2026-05-14 17:50:36 +09:00
bellman	8c11dd16f4	task: preserve startup no-evidence timestamp evidence Lock the startup-no-evidence contract so prompt timestamps remain the original send time while lifecycle and pane timestamps prove timeout ordering. Constraint: task 4 scope limited changes to runtime worker boot/session/startup modules and tests; .omx/ultragoal not mutated. Rejected: CLI-surface changes \| runtime evidence contract already exposes the typed worker.startup_no_evidence payload. Confidence: high Scope-risk: narrow Directive: Keep startup timeout evidence timestamps stable across later lifecycle observations. Tested: cargo test -p runtime worker_boot -- --nocapture; cargo check --workspace Not-tested: cargo clippy -p runtime --tests -- -D warnings is blocked by pre-existing runtime warnings in compact.rs, file_ops.rs, policy_engine.rs, and sandbox.rs.	2026-05-14 17:50:33 +09:00
bellman	79d3b809f9	omx(team): auto-checkpoint worker-4 [unknown]	2026-05-14 17:46:16 +09:00
bellman	9ec4d8398e	omx(team): auto-checkpoint worker-3 [unknown]	2026-05-14 17:46:13 +09:00
bellman	5f45740408	omx(team): auto-checkpoint worker-2 [unknown]	2026-05-14 17:46:10 +09:00
bellman	675d9ddc78	Harden workspace path classification Canonicalize absolute shell path operands before comparing them with the workspace root so symlink-expanded reads cannot be downgraded under workspace-write enforcement. Also resolves local clippy findings in the touched tools crate so targeted linting can run cleanly.\n\nConstraint: Task 1 scope is workspace/path scope enforcement only; do not mutate .omx/ultragoal.\nRejected: Editing shared path-scope regression tests \| worker-3 owns that test coverage and the current tests already prove the contract.\nConfidence: high\nScope-risk: narrow\nDirective: Keep shell/file permission classification canonical-path based before permitting workspace-write execution.\nTested: ../scripts/fmt.sh --check; cargo test -p tools --test path_scope_enforcement -- --nocapture; cargo test -p tools given_workspace_write_enforcer_when_bash -- --nocapture; cargo check -p tools; cargo clippy -p tools --all-targets --no-deps -- -D warnings\nNot-tested: Full workspace clippy still has known unrelated runtime crate warnings outside this task scope.	2026-05-14 17:46:07 +09:00
bellman	087e31d190	Keep G003 integrated runtime tests compiling Constraint: G003 worker outputs added config and startup evidence fields that must compile under focused runtime validation before leader push. Rejected: pushing auto-checkpoints without leader validation \| integrated tests initially failed to compile due missing imports and stale StartupEvidenceBundle fixtures. Confidence: high Scope-risk: narrow Directive: When extending StartupEvidenceBundle, update all in-crate fixtures in the same change. Tested: git diff --check; cargo fmt --manifest-path rust/Cargo.toml --all -- --check; cargo test --manifest-path rust/Cargo.toml -p runtime trusted_roots -- --nocapture; cargo test --manifest-path rust/Cargo.toml -p runtime startup -- --nocapture; cargo test --manifest-path rust/Cargo.toml -p runtime worker_boot -- --nocapture; cargo test --manifest-path rust/Cargo.toml -p tools path_scope -- --nocapture; cargo check --manifest-path rust/Cargo.toml --workspace Not-tested: full cargo test --workspace remains deferred during active G003 team work. Co-authored-by: OmX <omx@oh-my-codex.dev>	2026-05-14 17:45:46 +09:00
bellman	a6ee51baab	omx(team): auto-checkpoint worker-3 [unknown]	2026-05-14 17:40:32 +09:00
bellman	6df60a4683	omx(team): auto-checkpoint worker-2 [unknown]	2026-05-14 17:40:29 +09:00
bellman	964458ad4a	omx(team): auto-checkpoint worker-1 [1]	2026-05-14 17:38:59 +09:00
bellman	ac888623a8	Merge commit '3a8ce832341884322ede0855b150e3ceebe9180d'	2026-05-14 17:34:07 +09:00
bellman	3a8ce83234	Deny scoped file reads before tool dispatch Worker-3's path-scope regression showed outside read_file paths were blocked by the workspace wrapper after dispatch instead of by the permission enforcer. File, glob, and grep tools now classify path scope before dispatch and require danger-full-access for paths that resolve outside the current workspace. Constraint: G002-alpha-security requires permission-mode event/status visibility for blocked file and shell paths Rejected: relying only on runtime wrapper errors \| it hides the active permission-mode denial contract from callers Confidence: high Scope-risk: narrow Directive: keep path-sensitive tool permission classification aligned with workspace wrapper resolution Tested: cargo test -p tools --test path_scope_enforcement --manifest-path rust/Cargo.toml --quiet; cargo test -p tools given_workspace_write_enforcer_when_bash --manifest-path rust/Cargo.toml --quiet; cargo check --manifest-path rust/Cargo.toml --workspace; cargo fmt --all --manifest-path rust/Cargo.toml -- --check Not-tested: full workspace test suite after this small permission-classification follow-up Co-authored-by: OmX <omx@oh-my-codex.dev>	2026-05-14 17:34:03 +09:00
bellman	37b2b75287	Keep G002 path-scope tests aligned with enforced denials Constraint: G002-alpha-security requires direct file-tool escapes to fail before reads while accepting the canonical runtime error text. Rejected: weakening the test to accept successful reads \| the verified behavior denies the escape and only the assertion vocabulary was stale. Confidence: high Scope-risk: narrow Directive: Keep path-scope tests asserting denial semantics, not a single legacy wording. Tested: cargo fmt --manifest-path rust/Cargo.toml --all -- --check; cargo test --manifest-path rust/Cargo.toml -p tools path_scope -- --nocapture; cargo test --manifest-path rust/Cargo.toml -p tools --test path_scope_enforcement -- --nocapture; cargo test --manifest-path rust/Cargo.toml -p runtime workspace_ -- --nocapture; cargo test --manifest-path rust/Cargo.toml -p rusty-claude-cli --test output_format_contract -- --nocapture; python3 -m pytest tests/test_security_scope.py -q; cargo check --manifest-path rust/Cargo.toml --workspace; git diff --check Not-tested: full cargo test --workspace due known unrelated session_lifecycle_prefers_running_process_over_idle_shell failure. Co-authored-by: OmX <omx@oh-my-codex.dev>	2026-05-14 17:33:47 +09:00
bellman	f2dc615a8a	Prevent workspace escape through tool path resolution File and shell tool dispatch now resolves path-sensitive operations through workspace-scoped wrappers so direct paths, globs, symlinks, shell expansion, and Windows absolute path probes fail before execution when they leave the workspace. Constraint: G002-alpha-security requires alpha-blocking workspace/path scope enforcement without mutating .omx/ultragoal Rejected: string-prefix only checks \| they miss canonical symlink and glob expansion escapes Confidence: high Scope-risk: moderate Directive: keep new file/shell tool entrypoints wired through workspace-aware wrappers before dispatch Tested: python3 -m unittest discover -s tests -v; python3 -m compileall -q src tests; cargo test -p runtime workspace --manifest-path rust/Cargo.toml --quiet; cargo test -p tools workspace --manifest-path rust/Cargo.toml --quiet; cargo test -p tools given_workspace_write_enforcer_when_bash --manifest-path rust/Cargo.toml --quiet; cargo test -p tools file_tools_reject --manifest-path rust/Cargo.toml --quiet; cargo fmt --all --manifest-path rust/Cargo.toml -- --check; cargo check --manifest-path rust/Cargo.toml --workspace Not-tested: full unfiltered cargo test workspace due task-time constraints; targeted runtime/tools workspace security tests and full cargo check passed Co-authored-by: OmX <omx@oh-my-codex.dev>	2026-05-14 17:30:57 +09:00
bellman	9bc55f9946	omx(team): auto-checkpoint worker-1 [1]	2026-05-14 17:30:54 +09:00
bellman	180ebb3b02	Reject Windows absolute PowerShell paths from workspace scope The G002 security gate caught that PowerShell path classification still treated Windows absolute paths as workspace-relative on POSIX, so workspace scope now rejects those tokens before permission downgrades. Constraint: G002-alpha-security requires workspace/path scope across Windows path cases as well as direct paths, symlinks, globbing, shell expansion, and worktrees. Rejected: Relying on PathBuf::is_absolute for Windows syntax on POSIX \| it treats C:\ and UNC-like tokens as relative and weakens permission classification. Confidence: high Scope-risk: narrow Directive: Keep bash and PowerShell path classifiers aligned whenever new shell syntax is admitted. Tested: cargo test --manifest-path rust/Cargo.toml -p tools path_scope -- --nocapture; cargo test --manifest-path rust/Cargo.toml -p tools --test path_scope_enforcement -- --nocapture; cargo test --manifest-path rust/Cargo.toml -p runtime workspace_ -- --nocapture; python3 -m pytest tests/test_security_scope.py -q; cargo check --manifest-path rust/Cargo.toml --workspace. Not-tested: Full cargo test --workspace still has existing unrelated rusty-claude-cli session lifecycle failure reported by workers. Co-authored-by: OmX <omx@oh-my-codex.dev>	2026-05-14 17:29:57 +09:00
bellman	9c2ebb4f39	task: prefer tests before fixes Add focused regression coverage for path-scope enforcement before implementation changes land, preserving worker-1 ownership of the fix path. Constraint: task 4 requested tests-first coverage for direct path, symlink, glob/shell expansion, worktree, and Windows-style path cases.\nRejected: implementation edits in enforcement code \| worker-1 owns minimal implementation changes.\nConfidence: high\nScope-risk: narrow\nDirective: Keep these regressions red until path canonicalization/enforcement blocks outside-workspace reads before dispatch.\nTested: cargo fmt -p tools -- --check; cargo check -p tools; cargo clippy -p tools --test path_scope_enforcement (warnings only, pre-existing); cargo test -p tools --test path_scope_enforcement (expected red: 4 failing path-scope gaps, 2 passing baselines).\nNot-tested: Full workspace test suite because the new regression tests intentionally fail until implementation lands.	2026-05-14 17:29:31 +09:00
bellman	2c48400293	omx(team): auto-checkpoint worker-3 [4]	2026-05-14 17:27:21 +09:00
bellman	713ca7aee4	omx(team): auto-checkpoint worker-1 [1]	2026-05-14 17:27:18 +09:00
bellman	02b591ac64	omx(team): auto-checkpoint worker-3 [4]	2026-05-14 17:22:09 +09:00
bellman	f789525839	omx(team): auto-checkpoint worker-1 [1]	2026-05-14 17:22:06 +09:00
bellman	17da2964d7	omx(team): auto-checkpoint worker-3 [4]	2026-05-14 17:18:58 +09:00
bellman	9ab569e626	omx(team): auto-checkpoint worker-2 [3]	2026-05-14 17:18:55 +09:00
Jobdori	b98b9a712e	fix(fmt): expand Thinking struct literals to pass cargo fmt	2026-05-09 15:52:54 +09:00
YeonGyu-Kim	357629dbd9	fix(skills): route help flags to local dispatch + fix push_output_block test arity Cherry-pick from Yeachan-Heo's #2945 with manual conflict resolution: - classify_skills_slash_command now catches -h/--help anywhere in args - Restored pending_thinking parameter in push_output_block test calls Co-authored-by: Yeachan-Heo <bellman@ultraworkers.dev>	2026-05-06 15:41:25 +09:00
YeonGyu-Kim	75c08bc982	fix: REPL display, /compact panic, identity leak, DeepSeek reasoning, thinking blocks Five interrelated fixes from parallel Hephaestus sessions: 1. fix(repl): display assistant text after spinner (#2981, #2982, #2937) - Added final_assistant_text() call after run_turn spinner completes - REPL now shows response text like run_prompt_json does 2. fix(compact): handle Thinking content blocks (#2985) - Added ContentBlock::Thinking variant throughout compact summarizer - Prevents panic when /compact encounters thinking blocks 3. fix(prompt): provider-aware model identity (#2822) - New ModelFamilyIdentity enum (Claude vs Generic) - Non-Anthropic models no longer say 'I am Claude' - model_family_identity_for() detects provider and sets identity 4. fix(openai): preserve DeepSeek reasoning_content (#2821) - Stream parser now captures reasoning_content from OpenAI-compat - Emits ThinkingDelta/SignatureDelta events for reasoning models - Thinking blocks included in conversation history for re-send 5. feat(runtime): Thinking block support across codebase - AssistantEvent::Thinking variant in conversation.rs - ContentBlock::Thinking in session serialization - Thinking-aware compact summarization - Tests for thinking block ordering and content Closes #2981, #2982, #2937, #2985, #2822, #2821	2026-05-06 15:32:34 +09:00

1 2 3 4 5 ...

738 Commits