fix(vault): remove safe-session runtime dependency #619

Merged
pdurlej merged 1 commit from codex/m04-safe-session-vault-dependency-cleanup into main 2026-05-30 02:33:56 +02:00
Collaborator

Canary status: missing — fire canary 3+3 manually before merge

Summary

Removes safe-session-api as a live HashiCorp Vault runtime dependency after the approved M04 local OpenSSH CA cutover.

What changed

  • Makes local-openssh-ca the default safe-session signer mode.
  • Removes Vault env/mount/depends_on from the normal safe-session-api compose path.
  • Adds compose/overlays/safe-session-vault-signer.yaml as the explicit rollback path.
  • Tightens vault-sunset-readiness.sh so active HashiCorp Vault dependency env names fail, while unrelated VAULT_* names are reported as inventory.
  • Documents that post-M01/M04 DR refresh must happen before destructive Vault cleanup.
  • Adds tests for readiness pass/fail classification.

Runtime evidence

  • M04-B runtime cutover was operator-approved with m04-vault-runtime-cutover-approved.
  • Rebuilt and recreated only home-platform-safe-session-api-1 with the local CA overlay.
  • safe-session-api: running/healthy.
  • Vault: still running/healthy as rollback material.
  • Local signer smoke: pass for llmops, denied root.
  • SSH login smoke with issued cert: pass for llmops.
  • Vault sunset readiness: pass.
  • Readiness inventory still reports old home-platform-np-1 names: VAULT_BOOTSTRAP_EXAMPLES, VAULT_ROOT, VAULT_TIMEZONE; these are not classified as HashiCorp Vault dependency env names.

Validation

  • pytest -q tests/test_vault_sunset_readiness.py tests/test_safe_session_ca_bootstrap.py scripts/safe-session-api/tests/test_local_signer.py -> 12 passed.
  • PYTHONPATH=control-plane uv run --project control-plane python -m platformctl.cli validate all --json -> exitCode 0.
  • RS2000 rollback overlay config: pass.

Non-goals

  • Does not stop Vault.
  • Does not delete Vault data, snapshots, env files, image, container, or CA material.
  • Does not run quarantine.
  • Does not run destructive cleanup.
  • Does not replace the required post-M01/M04 DR refresh gate.

Spec sources read

  • compose/core/compose.yaml
  • compose/overlays/safe-session-local-ca.yaml
  • scripts/safe-session-api/Dockerfile
  • scripts/safe-session-api/server.py
  • scripts/cutover/vault-sunset-readiness.sh
  • runbooks/safe-session-local-ca-cutover.md
  • runbooks/vault-quarantine-and-sunset.md

Requested decision

Review/merge as the M04-C cleanup PR. Next runtime gate is quarantine only after explicit m04-vault-quarantine-approved; destructive cleanup remains blocked until post-M01/M04 DR refresh and explicit destructive approval.

Canary status: missing — fire canary 3+3 manually before merge ## Summary Removes `safe-session-api` as a live HashiCorp Vault runtime dependency after the approved M04 local OpenSSH CA cutover. ## What changed - Makes `local-openssh-ca` the default safe-session signer mode. - Removes Vault env/mount/depends_on from the normal `safe-session-api` compose path. - Adds `compose/overlays/safe-session-vault-signer.yaml` as the explicit rollback path. - Tightens `vault-sunset-readiness.sh` so active HashiCorp Vault dependency env names fail, while unrelated `VAULT_*` names are reported as inventory. - Documents that post-M01/M04 DR refresh must happen before destructive Vault cleanup. - Adds tests for readiness pass/fail classification. ## Runtime evidence - M04-B runtime cutover was operator-approved with `m04-vault-runtime-cutover-approved`. - Rebuilt and recreated only `home-platform-safe-session-api-1` with the local CA overlay. - `safe-session-api`: running/healthy. - `Vault`: still running/healthy as rollback material. - Local signer smoke: pass for `llmops`, denied `root`. - SSH login smoke with issued cert: pass for `llmops`. - Vault sunset readiness: pass. - Readiness inventory still reports old `home-platform-np-1` names: `VAULT_BOOTSTRAP_EXAMPLES`, `VAULT_ROOT`, `VAULT_TIMEZONE`; these are not classified as HashiCorp Vault dependency env names. ## Validation - `pytest -q tests/test_vault_sunset_readiness.py tests/test_safe_session_ca_bootstrap.py scripts/safe-session-api/tests/test_local_signer.py` -> 12 passed. - `PYTHONPATH=control-plane uv run --project control-plane python -m platformctl.cli validate all --json` -> exitCode 0. - RS2000 rollback overlay config: pass. ## Non-goals - Does not stop Vault. - Does not delete Vault data, snapshots, env files, image, container, or CA material. - Does not run quarantine. - Does not run destructive cleanup. - Does not replace the required post-M01/M04 DR refresh gate. ## Spec sources read - `compose/core/compose.yaml` - `compose/overlays/safe-session-local-ca.yaml` - `scripts/safe-session-api/Dockerfile` - `scripts/safe-session-api/server.py` - `scripts/cutover/vault-sunset-readiness.sh` - `runbooks/safe-session-local-ca-cutover.md` - `runbooks/vault-quarantine-and-sunset.md` ## Requested decision Review/merge as the M04-C cleanup PR. Next runtime gate is quarantine only after explicit `m04-vault-quarantine-approved`; destructive cleanup remains blocked until post-M01/M04 DR refresh and explicit destructive approval.
fix(vault): remove safe-session runtime dependency
Some checks failed
base-is-main / guard (pull_request) Successful in 1s
canary-required / collect-diff (pull_request) Successful in 5s
patchwarden-client-dry-run / collect-diff (pull_request) Successful in 5s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 4s
platformctl plan / auto-apply scope (pull_request) Successful in 23s
python-ci / Python 3.11 (pull_request) Successful in 43s
patchwarden-pr-sanity / sanity (pull_request) Failing after 3m43s
python-ci / Python 3.12 (pull_request) Successful in 43s
python-ci / Python 3.13 (pull_request) Successful in 43s
canary-required / canary (pull_request) Successful in 13s
patchwarden-client-dry-run / dry-run (pull_request) Successful in 22s
75819b0e80
Make local OpenSSH CA the safe-session default after M04 cutover, move the Vault signer path into an explicit rollback overlay, and tighten Vault sunset readiness around actual HashiCorp Vault dependency env names.\n\nReadiness now reports unrelated VAULT_* names as inventory while failing on active Vault dependency variables. The quarantine runbook also records that post-M01/M04 DR refresh must precede destructive Vault cleanup.
Author
Collaborator

Patchwarden PR sanity

  • Status: eligible_sanity_clean
  • PR: 619
  • Commit: fc45324172f3cae860c76f7f998351bc159e3673
  • Security-sensitive label: present
  • Authority: advisory model review plus deterministic blockers only
  • 3+3 canary: still alive; this does not replace it

Deterministic findings

No deterministic findings.

Model reviewers

global-glm / glm-5.1:cloud

  • Status: error
  • Verdict: -
  • Note: ReadTimeout: The read operation timed out
  • Findings: none

global-deepseek / deepseek-v4-pro:cloud

  • Status: ok

  • Verdict: OK

  • low Readiness script may miss future Vault dependency env names

    • Evidence: scripts/cutover/vault-sunset-readiness.sh: is_hashicorp_vault_env_name() lists a fixed set of known HashiCorp Vault env names. If a new dependency uses an unlisted name (e.g., a custom or future Vault env var), it would be classified as inv
    • Next: Document the list as exhaustive for the current Vault integration and add a periodic review step in the runbook to update the list if Vault usage evolves.

redteam / kimi-k2.6:cloud

  • Status: error
  • Verdict: -
  • Note: ReadTimeout: The read operation timed out
  • Findings: none

Policy notes

  • GLM 5.1 + DeepSeek V4 Pro are the operator-required model mix for this bot.
  • Optional red-team model is enabled only when PLATFORMCTL_PR_SANITY_REDTEAM_MODEL is configured.
  • Auto-merge is not enabled here.
<!-- patchwarden-pr-sanity:pdurlej/platform:PR-619 --> # Patchwarden PR sanity - Status: `eligible_sanity_clean` - PR: `619` - Commit: `fc45324172f3cae860c76f7f998351bc159e3673` - Security-sensitive label: `present` - Authority: advisory model review plus deterministic blockers only - 3+3 canary: still alive; this does not replace it ## Deterministic findings No deterministic findings. ## Model reviewers ### `global-glm` / `glm-5.1:cloud` - Status: `error` - Verdict: `-` - Note: ReadTimeout: The read operation timed out - Findings: none ### `global-deepseek` / `deepseek-v4-pro:cloud` - Status: `ok` - Verdict: `OK` - **`low`** Readiness script may miss future Vault dependency env names - Evidence: `scripts/cutover/vault-sunset-readiness.sh: is_hashicorp_vault_env_name() lists a fixed set of known HashiCorp Vault env names. If a new dependency uses an unlisted name (e.g., a custom or future Vault env var), it would be classified as inv` - Next: Document the list as exhaustive for the current Vault integration and add a periodic review step in the runbook to update the list if Vault usage evolves. ### `redteam` / `kimi-k2.6:cloud` - Status: `error` - Verdict: `-` - Note: ReadTimeout: The read operation timed out - Findings: none ## Policy notes - GLM 5.1 + DeepSeek V4 Pro are the operator-required model mix for this bot. - Optional red-team model is enabled only when `PLATFORMCTL_PR_SANITY_REDTEAM_MODEL` is configured. - Auto-merge is not enabled here.
codex force-pushed codex/m04-safe-session-vault-dependency-cleanup from 75819b0e80
Some checks failed
base-is-main / guard (pull_request) Successful in 1s
canary-required / collect-diff (pull_request) Successful in 5s
patchwarden-client-dry-run / collect-diff (pull_request) Successful in 5s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 4s
platformctl plan / auto-apply scope (pull_request) Successful in 23s
python-ci / Python 3.11 (pull_request) Successful in 43s
patchwarden-pr-sanity / sanity (pull_request) Failing after 3m43s
python-ci / Python 3.12 (pull_request) Successful in 43s
python-ci / Python 3.13 (pull_request) Successful in 43s
canary-required / canary (pull_request) Successful in 13s
patchwarden-client-dry-run / dry-run (pull_request) Successful in 22s
to fc45324172
All checks were successful
canary-required / collect-diff (pull_request) Successful in 5s
patchwarden-client-dry-run / collect-diff (pull_request) Successful in 5s
platformctl plan / auto-apply scope (pull_request) Successful in 24s
python-ci / Python 3.11 (pull_request) Successful in 40s
python-ci / Python 3.12 (pull_request) Successful in 44s
canary-required / canary (pull_request) Successful in 15s
patchwarden-client-dry-run / dry-run (pull_request) Successful in 21s
patchwarden-pr-sanity / sanity (pull_request) Successful in 6m50s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 4s
python-ci / Python 3.13 (pull_request) Successful in 43s
base-is-main / guard (pull_request) Successful in 1s
2026-05-30 02:11:38 +02:00
Compare
pdurlej approved these changes 2026-05-30 02:33:38 +02:00
pdurlej left a comment

Operator-delegated approval via temporary admin PAT. Checks are green after rebase; DeepSeek V4 Pro red-team was GO; runtime cutover was already smoked on RS2000; this PR removes safe-session's Vault runtime dependency and does not quarantine or destroy Vault.

Operator-delegated approval via temporary admin PAT. Checks are green after rebase; DeepSeek V4 Pro red-team was GO; runtime cutover was already smoked on RS2000; this PR removes safe-session's Vault runtime dependency and does not quarantine or destroy Vault.
Sign in to join this conversation.
No reviewers
No labels
W6d-automerge-calibration
agent/claude-code
agent/codex
agent/hermes
agent/iskra
agent/ollama
agent/patchwarden
automerge-candidate
class/security-sensitive
cutover-gate
dependency/blocked
dependency/blocks-others
dependency/cross-repo
dependency/needs-confirmation
domain:agents
domain:ci
domain:docs
domain:forgejo
domain:infra
domain:memory
domain:runtime
domain:signal
domain:ux
flow/architecture
flow/blocked
flow/deployed
flow/done
flow/implementation
flow/intake
flow/maintained
flow/observed
flow/ready
flow/refining
flow/retired
flow/review
iterating
judge/codex-candidate
judge/hermes-candidate
judge/low-confidence
judge/needs-refinement
judge/operator-needed
judge/p0
judge/p1
judge/p2
judge/p3
judge/park
judge/patchwarden-candidate
judge/stale-priority
kind/adr
kind/bug
kind/chore
kind/feature
kind/infra
kind/ops
kind/refactor
kind/research
large-impact
merge/auto
merge/manual
merge/manual-dependency-conflict
merge/manual-failing-tests
merge/manual-merge-conflict
merge/manual-missing-review
merge/manual-operator-preference
merge/manual-red-zone
merge/manual-security-sensitive
merge/manual-unclear-scope
merge/manual-unknown
meta
mode:operator-only
mode:patchwarden-iskra-approved
mode:safe-auto
needs-operator-decision
needs-triage
not-ready
observed/erroring
observed/needs-followup
observed/pending
observed/retire-candidate
observed/unused
observed/used
operator-emotional
owner-attention
phase/02
phase/03
priority:p0
priority:p1
priority:p2
priority:p3
proposed
ready-for-agent
ready-for-operator
recovery
review:claude-reviewed
review:codex-reviewed
review:dziadek-reviewed
review:needs-human
risk/exposure
risk/process
risk/product
risk/runtime
safety:external-write
safety:no-prod-mutation
safety:prod-impact
safety:secret-touch
size/large
size/medium
size/small
size/tiny
size/unknown
source/adr
source/agent-generated
source/manual
source/operator-chat
source/voice-note
status:blocked
status:codex-ready
status:merged:pending-evidence
status:needs-evidence
status:operator-needed
status:parked
tier/full
tier/lite
tier/stacked
tier:0-platform-substrate
tier:1-iskra-value-layer
tier:2-tools-products-modules
type:bug
type:chore
type:docs
type:feat
type:policy
type:research
No milestone
No project
No assignees
2 participants
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
pdurlej/platform!619
No description provided.