docs(vault): add safe-session CA dual-trust bootstrap #618
No reviewers
Labels
No labels
W6d-automerge-calibration
agent/claude-code
agent/codex
agent/hermes
agent/iskra
agent/ollama
agent/patchwarden
automerge-candidate
class/security-sensitive
cutover-gate
dependency/blocked
dependency/blocks-others
dependency/cross-repo
dependency/needs-confirmation
domain:agents
domain:ci
domain:docs
domain:forgejo
domain:infra
domain:memory
domain:runtime
domain:signal
domain:ux
flow/architecture
flow/blocked
flow/deployed
flow/done
flow/implementation
flow/intake
flow/maintained
flow/observed
flow/ready
flow/refining
flow/retired
flow/review
iterating
judge/codex-candidate
judge/hermes-candidate
judge/low-confidence
judge/needs-refinement
judge/operator-needed
judge/p0
judge/p1
judge/p2
judge/p3
judge/park
judge/patchwarden-candidate
judge/stale-priority
kind/adr
kind/bug
kind/chore
kind/feature
kind/infra
kind/ops
kind/refactor
kind/research
large-impact
merge/auto
merge/manual
merge/manual-dependency-conflict
merge/manual-failing-tests
merge/manual-merge-conflict
merge/manual-missing-review
merge/manual-operator-preference
merge/manual-red-zone
merge/manual-security-sensitive
merge/manual-unclear-scope
merge/manual-unknown
meta
mode:operator-only
mode:patchwarden-iskra-approved
mode:safe-auto
needs-operator-decision
needs-triage
not-ready
observed/erroring
observed/needs-followup
observed/pending
observed/retire-candidate
observed/unused
observed/used
operator-emotional
owner-attention
phase/02
phase/03
priority:p0
priority:p1
priority:p2
priority:p3
proposed
ready-for-agent
ready-for-operator
recovery
review:claude-reviewed
review:codex-reviewed
review:dziadek-reviewed
review:needs-human
risk/exposure
risk/process
risk/product
risk/runtime
safety:external-write
safety:no-prod-mutation
safety:prod-impact
safety:secret-touch
size/large
size/medium
size/small
size/tiny
size/unknown
source/adr
source/agent-generated
source/manual
source/operator-chat
source/voice-note
status:blocked
status:codex-ready
status:merged:pending-evidence
status:needs-evidence
status:operator-needed
status:parked
tier/full
tier/lite
tier/stacked
tier:0-platform-substrate
tier:1-iskra-value-layer
tier:2-tools-products-modules
type:bug
type:chore
type:docs
type:feat
type:policy
type:research
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
pdurlej/platform!618
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "codex/m04-safe-session-ca-dual-trust-bootstrap"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Canary status: local preflight green; Forgejo checks pending
Summary
This PR adds the missing safe-session CA bootstrap/trust handoff gate before any live Vault sunset cutover.
DeepSeek V4 Pro redteam returned HOLD on doing live CA handoff immediately because replacing
/etc/ssh/trusted-user-ca-keys.pemwould invalidate outstanding Vault-signed certificates. This PR implements the recommended safe path: a temporary dual-trust window.What changed
scripts/cutover/safe-session-ca-dual-trust-bootstrap.sh.runbooks/safe-session-local-ca-cutover.mdwith the CA bootstrap and dual-trust handoff procedure.runbooks/vault-quarantine-and-sunset.mdpreconditions so Vault quarantine requires dual-trust handoff and local signer cutover evidence first.scripts/cutover/README.md.Safety model
Default mode is read-only:
The write path requires an explicit separate gate:
The script appends the new public CA to
TrustedUserCAKeys; it does not replace the old Vault CA in one step. Vault-signed certificates remain valid during the dual-trust window.Validation
bash -n scripts/cutover/safe-session-ca-dual-trust-bootstrap.shbash -n scripts/cutover/safe-session-local-ca-preflight.shbash -n scripts/cutover/vault-sunset-readiness.shUV_CACHE_DIR=/private/tmp/codex-uv-cache PYTHONPATH=control-plane uv run --project control-plane pytest tests/test_safe_session_ca_bootstrap.py -q— 5 passed.UV_CACHE_DIR=/private/tmp/codex-uv-cache PYTHONPATH=control-plane uv run --project control-plane python -m platformctl.cli validate all --json— exitCode 0.DeepSeek V4 Pro redteam
Verdict before this PR: HOLD.
Blockers were:
This PR addresses the first two as repo/runbook/script gates. It does not execute bootstrap or runtime cutover.
Non-goals
Patchwarden PR sanity
advisory_findings6186284ee06388e67924a3e8a1173e87ef02e1527f7presentDeterministic findings
No deterministic findings.
Model reviewers
global-glm/glm-5.1:cloudokOKglobal-deepseek/deepseek-v4-pro:cloudStatus:
okVerdict:
NOT_OKhighDual-trust append deduplication check is broken due to newline in patternIn scripts/cutover/safe-session-ca-dual-trust-bootstrap.sh, the lineif grep -Fqx "$(cat "$ca_pub_file")" "$trusted_ca_file"; thenusescatwhich includes a trailing newline, causinggrep -xto never match a line without a newline. Thpub_key_content=$(cat "$ca_pub_file"); pub_key_content=${pub_key_content%$'\n'}and then use that variable.mediumNo rollback of TrustedUserCAKeys if sshd -t fails after appendThe script appends to the trusted CA file, then runssshd -t. Ifsshd -tfails, the script exits due toset -e, but the modified file remains, potentially breaking SSH. The backup is created but not automatically restored.sshd -tfails, or at least print a warning to manually restore.lowInfisical write failure may be silentTheinfisical secrets setcommand uses--silentand>/dev/null, so if it fails, the error message is suppressed. The script relies onset -eto catch non-zero exit, but the operator won't see the error details.redteam/kimi-k2.6:cloudStatus:
okVerdict:
NOT_OKhighSSH trust mutated before validation; no rollback on sshd -t failurescripts/cutover/safe-session-ca-dual-trust-bootstrap.sh:cat "$ca_pub_file" >> "$trusted_ca_file"precedessshd -t; withset -e, a validation failure aborts without restoring the$backupcopy, leaving a potentially broken CA file onsshd -t(or via a temporary sshd_config pointing at the staging file), and only atomically replace the live file after validation passes; add an ERR trap to restore$backupon any failure after mutation.Policy notes
PLATFORMCTL_PR_SANITY_REDTEAM_MODELis configured.