feat(vault): add safe-session local CA signer #613

Merged
pdurlej merged 1 commit from codex/m04-safe-session-local-signer into main 2026-05-29 22:02:18 +02:00
Collaborator

Canary status: pending — Forgejo Actions will run on PR creation

Summary

Adds the first code path needed to retire HashiCorp Vault from safe-session-api: a local OpenSSH CA signer behind SAFE_SESSION_SIGNER_MODE=local-openssh-ca. The default remains SAFE_SESSION_SIGNER_MODE=vault, so this PR does not cut over runtime signing.

Canary Context Pack

Product story

Vault inventory showed Vault is not a canonical platform secret backend anymore; Infisical is. The only active Vault use is safe-session SSH certificate signing. This PR makes that dependency removable without changing runtime behavior yet.

What changed

  • Imported the live safe-session-api source that compose already points to under scripts/safe-session-api/.
  • Added local_signer.py, a stdlib-only OpenSSH CA signer wrapper around ssh-keygen.
  • Added signer mode configuration in compose with vault as the default.
  • Added audit metadata in session artifacts and stdout JSON for signing success/failure.
  • Added tests for local signing, no default SSH cert extensions, principal rejection, CA key mode rejection, and TTL policy.

Why it changed

This is the implementation bridge before the runtime cutover PR. It lets the next PR mount Infisical-delivered CA material and switch only safe-session-api, while preserving a one-line rollback to Vault mode.

Files touched

  • compose/core/compose.yaml
  • modules/safe-session-api/module.yaml
  • modules/safe-session-api/runbook.md
  • scripts/safe-session-api/Dockerfile
  • scripts/safe-session-api/local_signer.py
  • scripts/safe-session-api/server.py
  • scripts/safe-session-api/tests/test_local_signer.py

Relevant context

  • decisions/0024-infisical-primary-secrets-pipeline.md
  • migrations/vault-to-infisical.md
  • docs/specs/vault-to-infisical-migration-v0/02-plan-and-tasks.md
  • Observed Vault SSH role: allowed_users=llmops, no extensions, no critical options, ttl 1800, max_ttl 3600, not_before 30s.

Runtime evidence

No runtime mutation in this PR. Local validation:

  • python3 -m pytest scripts/safe-session-api/tests/test_local_signer.py: 5 passed
  • python3 -m py_compile scripts/safe-session-api/local_signer.py scripts/safe-session-api/server.py: pass
  • platformctl validate all --json: pass (exitCode=0, 88 modules)

Known constraints

  • Default runtime behavior remains Vault signer.
  • Local signer default preserves the observed Vault role default: llmops only.
  • cloud-note currently maps to llmnote; local-mode cutover must either keep cutover scoped to internal-ops or explicitly change principal policy in a separate reviewed PR.
  • CA key material is not added to repo and is not created by this PR.

Explicit out-of-scope

  • No runtime cutover.
  • No Vault stop/delete.
  • No Infisical writes.
  • No issue/comment mutation.
  • No auto-merge changes.
  • No widening of allowed SSH principals.

Requested decision

Approve the code path as safe to merge before a separate cutover PR.

Merge blockers

  • Any finding that local mode leaks CA key material.
  • Any finding that local mode emits SSH cert extensions/critical options by default.
  • Any finding that default runtime behavior changes away from Vault.

DeepSeek v4 Pro redteam

DeepSeek v4 Pro returned APPROVE. It specifically reviewed CA key handling, policy parity, TTL/not_before behavior, -O clear extension stripping, audit/rollback, input safety, and accidental runtime mutation. No blockers were reported.

Spec sources read

  • docs/forgejo-agent-operations.md — Forgejo write identity contract.
  • compose/core/compose.yaml — live service declaration.
  • modules/safe-session-api/module.yaml — module contract.
  • modules/safe-session-api/runbook.md — operator runbook.
  • decisions/0024-infisical-primary-secrets-pipeline.md — Infisical/Vault decision.
  • migrations/vault-to-infisical.md — sunset plan.
  • Runtime source at /opt/vps-home-platform-infra/scripts/safe-session-api/ on RS2000 — imported as the canonical code compose already referenced.

Refs #64.
Refs #609.

Canary status: pending — Forgejo Actions will run on PR creation ## Summary Adds the first code path needed to retire HashiCorp Vault from `safe-session-api`: a local OpenSSH CA signer behind `SAFE_SESSION_SIGNER_MODE=local-openssh-ca`. The default remains `SAFE_SESSION_SIGNER_MODE=vault`, so this PR does not cut over runtime signing. ## Canary Context Pack ### Product story Vault inventory showed Vault is not a canonical platform secret backend anymore; Infisical is. The only active Vault use is safe-session SSH certificate signing. This PR makes that dependency removable without changing runtime behavior yet. ### What changed - Imported the live `safe-session-api` source that compose already points to under `scripts/safe-session-api/`. - Added `local_signer.py`, a stdlib-only OpenSSH CA signer wrapper around `ssh-keygen`. - Added signer mode configuration in compose with `vault` as the default. - Added audit metadata in session artifacts and stdout JSON for signing success/failure. - Added tests for local signing, no default SSH cert extensions, principal rejection, CA key mode rejection, and TTL policy. ### Why it changed This is the implementation bridge before the runtime cutover PR. It lets the next PR mount Infisical-delivered CA material and switch only `safe-session-api`, while preserving a one-line rollback to Vault mode. ### Files touched - `compose/core/compose.yaml` - `modules/safe-session-api/module.yaml` - `modules/safe-session-api/runbook.md` - `scripts/safe-session-api/Dockerfile` - `scripts/safe-session-api/local_signer.py` - `scripts/safe-session-api/server.py` - `scripts/safe-session-api/tests/test_local_signer.py` ### Relevant context - `decisions/0024-infisical-primary-secrets-pipeline.md` - `migrations/vault-to-infisical.md` - `docs/specs/vault-to-infisical-migration-v0/02-plan-and-tasks.md` - Observed Vault SSH role: `allowed_users=llmops`, no extensions, no critical options, ttl 1800, max_ttl 3600, not_before 30s. ### Runtime evidence No runtime mutation in this PR. Local validation: - `python3 -m pytest scripts/safe-session-api/tests/test_local_signer.py`: 5 passed - `python3 -m py_compile scripts/safe-session-api/local_signer.py scripts/safe-session-api/server.py`: pass - `platformctl validate all --json`: pass (`exitCode=0`, 88 modules) ### Known constraints - Default runtime behavior remains Vault signer. - Local signer default preserves the observed Vault role default: `llmops` only. - `cloud-note` currently maps to `llmnote`; local-mode cutover must either keep cutover scoped to `internal-ops` or explicitly change principal policy in a separate reviewed PR. - CA key material is not added to repo and is not created by this PR. ### Explicit out-of-scope - No runtime cutover. - No Vault stop/delete. - No Infisical writes. - No issue/comment mutation. - No auto-merge changes. - No widening of allowed SSH principals. ### Requested decision Approve the code path as safe to merge before a separate cutover PR. ### Merge blockers - Any finding that local mode leaks CA key material. - Any finding that local mode emits SSH cert extensions/critical options by default. - Any finding that default runtime behavior changes away from Vault. ## DeepSeek v4 Pro redteam DeepSeek v4 Pro returned **APPROVE**. It specifically reviewed CA key handling, policy parity, TTL/not_before behavior, `-O clear` extension stripping, audit/rollback, input safety, and accidental runtime mutation. No blockers were reported. ## Spec sources read - `docs/forgejo-agent-operations.md` — Forgejo write identity contract. - `compose/core/compose.yaml` — live service declaration. - `modules/safe-session-api/module.yaml` — module contract. - `modules/safe-session-api/runbook.md` — operator runbook. - `decisions/0024-infisical-primary-secrets-pipeline.md` — Infisical/Vault decision. - `migrations/vault-to-infisical.md` — sunset plan. - Runtime source at `/opt/vps-home-platform-infra/scripts/safe-session-api/` on RS2000 — imported as the canonical code compose already referenced. Refs #64. Refs #609.
feat(vault): add safe-session local CA signer
Some checks failed
canary-required / collect-diff (pull_request) Successful in 5s
patchwarden-client-dry-run / collect-diff (pull_request) Successful in 5s
platformctl plan / auto-apply scope (pull_request) Successful in 23s
canary-required / canary (pull_request) Successful in 14s
patchwarden-client-dry-run / dry-run (pull_request) Successful in 22s
base-is-main / guard (pull_request) Successful in 1s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 4s
patchwarden-pr-sanity / sanity (pull_request) Failing after 24s
3e24fad247
codex force-pushed codex/m04-safe-session-local-signer from 3e24fad247
Some checks failed
canary-required / collect-diff (pull_request) Successful in 5s
patchwarden-client-dry-run / collect-diff (pull_request) Successful in 5s
platformctl plan / auto-apply scope (pull_request) Successful in 23s
canary-required / canary (pull_request) Successful in 14s
patchwarden-client-dry-run / dry-run (pull_request) Successful in 22s
base-is-main / guard (pull_request) Successful in 1s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 4s
patchwarden-pr-sanity / sanity (pull_request) Failing after 24s
to 793816b55e
Some checks failed
base-is-main / guard (pull_request) Successful in 1s
canary-required / collect-diff (pull_request) Successful in 5s
patchwarden-client-dry-run / collect-diff (pull_request) Successful in 5s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 5s
platformctl plan / auto-apply scope (pull_request) Successful in 23s
canary-required / canary (pull_request) Successful in 15s
patchwarden-client-dry-run / dry-run (pull_request) Successful in 24s
patchwarden-pr-sanity / sanity (pull_request) Failing after 24s
2026-05-29 21:19:17 +02:00
Compare
codex force-pushed codex/m04-safe-session-local-signer from 793816b55e
Some checks failed
base-is-main / guard (pull_request) Successful in 1s
canary-required / collect-diff (pull_request) Successful in 5s
patchwarden-client-dry-run / collect-diff (pull_request) Successful in 5s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 5s
platformctl plan / auto-apply scope (pull_request) Successful in 23s
canary-required / canary (pull_request) Successful in 15s
patchwarden-client-dry-run / dry-run (pull_request) Successful in 24s
patchwarden-pr-sanity / sanity (pull_request) Failing after 24s
to bb815a39c4
All checks were successful
base-is-main / guard (pull_request) Successful in 1s
canary-required / collect-diff (pull_request) Successful in 5s
patchwarden-client-dry-run / collect-diff (pull_request) Successful in 4s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 5s
platformctl plan / auto-apply scope (pull_request) Successful in 22s
canary-required / canary (pull_request) Successful in 13s
patchwarden-client-dry-run / dry-run (pull_request) Successful in 21s
patchwarden-pr-sanity / sanity (pull_request) Successful in 23s
2026-05-29 21:44:52 +02:00
Compare
Author
Collaborator

Patchwarden PR sanity

  • Status: eligible_sanity_clean
  • PR: 613
  • Commit: 84aad4d023b4eb462047f38462dceffa4d60686d
  • Security-sensitive label: present
  • Authority: advisory model review plus deterministic blockers only
  • 3+3 canary: still alive; this does not replace it

Deterministic findings

No deterministic findings.

Model reviewers

global-glm / glm-5.1:cloud

  • Status: skipped
  • Verdict: -
  • Note: Diff exceeds PR sanity model-review budget; deterministic report only.
  • Findings: none

global-deepseek / deepseek-v4-pro:cloud

  • Status: skipped
  • Verdict: -
  • Note: Diff exceeds PR sanity model-review budget; deterministic report only.
  • Findings: none

redteam / kimi-k2.6:cloud

  • Status: skipped
  • Verdict: -
  • Note: Diff exceeds PR sanity model-review budget; deterministic report only.
  • Findings: none

Policy notes

  • GLM 5.1 + DeepSeek V4 Pro are the operator-required model mix for this bot.
  • Optional red-team model is enabled only when PLATFORMCTL_PR_SANITY_REDTEAM_MODEL is configured.
  • Auto-merge is not enabled here.
<!-- patchwarden-pr-sanity:pdurlej/platform:PR-613 --> # Patchwarden PR sanity - Status: `eligible_sanity_clean` - PR: `613` - Commit: `84aad4d023b4eb462047f38462dceffa4d60686d` - Security-sensitive label: `present` - Authority: advisory model review plus deterministic blockers only - 3+3 canary: still alive; this does not replace it ## Deterministic findings No deterministic findings. ## Model reviewers ### `global-glm` / `glm-5.1:cloud` - Status: `skipped` - Verdict: `-` - Note: Diff exceeds PR sanity model-review budget; deterministic report only. - Findings: none ### `global-deepseek` / `deepseek-v4-pro:cloud` - Status: `skipped` - Verdict: `-` - Note: Diff exceeds PR sanity model-review budget; deterministic report only. - Findings: none ### `redteam` / `kimi-k2.6:cloud` - Status: `skipped` - Verdict: `-` - Note: Diff exceeds PR sanity model-review budget; deterministic report only. - Findings: none ## Policy notes - GLM 5.1 + DeepSeek V4 Pro are the operator-required model mix for this bot. - Optional red-team model is enabled only when `PLATFORMCTL_PR_SANITY_REDTEAM_MODEL` is configured. - Auto-merge is not enabled here.
codex force-pushed codex/m04-safe-session-local-signer from bb815a39c4
All checks were successful
base-is-main / guard (pull_request) Successful in 1s
canary-required / collect-diff (pull_request) Successful in 5s
patchwarden-client-dry-run / collect-diff (pull_request) Successful in 4s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 5s
platformctl plan / auto-apply scope (pull_request) Successful in 22s
canary-required / canary (pull_request) Successful in 13s
patchwarden-client-dry-run / dry-run (pull_request) Successful in 21s
patchwarden-pr-sanity / sanity (pull_request) Successful in 23s
to 84aad4d023
All checks were successful
base-is-main / guard (pull_request) Successful in 2s
canary-required / collect-diff (pull_request) Successful in 5s
patchwarden-client-dry-run / collect-diff (pull_request) Successful in 5s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 5s
platformctl plan / auto-apply scope (pull_request) Successful in 25s
canary-required / canary (pull_request) Successful in 15s
patchwarden-client-dry-run / dry-run (pull_request) Successful in 24s
patchwarden-pr-sanity / sanity (pull_request) Successful in 24s
2026-05-29 21:55:47 +02:00
Compare
Sign in to join this conversation.
No reviewers
No labels
W6d-automerge-calibration
agent/claude-code
agent/codex
agent/hermes
agent/iskra
agent/ollama
agent/patchwarden
automerge-candidate
class/security-sensitive
cutover-gate
dependency/blocked
dependency/blocks-others
dependency/cross-repo
dependency/needs-confirmation
domain:agents
domain:ci
domain:docs
domain:forgejo
domain:infra
domain:memory
domain:runtime
domain:signal
domain:ux
flow/architecture
flow/blocked
flow/deployed
flow/done
flow/implementation
flow/intake
flow/maintained
flow/observed
flow/ready
flow/refining
flow/retired
flow/review
iterating
judge/codex-candidate
judge/hermes-candidate
judge/low-confidence
judge/needs-refinement
judge/operator-needed
judge/p0
judge/p1
judge/p2
judge/p3
judge/park
judge/patchwarden-candidate
judge/stale-priority
kind/adr
kind/bug
kind/chore
kind/feature
kind/infra
kind/ops
kind/refactor
kind/research
large-impact
merge/auto
merge/manual
merge/manual-dependency-conflict
merge/manual-failing-tests
merge/manual-merge-conflict
merge/manual-missing-review
merge/manual-operator-preference
merge/manual-red-zone
merge/manual-security-sensitive
merge/manual-unclear-scope
merge/manual-unknown
meta
mode:operator-only
mode:patchwarden-iskra-approved
mode:safe-auto
needs-operator-decision
needs-triage
not-ready
observed/erroring
observed/needs-followup
observed/pending
observed/retire-candidate
observed/unused
observed/used
operator-emotional
owner-attention
phase/02
phase/03
priority:p0
priority:p1
priority:p2
priority:p3
proposed
ready-for-agent
ready-for-operator
recovery
review:claude-reviewed
review:codex-reviewed
review:dziadek-reviewed
review:needs-human
risk/exposure
risk/process
risk/product
risk/runtime
safety:external-write
safety:no-prod-mutation
safety:prod-impact
safety:secret-touch
size/large
size/medium
size/small
size/tiny
size/unknown
source/adr
source/agent-generated
source/manual
source/operator-chat
source/voice-note
status:blocked
status:codex-ready
status:merged:pending-evidence
status:needs-evidence
status:operator-needed
status:parked
tier/full
tier/lite
tier/stacked
tier:0-platform-substrate
tier:1-iskra-value-layer
tier:2-tools-products-modules
type:bug
type:chore
type:docs
type:feat
type:policy
type:research
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
pdurlej/platform!613
No description provided.