test(platformctl): parse apply workflow contract structurally #672

Merged
pdurlej merged 1 commit from codex/209-workflow-contract-structure into main 2026-06-01 17:08:43 +02:00
Collaborator

Canary status: missing — fire canary 3+3 manually before merge

Canary Context Pack

Product story

Forgejo workflow contract tests guard the apply lane. They should fail because a workflow contract changed, not because a brittle whole-file substring happened to move.

What changed

  • Added structured YAML helpers for loading the apply workflow, locating jobs, locating steps by name, collecting artifact paths, and scanning structured string values.
  • Refactored apply workflow contract tests to assert structured fields for job config, step names/ids, env vars, action versions, and artifact paths.
  • Kept substring assertions only inside shell run: snippets where the contract is inherently shell text.

Why it changed

M06 apply-hardening issue #209 asks to replace fragile text/regex assumptions with structural YAML assertions while preserving contract coverage strength.

Files touched

  • control-plane/platformctl/tests/test_forgejo_ci_scripts_contract.py

Relevant context

  • Issue #209: parse Forgejo workflow contract structurally.
  • control-plane/forgejo-actions/apply.yaml remains unchanged and is treated as the contract under test.

Runtime evidence

No runtime action, no workflow mutation, no SSH, no DB, no service restart. This PR changes tests only.

Known constraints

Shell bodies still require substring assertions for snippets such as trap handling, output writes, and jq field checks. Those assertions now target the specific step's run: field instead of the entire workflow file.

Explicit out-of-scope

  • No Forgejo workflow behavior changes.
  • No apply runtime changes.
  • No break-glass or actor binding changes.
  • No changes to unrelated Infisical inventory files currently dirty in the worktree.

Requested decision

Review and merge if checks stay green.

Merge blockers

  • Loss of apply workflow contract coverage.
  • Any structural helper masking a missing step/action/env/artifact path.

Spec sources read

  • Forgejo issue #209 — acceptance criteria for structural workflow contract parsing.
  • control-plane/platformctl/tests/test_forgejo_ci_scripts_contract.py — existing contract tests and refactor scope.
  • control-plane/forgejo-actions/apply.yaml — workflow contract under test.

Validation

  • UV_CACHE_DIR=/private/tmp/codex-uv-cache uv run pytest platformctl/tests/test_forgejo_ci_scripts_contract.py — 42 passed
  • UV_CACHE_DIR=/private/tmp/codex-uv-cache uv run pytest platformctl/tests/test_apply_phase3.py platformctl/tests/test_apply_env_file.py platformctl/tests/test_pr_sanity.py platformctl/tests/test_forgejo_ci_scripts_contract.py platformctl/tests/test_health_phase3.py — 208 passed
  • PYTHONPATH=control-plane UV_CACHE_DIR=/private/tmp/codex-uv-cache uv run --project control-plane python -m platformctl.cli validate all --json — passed, exitCode=0

Closes #209

Canary status: missing — fire canary 3+3 manually before merge ## Canary Context Pack ### Product story Forgejo workflow contract tests guard the apply lane. They should fail because a workflow contract changed, not because a brittle whole-file substring happened to move. ### What changed - Added structured YAML helpers for loading the apply workflow, locating jobs, locating steps by name, collecting artifact paths, and scanning structured string values. - Refactored apply workflow contract tests to assert structured fields for job config, step names/ids, env vars, action versions, and artifact paths. - Kept substring assertions only inside shell `run:` snippets where the contract is inherently shell text. ### Why it changed M06 apply-hardening issue #209 asks to replace fragile text/regex assumptions with structural YAML assertions while preserving contract coverage strength. ### Files touched - `control-plane/platformctl/tests/test_forgejo_ci_scripts_contract.py` ### Relevant context - Issue #209: parse Forgejo workflow contract structurally. - `control-plane/forgejo-actions/apply.yaml` remains unchanged and is treated as the contract under test. ### Runtime evidence No runtime action, no workflow mutation, no SSH, no DB, no service restart. This PR changes tests only. ### Known constraints Shell bodies still require substring assertions for snippets such as trap handling, output writes, and jq field checks. Those assertions now target the specific step's `run:` field instead of the entire workflow file. ### Explicit out-of-scope - No Forgejo workflow behavior changes. - No apply runtime changes. - No break-glass or actor binding changes. - No changes to unrelated Infisical inventory files currently dirty in the worktree. ### Requested decision Review and merge if checks stay green. ### Merge blockers - Loss of apply workflow contract coverage. - Any structural helper masking a missing step/action/env/artifact path. ## Spec sources read - Forgejo issue #209 — acceptance criteria for structural workflow contract parsing. - `control-plane/platformctl/tests/test_forgejo_ci_scripts_contract.py` — existing contract tests and refactor scope. - `control-plane/forgejo-actions/apply.yaml` — workflow contract under test. ## Validation - `UV_CACHE_DIR=/private/tmp/codex-uv-cache uv run pytest platformctl/tests/test_forgejo_ci_scripts_contract.py` — 42 passed - `UV_CACHE_DIR=/private/tmp/codex-uv-cache uv run pytest platformctl/tests/test_apply_phase3.py platformctl/tests/test_apply_env_file.py platformctl/tests/test_pr_sanity.py platformctl/tests/test_forgejo_ci_scripts_contract.py platformctl/tests/test_health_phase3.py` — 208 passed - `PYTHONPATH=control-plane UV_CACHE_DIR=/private/tmp/codex-uv-cache uv run --project control-plane python -m platformctl.cli validate all --json` — passed, `exitCode=0` Closes #209
test(platformctl): parse apply workflow contract structurally
All checks were successful
canary-required / collect-diff (pull_request) Successful in 4s
python-ci / Python 3.11 (pull_request) Successful in 36s
python-ci / Python 3.12 (pull_request) Successful in 36s
canary-required / canary (pull_request) Successful in 11s
platformctl plan / auto-apply scope (pull_request) Successful in 17s
pyfallow / Pyfallow gate (control-plane) (pull_request) Successful in 16s
python-ci / Python 3.13 (pull_request) Successful in 36s
base-is-main / guard (pull_request) Successful in 1s
patchwarden-client-dry-run / collect-diff (pull_request) Successful in 3s
patchwarden-client-dry-run / dry-run (pull_request) Successful in 17s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 3s
patchwarden-pr-sanity / sanity (pull_request) Successful in 3m37s
ea85acc32d
Author
Collaborator

Patchwarden PR sanity

  • Status: advisory_findings
  • PR: 672
  • Commit: ea85acc32d257d04335e2a871166997b6d8964c8
  • Security-sensitive label: present
  • Authority: advisory model review plus deterministic blockers only
  • 3+3 canary: still alive; this does not replace it

Deterministic findings

No deterministic findings.

Model reviewers

global-glm / glm-5.1:cloud

  • Status: ok
  • Verdict: OK
  • Findings: none

global-deepseek / deepseek-v4-pro:cloud

  • Status: ok
  • Verdict: OK
  • Findings: none

redteam / kimi-k2.6:cloud

  • Status: ok

  • Verdict: NOT_OK

  • high steps_by_name silently shadows duplicate step names, weakening contract assertions

    • Evidence: control-plane/platformctl/tests/test_forgejo_ci_scripts_contract.py: steps_by_name() builds a dict comprehension keyed by step name, so duplicate names overwrite earlier entries. Previously the tests used next(step for step in steps if step
    • Next: Change steps_by_name to return a list of tuples or assert name uniqueness, and provide first_step_named() / all_steps_named() helpers so callers explicitly choose match semantics.

Policy notes

  • GLM 5.1 + DeepSeek V4 Pro are the operator-required model mix for this bot.
  • Optional red-team model is enabled only when PLATFORMCTL_PR_SANITY_REDTEAM_MODEL is configured.
  • Auto-merge is not enabled here.
<!-- patchwarden-pr-sanity:pdurlej/platform:PR-672 --> # Patchwarden PR sanity - Status: `advisory_findings` - PR: `672` - Commit: `ea85acc32d257d04335e2a871166997b6d8964c8` - Security-sensitive label: `present` - Authority: advisory model review plus deterministic blockers only - 3+3 canary: still alive; this does not replace it ## Deterministic findings No deterministic findings. ## Model reviewers ### `global-glm` / `glm-5.1:cloud` - Status: `ok` - Verdict: `OK` - Findings: none ### `global-deepseek` / `deepseek-v4-pro:cloud` - Status: `ok` - Verdict: `OK` - Findings: none ### `redteam` / `kimi-k2.6:cloud` - Status: `ok` - Verdict: `NOT_OK` - **`high`** steps_by_name silently shadows duplicate step names, weakening contract assertions - Evidence: `control-plane/platformctl/tests/test_forgejo_ci_scripts_contract.py: steps_by_name() builds a dict comprehension keyed by step name, so duplicate names overwrite earlier entries. Previously the tests used next(step for step in steps if step` - Next: Change steps_by_name to return a list of tuples or assert name uniqueness, and provide first_step_named() / all_steps_named() helpers so callers explicitly choose match semantics. ## Policy notes - GLM 5.1 + DeepSeek V4 Pro are the operator-required model mix for this bot. - Optional red-team model is enabled only when `PLATFORMCTL_PR_SANITY_REDTEAM_MODEL` is configured. - Auto-merge is not enabled here.
pdurlej deleted branch codex/209-workflow-contract-structure 2026-06-01 17:08:43 +02:00
Sign in to join this conversation.
No reviewers
No labels
W6d-automerge-calibration
agent/claude-code
agent/codex
agent/hermes
agent/iskra
agent/ollama
agent/patchwarden
automerge-candidate
class/security-sensitive
cutover-gate
dependency/blocked
dependency/blocks-others
dependency/cross-repo
dependency/needs-confirmation
domain:agents
domain:ci
domain:docs
domain:forgejo
domain:infra
domain:memory
domain:runtime
domain:signal
domain:ux
flow/architecture
flow/blocked
flow/deployed
flow/done
flow/implementation
flow/intake
flow/maintained
flow/observed
flow/ready
flow/refining
flow/retired
flow/review
iterating
judge/codex-candidate
judge/hermes-candidate
judge/low-confidence
judge/needs-refinement
judge/operator-needed
judge/p0
judge/p1
judge/p2
judge/p3
judge/park
judge/patchwarden-candidate
judge/stale-priority
kind/adr
kind/bug
kind/chore
kind/feature
kind/infra
kind/ops
kind/refactor
kind/research
large-impact
merge/auto
merge/manual
merge/manual-dependency-conflict
merge/manual-failing-tests
merge/manual-merge-conflict
merge/manual-missing-review
merge/manual-operator-preference
merge/manual-red-zone
merge/manual-security-sensitive
merge/manual-unclear-scope
merge/manual-unknown
meta
mode:operator-only
mode:patchwarden-iskra-approved
mode:safe-auto
needs-operator-decision
needs-triage
not-ready
observed/erroring
observed/needs-followup
observed/pending
observed/retire-candidate
observed/unused
observed/used
operator-emotional
owner-attention
phase/02
phase/03
priority:p0
priority:p1
priority:p2
priority:p3
proposed
ready-for-agent
ready-for-operator
recovery
review:claude-reviewed
review:codex-reviewed
review:dziadek-reviewed
review:needs-human
risk/exposure
risk/process
risk/product
risk/runtime
safety:external-write
safety:no-prod-mutation
safety:prod-impact
safety:secret-touch
size/large
size/medium
size/small
size/tiny
size/unknown
source/adr
source/agent-generated
source/manual
source/operator-chat
source/voice-note
status:blocked
status:codex-ready
status:merged:pending-evidence
status:needs-evidence
status:operator-needed
status:parked
tier/full
tier/lite
tier/stacked
tier:0-platform-substrate
tier:1-iskra-value-layer
tier:2-tools-products-modules
type:bug
type:chore
type:docs
type:feat
type:policy
type:research
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
pdurlej/platform!672
No description provided.