test(platformctl): cover plan state matrix #643

Merged
pdurlej merged 1 commit from codex/m06-plan-state-matrix into main 2026-05-30 16:05:02 +02:00
Collaborator

Canary status: missing - fire canary 3+3 manually before merge

Canary Context Pack

Product story

Lock apply-plan state handling into a compact regression matrix so future changes do not silently loosen malformed-plan behavior.

What changed

  • Added a parameterized exitCode x changes matrix that crosses five exitCode values (0, 1, 2, None, and bad) with three changes states (empty, one-change, and missing).
  • Verified accepted states remain noop or dry-run, while malformed states remain fail-closed.
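
A minimal sketch of how such a matrix could be enumerated, for readers without the test file at hand. It is not the code in apply.py or test_apply_phase3.py: classify_plan, build_plan, MISSING, and the plan dict shape are hypothetical stand-ins, and every outcome except "exitCode=0 with missing changes is noop" (quoted in the review below) is an assumption. It uses itertools.product in place of the real parameterized test.

```python
from itertools import product

MISSING = object()  # sentinel: the "changes" key is absent from the plan

EXIT_CODES = [0, 1, 2, None, "bad"]
CHANGES_STATES = {
    "empty": [],
    "one-change": [{"action": "update"}],  # hypothetical change record
    "missing": MISSING,
}


def classify_plan(plan: dict) -> str:
    """Hypothetical stand-in for the apply-plan gate (not apply.py)."""
    exit_code = plan.get("exitCode")
    changes = plan.get("changes", [])  # a missing key reads as "no changes"
    # bool is a subclass of int, so reject it explicitly.
    if isinstance(exit_code, bool) or not isinstance(exit_code, int):
        return "fail-closed"
    if exit_code == 0 and not changes:
        return "noop"
    if exit_code == 2 and changes:  # assumed: 2 signals pending changes
        return "dry-run"
    return "fail-closed"  # everything else is treated as malformed


def build_plan(exit_code, changes_state: str) -> dict:
    """Build a plan dict for one cell of the matrix."""
    plan = {"exitCode": exit_code}
    changes = CHANGES_STATES[changes_state]
    if changes is not MISSING:
        plan["changes"] = list(changes)
    return plan


# One entry per (exitCode, changes state) cell.
MATRIX = {
    (code, state): classify_plan(build_plan(code, state))
    for code, state in product(EXIT_CODES, CHANGES_STATES)
}
```

Pinning the whole grid in one table means a change that loosens any single cell shows up as a changed expectation rather than an untested path.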

Why it changed

Issue #197 asked for deterministic coverage after the explicit missing/null exitCode gate landed in #196.

Files touched

  • control-plane/platformctl/tests/test_apply_phase3.py

Relevant context

  • #197 PLAN-STATE-CROSS-TESTS-01
  • #196 explicit missing/null exitCode handling
  • M06 apply-pipeline hardening sequence

Runtime evidence

No runtime mutation. Test-only change.

Known constraints

The matrix documents existing behavior: when exitCode=0, a missing changes key is treated as no-change input. This PR does not introduce new plan schema enforcement.

Explicit out-of-scope

  • No production code changes.
  • No runtime apply.
  • No new telemetry or plan schema changes.

Requested decision

Approve and merge if checks stay green.

Merge blockers

  • Tests fail.
  • Matrix exposes nondeterministic plan state handling.
  • Patchwarden flags a safety regression.

Spec sources read

  • control-plane/platformctl/apply.py - apply-plan state behavior being covered.
  • control-plane/platformctl/tests/test_apply_phase3.py - existing apply-plan tests and helpers.
  • Forgejo issue #197 - acceptance criteria.

Validation

  • PYTHONPATH=control-plane control-plane/.venv/bin/python -m pytest control-plane/platformctl/tests/test_apply_phase3.py control-plane/platformctl/tests/test_apply_env_file.py - 99 passed
  • PYTHONPATH=control-plane control-plane/.venv/bin/python -m platformctl.cli validate all --json - pass

Closes #197

test(platformctl): cover plan state matrix
All checks were successful
canary-required / collect-diff (pull_request) Successful in 6s
patchwarden-client-dry-run / collect-diff (pull_request) Successful in 5s
python-ci / Python 3.12 (pull_request) Successful in 45s
python-ci / Python 3.13 (pull_request) Successful in 46s
base-is-main / guard (pull_request) Successful in 1s
platformctl plan / auto-apply scope (pull_request) Successful in 24s
pyfallow / Pyfallow gate (control-plane) (pull_request) Successful in 23s
python-ci / Python 3.11 (pull_request) Successful in 46s
canary-required / canary (pull_request) Successful in 15s
patchwarden-client-dry-run / dry-run (pull_request) Successful in 26s
patchwarden-pr-sanity / sanity (pull_request) Successful in 1m21s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 5s
4b8ea5b17d
Author
Collaborator

Patchwarden PR sanity

  • Status: eligible_sanity_clean
  • PR: 643
  • Commit: 4b8ea5b17d92ac146c0c3af4a5ebfa3e7210646f
  • Security-sensitive label: present
  • Authority: advisory model review plus deterministic blockers only
  • 3+3 canary: still alive; this does not replace it

Deterministic findings

No deterministic findings.

Model reviewers

global-glm / glm-5.1:cloud

  • Status: ok

  • Verdict: OK

  • medium: Missing changes key with exitCode=0 is treated as noop, which documents a potential silent-accept gap

    • Evidence: control-plane/platformctl/tests/test_apply_phase3.py line 770: (0, "missing", "noop", EXIT_NO_CHANGES, None) - when exitCode=0 and changes key is absent, plan is accepted as noop with no error
    • Next: Consider a follow-up PR to enforce that exitCode=0 plans must have an explicit empty changes array (not missing key), or add schema validation that rejects plans with missing required fields. This test correctly documents current behavior but the behavior itself may warrant hardening.
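
A rough sketch of what the suggested follow-up hardening might look like, for illustration only. validate_plan_strict and its messages are hypothetical; nothing like it exists in this PR, and the real plan schema may carry more fields than the two checked here.

```python
def validate_plan_strict(plan: dict) -> list[str]:
    """Hypothetical stricter check; returns problems found (empty = valid)."""
    problems = []
    exit_code = plan.get("exitCode")
    # bool is a subclass of int, so reject it explicitly.
    if isinstance(exit_code, bool) or not isinstance(exit_code, int):
        problems.append("exitCode must be an integer")
    if "changes" not in plan:
        # The case the reviewer flagged: absent key, not an empty list.
        problems.append("changes key is required, even when empty")
    elif not isinstance(plan["changes"], list):
        problems.append("changes must be a list")
    return problems
```

Under this sketch, {"exitCode": 0, "changes": []} passes, while {"exitCode": 0} is rejected instead of being accepted as noop.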

global-deepseek / deepseek-v4-pro:cloud

  • Status: ok
  • Verdict: OK
  • Findings: none

redteam / kimi-k2.6:cloud

  • Status: ok
  • Verdict: OK
  • Findings: none

Policy notes

  • GLM 5.1 + DeepSeek V4 Pro are the operator-required model mix for this bot.
  • Optional red-team model is enabled only when PLATFORMCTL_PR_SANITY_REDTEAM_MODEL is configured.
  • Auto-merge is not enabled here.
pdurlej approved these changes 2026-05-30 16:05:01 +02:00
pdurlej left a comment

Approved by Codex using operator-authorized temporary admin PAT after all checks green.
pdurlej deleted branch codex/m06-plan-state-matrix 2026-05-30 16:05:02 +02:00
Reference
pdurlej/platform!643