fix(tests): reconcile platform contract expectations #841

Merged
pdurlej merged 2 commits from codex/810-contract-expectations into main 2026-06-26 14:44:38 +02:00
Collaborator

Canary status: missing — fire canary 3+3 manually before merge

Canary Context Pack

Product story

Contract tests should defend current platform intent, not stale paths or stale ADR numbers. This keeps future agents from mistrusting correct repo state because old expectations fail.

What changed

  • Updated deploy-control test expectations to match current compose/apps/compose.yaml legacy-import runtime paths.
  • Updated Iskra Things contract test to read canonical ADR 0026-wloczykij-gold-vault-and-iskra-promotion-boundary.md after renumbering.

Why it changed

Issue #810 identified two stale test expectations from the Antigravity/Gemini audit.

Files touched

  • tests/test_deploy_control_cutoff_contract.py
  • tests/test_iskra_things_sync_contract.py

Relevant context

  • compose/apps/compose.yaml
  • decisions/0026-wloczykij-gold-vault-and-iskra-promotion-boundary.md

Runtime evidence

None. Test-only reconciliation; no runtime mutation.

Known constraints

This PR does not change compose paths or ADR content. It only updates tests to match the current canonical sources.

Explicit out-of-scope

  • Compose/runtime path changes.
  • ADR renumbering or ADR content edits.
  • Iskra sync implementation behavior.

Requested decision

Approve merge if CI matches local focused verification.

Merge blockers

  • If current compose path is wrong, this PR should not merge; open a runtime/design issue instead.
  • If ADR 0026 is not canonical, this PR should not merge.

Spec sources read

  • compose/apps/compose.yaml
  • tests/test_deploy_control_cutoff_contract.py
  • tests/test_iskra_things_sync_contract.py
  • decisions/0026-wloczykij-gold-vault-and-iskra-promotion-boundary.md

Verification

  • python3 -m pytest tests/test_deploy_control_cutoff_contract.py tests/test_iskra_things_sync_contract.py — 47 passed.

Closes #810

Canary status: missing — fire canary 3+3 manually before merge ## Canary Context Pack ### Product story Contract tests should defend current platform intent, not stale paths or stale ADR numbers. This keeps future agents from mistrusting correct repo state because old expectations fail. ### What changed - Updated deploy-control test expectations to match current `compose/apps/compose.yaml` legacy-import runtime paths. - Updated Iskra Things contract test to read canonical ADR `0026-wloczykij-gold-vault-and-iskra-promotion-boundary.md` after renumbering. ### Why it changed Issue #810 identified two stale test expectations from the Antigravity/Gemini audit. ### Files touched - `tests/test_deploy_control_cutoff_contract.py` - `tests/test_iskra_things_sync_contract.py` ### Relevant context - `compose/apps/compose.yaml` - `decisions/0026-wloczykij-gold-vault-and-iskra-promotion-boundary.md` ### Runtime evidence None. Test-only reconciliation; no runtime mutation. ### Known constraints This PR does not change compose paths or ADR content. It only updates tests to match the current canonical sources. ### Explicit out-of-scope - Compose/runtime path changes. - ADR renumbering or ADR content edits. - Iskra sync implementation behavior. ### Requested decision Approve merge if CI matches local focused verification. ### Merge blockers - If current compose path is wrong, this PR should not merge; open a runtime/design issue instead. - If ADR 0026 is not canonical, this PR should not merge. ## Spec sources read - `compose/apps/compose.yaml` - `tests/test_deploy_control_cutoff_contract.py` - `tests/test_iskra_things_sync_contract.py` - `decisions/0026-wloczykij-gold-vault-and-iskra-promotion-boundary.md` ## Verification - `python3 -m pytest tests/test_deploy_control_cutoff_contract.py tests/test_iskra_things_sync_contract.py` — 47 passed. Closes #810
fix(tests): reconcile platform contract expectations
All checks were successful
python-ci / Python 3.12 (pull_request) Successful in 42s
canary-required / canary (pull_request) Successful in 17s
base-is-main / guard (pull_request) Successful in 1s
canary-required / collect-diff (pull_request) Successful in 4s
patchwarden-client-dry-run / collect-diff (pull_request) Successful in 4s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 4s
python-ci / Python 3.11 (pull_request) Successful in 40s
python-ci / Python 3.13 (pull_request) Successful in 43s
patchwarden-client-dry-run / dry-run (pull_request) Successful in 17s
patchwarden-pr-sanity / sanity (pull_request) Successful in 1m27s
cdf8ee7cee
First-time contributor

Patchwarden PR sanity

Operator signal: 🛑 STOP - reviewer finding(s) must be addressed or explicitly accepted.

Automerge signal: NOT READY - no unattended merge or APPROVED review should be published.

Verdict: 🛑 STOP - a model reviewer reported actionable findings.

Next step: Address the reviewer finding(s), or leave a human decision explaining why the risk is accepted.

  • PR: 841
  • Commit: 99288e979db551c1b72065ceac749a0339c8fb07
  • Status: advisory_findings
  • Reviewer health: findings
  • Security-sensitive label: missing
  • Authority: Patchwarden policy signal; branch protection and automerge controller remain merge authority.
  • Model mix: glm-5.2:cloud, deepseek-v4-pro:cloud, kimi-k2.7-code:cloud

What I checked

  • Changed files: 2
  • Deterministic blocker scan: clean
  • External Forgejo gates: not_reported
  • Model reviewer lanes: 3
  • Comment contract: this comment is updated in place via a hidden Patchwarden marker.

Approval Handoff

  • State: not_ready_reviewer_findings
  • Action: address reviewer finding(s) or leave a human decision before any unattended approval.
  • Boundary: branch protection and the automerge controller remain merge authority.

Signal Board

  • Legend: evidence is sufficient; 🟡 controller still has work; ⚠️ automation retries first; 🛑/ do not approve or merge.
Lane Signal Meaning
🧪 Deterministic sanity clean No deterministic blockers found.
🧩 External Forgejo gates 🟡 not reported No external gate snapshot was included in this report.
🧠 Model reviewers findings Address reviewer finding(s) before approval.
🛡️ Patchwarden approval not_ready_reviewer_findings No unattended APPROVED review should be published.
🚦 Unattended automerge ineligible Outside the narrow safe-docs/status unattended lane.
🙋 Owner attention 🔁 automation first Retry, repair, or inspect automation before asking the owner.
  • Scope blocker: non-doc/status path(s): tests/test_deploy_control_cutoff_contract.py, tests/test_iskra_things_sync_contract.py
    🧭 Merge authority: branch protection and automerge controller remain authoritative.

Required Fixes

No deterministic blockers.

Reviewer Details

Model reviewer lanes

global-glm / glm-5.2:cloud

  • Status: ok

  • Verdict: OK

  • medium Test expectations now hardcode user-specific runtime path 'pdurlej-platform'

    • Evidence: tests/test_deploy_control_cutoff_contract.py: default for PLATFORM_RUNTIME_INTEGRATIONS_DATA_DIR and PLATFORM_RUNTIME_ENV_DIR changed from '/opt/vps-home-platform-infra/...' to '/opt/pdurlej-platform/runtime/legacy-import/...'. Embedding a
    • Next: Confirm via compose/apps/compose.yaml that the literal default '/opt/pdurlej-platform/runtime/legacy-import/...' is the canonical default in the source of truth, not a per-developer override. If compose.yaml uses a parameterized/default value, mirror that exactly rather than a resolved personal path
  • low ADR renumbering from 0023 to 0026 may orphan the original ADR file

    • Evidence: tests/test_iskra_things_sync_contract.py: ADR_0023 renamed to ADR_0026 pointing at 'decisions/0026-wloczykij-gold-vault-and-iskra-promotion-boundary.md'. Diff does not show deletion or redirect of the 0023 file.
    • Next: Verify decisions/0023-* no longer exists or has been superseded per repo ADR governance; otherwise other references (docs, scripts) may still point at 0023 and silently break.

global-deepseek / deepseek-v4-pro:cloud

  • Status: ok
  • Verdict: OK
  • Findings: none

redteam / kimi-k2.7-code:cloud

  • Status: ok

  • Verdict: NOT_OK

  • high Contract test does not enforce absence of whole legacy root mount

    • Evidence: tests/test_deploy_control_cutoff_contract.py function test_deploy_control_no_longer_mounts_whole_legacy_root only asserts presence of the two updated subdir volume strings (${PLATFORM_RUNTIME_INTEGRATIONS_DATA_DIR:-/opt/pdurlej-platform/run
    • Next: Add a negative assertion that parses each volume entry's source path (before the first ':') and rejects any source equal to the legacy root directory, so only the expected data/integrations and env subdirectories are mounted.

Policy notes

  • Patchwarden PR sanity is the first merge-lane signal for this PR.
  • Models produce findings; Patchwarden/policy produces decisions.
  • Model findings alone do not fail the status check; they require human or agent disposition.
  • Formal approval is separate from this comment and requires clean reviewer health.
  • Automerge remains delegated to branch protection and the automerge pilot.
<!-- patchwarden-pr-sanity:pdurlej/platform:PR-841 --> <!-- patchwarden.pr_sanity.v1 status=advisory_findings model_health=findings external_gates=not_reported approval_handoff=not_ready_reviewer_findings pr=841 sha=99288e979db551c1b72065ceac749a0339c8fb07 --> # Patchwarden PR sanity **Operator signal:** 🛑 STOP - reviewer finding(s) must be addressed or explicitly accepted. **Automerge signal:** ❌ NOT READY - no unattended merge or APPROVED review should be published. **Verdict:** 🛑 STOP - a model reviewer reported actionable findings. **Next step:** Address the reviewer finding(s), or leave a human decision explaining why the risk is accepted. - PR: `841` - Commit: `99288e979db551c1b72065ceac749a0339c8fb07` - Status: `advisory_findings` - Reviewer health: `findings` - Security-sensitive label: `missing` - Authority: Patchwarden policy signal; branch protection and automerge controller remain merge authority. - Model mix: `glm-5.2:cloud`, `deepseek-v4-pro:cloud`, `kimi-k2.7-code:cloud` ## What I checked - Changed files: `2` - Deterministic blocker scan: `clean` - External Forgejo gates: `not_reported` - Model reviewer lanes: `3` - Comment contract: this comment is updated in place via a hidden Patchwarden marker. ## Approval Handoff - State: `not_ready_reviewer_findings` - Action: address reviewer finding(s) or leave a human decision before any unattended approval. - Boundary: branch protection and the automerge controller remain merge authority. ## Signal Board - Legend: ✅ evidence is sufficient; 🟡 controller still has work; ⚠️ automation retries first; 🛑/❌ do not approve or merge. | Lane | Signal | Meaning | | --- | --- | --- | | 🧪 Deterministic sanity | ✅ `clean` | No deterministic blockers found. | | 🧩 External Forgejo gates | 🟡 `not reported` | No external gate snapshot was included in this report. | | 🧠 Model reviewers | ❌ `findings` | Address reviewer finding(s) before approval. | | 🛡️ Patchwarden approval | ❌ `not_ready_reviewer_findings` | No unattended APPROVED review should be published. | | 🚦 Unattended automerge | ❌ `ineligible` | Outside the narrow safe-docs/status unattended lane. | | 🙋 Owner attention | 🔁 `automation first` | Retry, repair, or inspect automation before asking the owner. | - Scope blocker: non-doc/status path(s): `tests/test_deploy_control_cutoff_contract.py`, `tests/test_iskra_things_sync_contract.py` 🧭 Merge authority: branch protection and automerge controller remain authoritative. ## Required Fixes No deterministic blockers. ## Reviewer Details <details> <summary>Model reviewer lanes</summary> ### `global-glm` / `glm-5.2:cloud` - Status: `ok` - Verdict: `OK` - **`medium`** Test expectations now hardcode user-specific runtime path 'pdurlej-platform' - Evidence: `tests/test_deploy_control_cutoff_contract.py: default for PLATFORM_RUNTIME_INTEGRATIONS_DATA_DIR and PLATFORM_RUNTIME_ENV_DIR changed from '/opt/vps-home-platform-infra/...' to '/opt/pdurlej-platform/runtime/legacy-import/...'. Embedding a ` - Next: Confirm via compose/apps/compose.yaml that the literal default '/opt/pdurlej-platform/runtime/legacy-import/...' is the canonical default in the source of truth, not a per-developer override. If compose.yaml uses a parameterized/default value, mirror that exactly rather than a resolved personal path - **`low`** ADR renumbering from 0023 to 0026 may orphan the original ADR file - Evidence: `tests/test_iskra_things_sync_contract.py: ADR_0023 renamed to ADR_0026 pointing at 'decisions/0026-wloczykij-gold-vault-and-iskra-promotion-boundary.md'. Diff does not show deletion or redirect of the 0023 file.` - Next: Verify decisions/0023-* no longer exists or has been superseded per repo ADR governance; otherwise other references (docs, scripts) may still point at 0023 and silently break. ### `global-deepseek` / `deepseek-v4-pro:cloud` - Status: `ok` - Verdict: `OK` - Findings: none ### `redteam` / `kimi-k2.7-code:cloud` - Status: `ok` - Verdict: `NOT_OK` - **`high`** Contract test does not enforce absence of whole legacy root mount - Evidence: `tests/test_deploy_control_cutoff_contract.py function test_deploy_control_no_longer_mounts_whole_legacy_root only asserts presence of the two updated subdir volume strings (${PLATFORM_RUNTIME_INTEGRATIONS_DATA_DIR:-/opt/pdurlej-platform/run` - Next: Add a negative assertion that parses each volume entry's source path (before the first ':') and rejects any source equal to the legacy root directory, so only the expected data/integrations and env subdirectories are mounted. </details> ## Policy notes - Patchwarden PR sanity is the first merge-lane signal for this PR. - Models produce findings; Patchwarden/policy produces decisions. - Model findings alone do not fail the status check; they require human or agent disposition. - Formal approval is separate from this comment and requires clean reviewer health. - Automerge remains delegated to branch protection and the automerge pilot.
pdurlej approved these changes 2026-06-26 14:36:07 +02:00
Dismissed
pdurlej left a comment

Operator approval relayed from live Codex merge-fest scope: platform PR queue only.

Operator approval relayed from live Codex merge-fest scope: platform PR queue only.
Merge remote-tracking branch 'origin/main' into codex/810-contract-expectations
All checks were successful
base-is-main / guard (pull_request) Successful in 1s
patchwarden-client-dry-run / collect-diff (pull_request) Successful in 4s
canary-required / canary (pull_request) Successful in 16s
canary-required / collect-diff (pull_request) Successful in 4s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 4s
python-ci / Python 3.11 (pull_request) Successful in 39s
python-ci / Python 3.12 (pull_request) Successful in 42s
python-ci / Python 3.13 (pull_request) Successful in 42s
patchwarden-client-dry-run / dry-run (pull_request) Successful in 17s
patchwarden-pr-sanity / sanity (pull_request) Successful in 1m56s
99288e979d
pdurlej approved these changes 2026-06-26 14:44:37 +02:00
pdurlej left a comment

Operator approval relayed from live Codex merge-fest scope: platform PR queue only.

Operator approval relayed from live Codex merge-fest scope: platform PR queue only.
Sign in to join this conversation.
No reviewers
No labels
W6d-automerge-calibration
agent/claude-code
agent/codex
agent/hermes
agent/iskra
agent/ollama
agent/patchwarden
automerge-candidate
class/security-sensitive
cutover-gate
dependency/blocked
dependency/blocks-others
dependency/cross-repo
dependency/needs-confirmation
domain:agents
domain:ci
domain:docs
domain:forgejo
domain:infra
domain:memory
domain:runtime
domain:signal
domain:ux
flow/architecture
flow/blocked
flow/deployed
flow/done
flow/implementation
flow/intake
flow/maintained
flow/observed
flow/ready
flow/refining
flow/retired
flow/review
iterating
judge/codex-candidate
judge/hermes-candidate
judge/low-confidence
judge/needs-refinement
judge/operator-needed
judge/p0
judge/p1
judge/p2
judge/p3
judge/park
judge/patchwarden-candidate
judge/stale-priority
kind/adr
kind/bug
kind/chore
kind/feature
kind/infra
kind/ops
kind/refactor
kind/research
large-impact
merge/auto
merge/manual
merge/manual-dependency-conflict
merge/manual-failing-tests
merge/manual-merge-conflict
merge/manual-missing-review
merge/manual-operator-preference
merge/manual-red-zone
merge/manual-security-sensitive
merge/manual-unclear-scope
merge/manual-unknown
meta
mode:operator-only
mode:patchwarden-iskra-approved
mode:safe-auto
needs-operator-decision
needs-triage
not-ready
observed/erroring
observed/needs-followup
observed/pending
observed/retire-candidate
observed/unused
observed/used
operator-emotional
owner-attention
phase/02
phase/03
priority:p0
priority:p1
priority:p2
priority:p3
proposed
ready-for-agent
ready-for-operator
recovery
review:claude-reviewed
review:codex-reviewed
review:dziadek-reviewed
review:needs-human
risk/exposure
risk/process
risk/product
risk/runtime
safety:external-write
safety:no-prod-mutation
safety:prod-impact
safety:secret-touch
size/large
size/medium
size/small
size/tiny
size/unknown
source/adr
source/agent-generated
source/manual
source/operator-chat
source/voice-note
status:blocked
status:codex-ready
status:merged:pending-evidence
status:needs-evidence
status:operator-needed
status:parked
tier/full
tier/lite
tier/stacked
tier:0-platform-substrate
tier:1-iskra-value-layer
tier:2-tools-products-modules
type:bug
type:chore
type:docs
type:feat
type:policy
type:research
No milestone
No project
No assignees
3 participants
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
pdurlej/platform!841
No description provided.