fix(tests): smoke.sh runtime drift rewrite #52

Merged
pdurlej merged 1 commit from codex/orders/smoke-sh-rewrite into main 2026-05-05 01:32:00 +02:00
Collaborator

Canary status: missing — manual canary 3+3 required before merge

Canary Context Pack

Product story

tests/smoke.sh --all should be a trustworthy daily drift detector. PR #40 v1 found useful module drift, but canary showed the implementation could also produce false failures across many modules. This rewrite fixes the specific PR #40 findings and keeps runtime smoke independent from schema validation dependencies.

What changed

  • Rewrote tests/smoke.sh as runtime smoke only: manifest presence, runbook-derived container name, live container status, registry digest drift, HTTP health, optional module hook.
  • Added tests/validate-schema.sh for Python/jsonschema/PyYAML validation outside the runtime smoke path.
  • JSON output now uses jq, not printf-built JSON.
  • Digest comparison reads the live image RepoDigest via docker image inspect, not container .Image.

Choices made

  • Picked HIGH 3 option (b): split schema validation from runtime smoke.
  • Picked runbook Container: as primary container-name source; no home-platform- prefix fallback in the critical path.
  • Kept smoke-extra support as an existing extension hook, but no new remediation/cron framework.

Verification

  • bash -n tests/smoke.sh — PASS
  • bash -n tests/validate-schema.sh — PASS
  • tests/validate-schema.sh n8n-worker — PASS
  • tests/smoke.sh --json n8n-worker | jq — PASS
  • Selected live targets: n8n-worker OK, agaria-postgres OK, agent-plane-shadow-control OK; uptime-kuma and karakeep show real live drift/health findings.
  • tests/smoke.sh --all inventory: 80 modules checked, 63 OK, 17 FAIL.

n8n-worker drift status

Confirmed PR #40 v1 false-positive. Live RepoDigest matches modules/n8n-worker/module.yaml: sha256:72e2242d5d3b89f2501c9ffe4208b98bcd7345f3fa8c40a57d502d6c1a5315d3.

Known constraints

  • shellcheck is not installed locally, so static shellcheck was not run.
  • --all still surfaces real module drift/open-loop failures; this PR does not update module manifests.
Canary status: missing — manual canary 3+3 required before merge ## Canary Context Pack ### Product story `tests/smoke.sh --all` should be a trustworthy daily drift detector. PR #40 v1 found useful module drift, but canary showed the implementation could also produce false failures across many modules. This rewrite fixes the specific PR #40 findings and keeps runtime smoke independent from schema validation dependencies. ### What changed - Rewrote `tests/smoke.sh` as runtime smoke only: manifest presence, runbook-derived container name, live container status, registry digest drift, HTTP health, optional module hook. - Added `tests/validate-schema.sh` for Python/jsonschema/PyYAML validation outside the runtime smoke path. - JSON output now uses `jq`, not printf-built JSON. - Digest comparison reads the live image RepoDigest via `docker image inspect`, not container `.Image`. ### Choices made - Picked HIGH 3 option (b): split schema validation from runtime smoke. - Picked runbook `Container:` as primary container-name source; no `home-platform-` prefix fallback in the critical path. - Kept smoke-extra support as an existing extension hook, but no new remediation/cron framework. ### Verification - `bash -n tests/smoke.sh` — PASS - `bash -n tests/validate-schema.sh` — PASS - `tests/validate-schema.sh n8n-worker` — PASS - `tests/smoke.sh --json n8n-worker | jq` — PASS - Selected live targets: `n8n-worker` OK, `agaria-postgres` OK, `agent-plane-shadow-control` OK; `uptime-kuma` and `karakeep` show real live drift/health findings. - `tests/smoke.sh --all` inventory: 80 modules checked, 63 OK, 17 FAIL. ### n8n-worker drift status Confirmed PR #40 v1 false-positive. Live RepoDigest matches `modules/n8n-worker/module.yaml`: `sha256:72e2242d5d3b89f2501c9ffe4208b98bcd7345f3fa8c40a57d502d6c1a5315d3`. ### Known constraints - `shellcheck` is not installed locally, so static shellcheck was not run. - `--all` still surfaces real module drift/open-loop failures; this PR does not update module manifests.
fix(tests): rewrite smoke runtime drift checks
Some checks failed
canary-required / collect-diff (pull_request) Failing after 2s
canary-required / canary (pull_request) Has been skipped
6f20160a49
Per prompts/tooling-smoke-sh-rewrite.md.\n\n- remove Python package dependency from runtime smoke path\n- resolve containers from runbook instead of hardcoded home-platform prefix\n- compare registry RepoDigest safely via docker image inspect\n- emit JSON through jq and move schema validation to tests/validate-schema.sh\n\nVerification: bash -n, schema validator smoke, n8n-worker live smoke, selected live module checks, --all drift inventory. n8n-worker drift from PR #40 v1 is confirmed false-positive.
Sign in to join this conversation.
No reviewers
No labels
W6d-automerge-calibration
agent/claude-code
agent/codex
agent/hermes
agent/iskra
agent/ollama
agent/patchwarden
automerge-candidate
class/security-sensitive
cutover-gate
dependency/blocked
dependency/blocks-others
dependency/cross-repo
dependency/needs-confirmation
domain:agents
domain:ci
domain:docs
domain:forgejo
domain:infra
domain:memory
domain:runtime
domain:signal
domain:ux
flow/architecture
flow/blocked
flow/deployed
flow/done
flow/implementation
flow/intake
flow/maintained
flow/observed
flow/ready
flow/refining
flow/retired
flow/review
iterating
judge/codex-candidate
judge/hermes-candidate
judge/low-confidence
judge/needs-refinement
judge/operator-needed
judge/p0
judge/p1
judge/p2
judge/p3
judge/park
judge/patchwarden-candidate
judge/stale-priority
kind/adr
kind/bug
kind/chore
kind/feature
kind/infra
kind/ops
kind/refactor
kind/research
large-impact
merge/auto
merge/manual
merge/manual-dependency-conflict
merge/manual-failing-tests
merge/manual-merge-conflict
merge/manual-missing-review
merge/manual-operator-preference
merge/manual-red-zone
merge/manual-security-sensitive
merge/manual-unclear-scope
merge/manual-unknown
meta
mode:operator-only
mode:patchwarden-iskra-approved
mode:safe-auto
needs-operator-decision
needs-triage
not-ready
observed/erroring
observed/needs-followup
observed/pending
observed/retire-candidate
observed/unused
observed/used
operator-emotional
owner-attention
phase/02
phase/03
priority:p0
priority:p1
priority:p2
priority:p3
proposed
ready-for-agent
ready-for-operator
recovery
review:claude-reviewed
review:codex-reviewed
review:dziadek-reviewed
review:needs-human
risk/exposure
risk/process
risk/product
risk/runtime
safety:external-write
safety:no-prod-mutation
safety:prod-impact
safety:secret-touch
size/large
size/medium
size/small
size/tiny
size/unknown
source/adr
source/agent-generated
source/manual
source/operator-chat
source/voice-note
status:blocked
status:codex-ready
status:merged:pending-evidence
status:needs-evidence
status:operator-needed
status:parked
tier/full
tier/lite
tier/stacked
tier:0-platform-substrate
tier:1-iskra-value-layer
tier:2-tools-products-modules
type:bug
type:chore
type:docs
type:feat
type:policy
type:research
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
pdurlej/platform!52
No description provided.