fix(f3): prepare minio stateful smoke #315

Merged
pdurlej merged 1 commit from codex/f3/minio-prep into main 2026-05-17 03:06:35 +02:00
Collaborator

Canary status: missing — F3 MinIO prep; rely on required Forgejo checks before merge

Canary Context Pack

Product story

MinIO is the local S3-compatible object store. Before F3 can smoke it safely, the manifest needs to reflect the actual Tailnet exposure, live image, volume, and health endpoint.

What changed

  • Populated strict-v2 metadata for minio from live RS2000 runtime evidence.
  • Corrected the live volume name to home-platform_minio_data.
  • Changed health probe to tailnet-https://minio.pdurlej.com/minio/health/ready.
  • Corrected exposure metadata from public/no-auth to Tailnet-only with ts-allowlist@file.
  • Recorded the first-F3 backup strategy: direct volume archive now, future mc mirror restore drill when object-store growth warrants it.

Why it changed

The existing manifest failed strict-v2 and pointed at a public health URL that returns 403 through the allowlist. Runtime evidence shows MinIO is healthy and reachable over Tailnet.

Files touched

  • modules/minio/module.yaml

Runtime evidence

  • home-platform-minio-1: running healthy, image minio/minio:RELEASE.2025-09-07T16-13-09Z.
  • Image digest: minio/minio@sha256:14cea493d9a34af32f524e538b8346cf79f3321eff8e708c1e2960462bd8936e.
  • Volume: home-platform_minio_data mounted at /data.
  • Live volume size: 168K.
  • Public https://minio.pdurlej.com/minio/health/ready returns HTTP 403.
  • Tailnet-resolved https://minio.pdurlej.com/minio/health/ready returns HTTP 200.
  • Container-local http://127.0.0.1:9000/minio/health/live returns HTTP 200.

Known constraints

This PR does not run backup/smoke. After merge, F3 still requires backup-before and manual workflow_dispatch with allow_stateful=true.

Explicit out-of-scope

  • No compose changes.
  • No MinIO restart.
  • No MinIO credentials or bucket contents printed.
  • No minio-init fix; that remains tracked separately (#310 for the blocked sidecar pattern).

Requested decision

Merge to unblock MinIO backup-before and F3 no-op smoke.

Merge blockers

  • strict-v2 validation failure
  • evidence that Tailnet health does not return HTTP 200
  • evidence that exposure metadata no longer matches Traefik middleware

BMADX

  • Gear: X3
  • Gate: execution_allowed=true, bmad_status=ok
  • Reason: stateful object storage with private data and restore blast radius.

Spec sources read

  • modules/minio/module.yaml — target manifest
  • modules/minio/runbook.md — container/runbook name
  • compose/core/compose.yaml — MinIO service, healthcheck, Traefik middleware
  • compose/base/compose.yaml — live volume naming pattern
  • scripts/cutover/README.md — F3 backup class guidance

Verification

python3 "$HOME/.codex/skills/bmadx/scripts/sync_bmadx.py" check --gear X3 --compact
control-plane/.venv/bin/python -m platformctl.cli validate --strict-v2 modules/minio/module.yaml
control-plane/.venv/bin/pytest control-plane/platformctl/tests/test_validate.py control-plane/platformctl/tests/test_health_phase3.py -q
git diff --check

Refs: #142

Canary status: missing — F3 MinIO prep; rely on required Forgejo checks before merge ## Canary Context Pack ### Product story MinIO is the local S3-compatible object store. Before F3 can smoke it safely, the manifest needs to reflect the actual Tailnet exposure, live image, volume, and health endpoint. ### What changed - Populated strict-v2 metadata for `minio` from live RS2000 runtime evidence. - Corrected the live volume name to `home-platform_minio_data`. - Changed health probe to `tailnet-https://minio.pdurlej.com/minio/health/ready`. - Corrected exposure metadata from public/no-auth to Tailnet-only with `ts-allowlist@file`. - Recorded the first-F3 backup strategy: direct volume archive now, future `mc mirror` restore drill when object-store growth warrants it. ### Why it changed The existing manifest failed strict-v2 and pointed at a public health URL that returns 403 through the allowlist. Runtime evidence shows MinIO is healthy and reachable over Tailnet. ### Files touched - `modules/minio/module.yaml` ### Runtime evidence - `home-platform-minio-1`: `running healthy`, image `minio/minio:RELEASE.2025-09-07T16-13-09Z`. - Image digest: `minio/minio@sha256:14cea493d9a34af32f524e538b8346cf79f3321eff8e708c1e2960462bd8936e`. - Volume: `home-platform_minio_data` mounted at `/data`. - Live volume size: `168K`. - Public `https://minio.pdurlej.com/minio/health/ready` returns HTTP 403. - Tailnet-resolved `https://minio.pdurlej.com/minio/health/ready` returns HTTP 200. - Container-local `http://127.0.0.1:9000/minio/health/live` returns HTTP 200. ### Known constraints This PR does not run backup/smoke. After merge, F3 still requires backup-before and manual `workflow_dispatch` with `allow_stateful=true`. ### Explicit out-of-scope - No compose changes. - No MinIO restart. - No MinIO credentials or bucket contents printed. - No `minio-init` fix; that remains tracked separately (#310 for the blocked sidecar pattern). ### Requested decision Merge to unblock MinIO backup-before and F3 no-op smoke. ### Merge blockers - strict-v2 validation failure - evidence that Tailnet health does not return HTTP 200 - evidence that exposure metadata no longer matches Traefik middleware ## BMADX - Gear: X3 - Gate: `execution_allowed=true`, `bmad_status=ok` - Reason: stateful object storage with private data and restore blast radius. ## Spec sources read - `modules/minio/module.yaml` — target manifest - `modules/minio/runbook.md` — container/runbook name - `compose/core/compose.yaml` — MinIO service, healthcheck, Traefik middleware - `compose/base/compose.yaml` — live volume naming pattern - `scripts/cutover/README.md` — F3 backup class guidance ## Verification ```bash python3 "$HOME/.codex/skills/bmadx/scripts/sync_bmadx.py" check --gear X3 --compact control-plane/.venv/bin/python -m platformctl.cli validate --strict-v2 modules/minio/module.yaml control-plane/.venv/bin/pytest control-plane/platformctl/tests/test_validate.py control-plane/platformctl/tests/test_health_phase3.py -q git diff --check ``` Refs: #142
fix(f3): prepare minio stateful smoke
All checks were successful
base-is-main / guard (pull_request) Successful in 1s
canary-required / collect-diff (pull_request) Successful in 4s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 3s
platformctl plan / auto-apply scope (pull_request) Successful in 18s
canary-required / canary (pull_request) Successful in 13s
patchwarden-pr-sanity / sanity (pull_request) Successful in 19s
65dd655969
Sign in to join this conversation.
No reviewers
No labels
W6d-automerge-calibration
agent/claude-code
agent/codex
agent/hermes
agent/iskra
agent/ollama
agent/patchwarden
automerge-candidate
class/security-sensitive
cutover-gate
dependency/blocked
dependency/blocks-others
dependency/cross-repo
dependency/needs-confirmation
domain:agents
domain:ci
domain:docs
domain:forgejo
domain:infra
domain:memory
domain:runtime
domain:signal
domain:ux
flow/architecture
flow/blocked
flow/deployed
flow/done
flow/implementation
flow/intake
flow/maintained
flow/observed
flow/ready
flow/refining
flow/retired
flow/review
iterating
judge/codex-candidate
judge/hermes-candidate
judge/low-confidence
judge/needs-refinement
judge/operator-needed
judge/p0
judge/p1
judge/p2
judge/p3
judge/park
judge/patchwarden-candidate
judge/stale-priority
kind/adr
kind/bug
kind/chore
kind/feature
kind/infra
kind/ops
kind/refactor
kind/research
large-impact
merge/auto
merge/manual
merge/manual-dependency-conflict
merge/manual-failing-tests
merge/manual-merge-conflict
merge/manual-missing-review
merge/manual-operator-preference
merge/manual-red-zone
merge/manual-security-sensitive
merge/manual-unclear-scope
merge/manual-unknown
meta
mode:operator-only
mode:patchwarden-iskra-approved
mode:safe-auto
needs-operator-decision
needs-triage
not-ready
observed/erroring
observed/needs-followup
observed/pending
observed/retire-candidate
observed/unused
observed/used
operator-emotional
owner-attention
phase/02
phase/03
priority:p0
priority:p1
priority:p2
priority:p3
proposed
ready-for-agent
ready-for-operator
recovery
review:claude-reviewed
review:codex-reviewed
review:dziadek-reviewed
review:needs-human
risk/exposure
risk/process
risk/product
risk/runtime
safety:external-write
safety:no-prod-mutation
safety:prod-impact
safety:secret-touch
size/large
size/medium
size/small
size/tiny
size/unknown
source/adr
source/agent-generated
source/manual
source/operator-chat
source/voice-note
status:blocked
status:codex-ready
status:merged:pending-evidence
status:needs-evidence
status:operator-needed
status:parked
tier/full
tier/lite
tier/stacked
tier:0-platform-substrate
tier:1-iskra-value-layer
tier:2-tools-products-modules
type:bug
type:chore
type:docs
type:feat
type:policy
type:research
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
pdurlej/platform!315
No description provided.