fix(modules): probe Meerkat and ntfy health through Tailnet #307

Merged
pdurlej merged 1 commit from codex/f3/tailnet-health-batch into main 2026-05-17 01:12:47 +02:00
Collaborator

Canary status: missing — F3 health-probe batch; PR checks required before merge.

Canary Context Pack

Product story

Continue the operator-approved F3 stateful no-op batch without repeatedly stopping on known Tailnet-gated health endpoints.

What changed

Changed np-meerkat-backend and ntfy health probes from direct public HTTPS to tailnet-https:// while keeping expected status 200.

Why it changed

F3 run #1041 for np-meerkat-backend proved plan/apply were no-op and container health was good, but public health returned 403. Live checks show both meerkat.pdurlej.com/health and ntfy.pdurlej.com/health return 200 when resolved through RS2000 Tailnet, and 403 on ordinary public probe.

Files touched

  • modules/np-meerkat-backend/module.yaml
  • modules/ntfy/module.yaml

Runtime evidence

  • curl https://meerkat.pdurlej.com/health -> 403; curl --resolve meerkat.pdurlej.com:443:100.110.188.20 https://meerkat.pdurlej.com/health -> 200
  • curl https://ntfy.pdurlej.com/health -> 403; curl --resolve ntfy.pdurlej.com:443:100.110.188.20 https://ntfy.pdurlej.com/health -> 200
  • np-meerkat-backend run #1041: plan in-sync, apply noop, container running healthy, health failed only on HTTP 403.

Known constraints

No runtime mutation in this PR. safe-session-api already has no HTTP probe and remains container-only.

Explicit out-of-scope

No compose changes, no production restart, no deploy-control backup-profile fix.

Requested decision

Merge to retry np-meerkat-backend and continue with ntfy + safe-session-api.

Merge blockers

PR checks red or Tailnet probe contract unsupported.

Spec sources read

  • modules/np-meerkat-backend/module.yaml
  • modules/ntfy/module.yaml
  • modules/safe-session-api/module.yaml
  • tests/smoke.sh tailnet-https:// handling
  • F3 run #1041 artifact summary

Verification

  • uv run --with click --with jsonschema --with PyYAML --with httpx python -m platformctl.cli validate --strict-v2 ../modules/np-meerkat-backend/module.yaml: green
  • uv run --with click --with jsonschema --with PyYAML --with httpx python -m platformctl.cli validate --strict-v2 ../modules/ntfy/module.yaml: green
  • uv run --with pytest --with click --with jsonschema --with PyYAML --with httpx python -m pytest platformctl/tests/test_health_phase3.py: 19 passed
Canary status: missing — F3 health-probe batch; PR checks required before merge. ## Canary Context Pack ### Product story Continue the operator-approved F3 stateful no-op batch without repeatedly stopping on known Tailnet-gated health endpoints. ### What changed Changed `np-meerkat-backend` and `ntfy` health probes from direct public HTTPS to `tailnet-https://` while keeping expected status `200`. ### Why it changed F3 run #1041 for `np-meerkat-backend` proved plan/apply were no-op and container health was good, but public health returned `403`. Live checks show both `meerkat.pdurlej.com/health` and `ntfy.pdurlej.com/health` return `200` when resolved through RS2000 Tailnet, and `403` on ordinary public probe. ### Files touched - `modules/np-meerkat-backend/module.yaml` - `modules/ntfy/module.yaml` ### Runtime evidence - `curl https://meerkat.pdurlej.com/health` -> 403; `curl --resolve meerkat.pdurlej.com:443:100.110.188.20 https://meerkat.pdurlej.com/health` -> 200 - `curl https://ntfy.pdurlej.com/health` -> 403; `curl --resolve ntfy.pdurlej.com:443:100.110.188.20 https://ntfy.pdurlej.com/health` -> 200 - `np-meerkat-backend` run #1041: plan `in-sync`, apply `noop`, container running healthy, health failed only on HTTP 403. ### Known constraints No runtime mutation in this PR. `safe-session-api` already has no HTTP probe and remains container-only. ### Explicit out-of-scope No compose changes, no production restart, no `deploy-control` backup-profile fix. ### Requested decision Merge to retry `np-meerkat-backend` and continue with `ntfy` + `safe-session-api`. ### Merge blockers PR checks red or Tailnet probe contract unsupported. ## Spec sources read - `modules/np-meerkat-backend/module.yaml` - `modules/ntfy/module.yaml` - `modules/safe-session-api/module.yaml` - `tests/smoke.sh` `tailnet-https://` handling - F3 run #1041 artifact summary ## Verification - `uv run --with click --with jsonschema --with PyYAML --with httpx python -m platformctl.cli validate --strict-v2 ../modules/np-meerkat-backend/module.yaml`: green - `uv run --with click --with jsonschema --with PyYAML --with httpx python -m platformctl.cli validate --strict-v2 ../modules/ntfy/module.yaml`: green - `uv run --with pytest --with click --with jsonschema --with PyYAML --with httpx python -m pytest platformctl/tests/test_health_phase3.py`: 19 passed
fix(modules): probe Meerkat and ntfy health through Tailnet
All checks were successful
base-is-main / guard (pull_request) Successful in 1s
canary-required / collect-diff (pull_request) Successful in 4s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 3s
platformctl plan / auto-apply scope (pull_request) Successful in 20s
canary-required / canary (pull_request) Successful in 13s
patchwarden-pr-sanity / sanity (pull_request) Successful in 19s
f8cf665358
Sign in to join this conversation.
No reviewers
No labels
W6d-automerge-calibration
agent/claude-code
agent/codex
agent/hermes
agent/iskra
agent/ollama
agent/patchwarden
automerge-candidate
class/security-sensitive
cutover-gate
dependency/blocked
dependency/blocks-others
dependency/cross-repo
dependency/needs-confirmation
domain:agents
domain:ci
domain:docs
domain:forgejo
domain:infra
domain:memory
domain:runtime
domain:signal
domain:ux
flow/architecture
flow/blocked
flow/deployed
flow/done
flow/implementation
flow/intake
flow/maintained
flow/observed
flow/ready
flow/refining
flow/retired
flow/review
iterating
judge/codex-candidate
judge/hermes-candidate
judge/low-confidence
judge/needs-refinement
judge/operator-needed
judge/p0
judge/p1
judge/p2
judge/p3
judge/park
judge/patchwarden-candidate
judge/stale-priority
kind/adr
kind/bug
kind/chore
kind/feature
kind/infra
kind/ops
kind/refactor
kind/research
large-impact
merge/auto
merge/manual
merge/manual-dependency-conflict
merge/manual-failing-tests
merge/manual-merge-conflict
merge/manual-missing-review
merge/manual-operator-preference
merge/manual-red-zone
merge/manual-security-sensitive
merge/manual-unclear-scope
merge/manual-unknown
meta
mode:operator-only
mode:patchwarden-iskra-approved
mode:safe-auto
needs-operator-decision
needs-triage
not-ready
observed/erroring
observed/needs-followup
observed/pending
observed/retire-candidate
observed/unused
observed/used
operator-emotional
owner-attention
phase/02
phase/03
priority:p0
priority:p1
priority:p2
priority:p3
proposed
ready-for-agent
ready-for-operator
recovery
review:claude-reviewed
review:codex-reviewed
review:dziadek-reviewed
review:needs-human
risk/exposure
risk/process
risk/product
risk/runtime
safety:external-write
safety:no-prod-mutation
safety:prod-impact
safety:secret-touch
size/large
size/medium
size/small
size/tiny
size/unknown
source/adr
source/agent-generated
source/manual
source/operator-chat
source/voice-note
status:blocked
status:codex-ready
status:merged:pending-evidence
status:needs-evidence
status:operator-needed
status:parked
tier/full
tier/lite
tier/stacked
tier:0-platform-substrate
tier:1-iskra-value-layer
tier:2-tools-products-modules
type:bug
type:chore
type:docs
type:feat
type:policy
type:research
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
pdurlej/platform!307
No description provided.