ops(rs2000): F3 stateful smoke design + backup-before contract #271
Labels
No labels
W6d-automerge-calibration
agent/claude-code
agent/codex
agent/hermes
agent/iskra
agent/ollama
agent/patchwarden
automerge-candidate
class/security-sensitive
cutover-gate
dependency/blocked
dependency/blocks-others
dependency/cross-repo
dependency/needs-confirmation
domain:agents
domain:ci
domain:docs
domain:forgejo
domain:infra
domain:memory
domain:runtime
domain:signal
domain:ux
flow/architecture
flow/blocked
flow/deployed
flow/done
flow/implementation
flow/intake
flow/maintained
flow/observed
flow/ready
flow/refining
flow/retired
flow/review
iterating
judge/codex-candidate
judge/hermes-candidate
judge/low-confidence
judge/needs-refinement
judge/operator-needed
judge/p0
judge/p1
judge/p2
judge/p3
judge/park
judge/patchwarden-candidate
judge/stale-priority
kind/adr
kind/bug
kind/chore
kind/feature
kind/infra
kind/ops
kind/refactor
kind/research
large-impact
merge/auto
merge/manual
merge/manual-dependency-conflict
merge/manual-failing-tests
merge/manual-merge-conflict
merge/manual-missing-review
merge/manual-operator-preference
merge/manual-red-zone
merge/manual-security-sensitive
merge/manual-unclear-scope
merge/manual-unknown
meta
mode:operator-only
mode:patchwarden-iskra-approved
mode:safe-auto
needs-operator-decision
needs-triage
not-ready
observed/erroring
observed/needs-followup
observed/pending
observed/retire-candidate
observed/unused
observed/used
operator-emotional
owner-attention
phase/02
phase/03
priority:p0
priority:p1
priority:p2
priority:p3
proposed
ready-for-agent
ready-for-operator
recovery
review:claude-reviewed
review:codex-reviewed
review:dziadek-reviewed
review:needs-human
risk/exposure
risk/process
risk/product
risk/runtime
safety:external-write
safety:no-prod-mutation
safety:prod-impact
safety:secret-touch
size/large
size/medium
size/small
size/tiny
size/unknown
source/adr
source/agent-generated
source/manual
source/operator-chat
source/voice-note
status:blocked
status:codex-ready
status:merged:pending-evidence
status:needs-evidence
status:operator-needed
status:parked
tier/full
tier/lite
tier/stacked
tier:0-platform-substrate
tier:1-iskra-value-layer
tier:2-tools-products-modules
type:bug
type:chore
type:docs
type:feat
type:policy
type:research
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
pdurlej/platform#271
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Context
F3 is the future stateful smoke phase for RS2000. Unlike F1/F1.5/F2 no-op smokes, F3 touches services with local data or persistent dependencies. Every F3 candidate needs a backup-before-apply contract before any workflow_dispatch smoke.
PR #270 adds a draft operator-run helper:
scripts/cutover/backup-before-apply.shscripts/cutover/README.mdNo F3 smoke is part of PR #270. This issue is the design gate before any stateful smoke starts.
Stateful candidates by backup class
Class A - Postgres-family
Procedure:
pg_dumpallinside the target container, gzip output, write mode 0600 under/opt/pdurlej-platform/backups.Candidates:
postgreshoncho-postgresagent-plane-shadow-postgresNotes: core/high-value data. Do not use as first F3 smoke.
Class B - Redis-family
Procedure: request
redis-cli BGSAVEwhen possible, then archive mounted Redis data. Password-protected Redis may skip BGSAVE until auth behavior is verified.Candidates:
redishoncho-redisinfisical-redisNotes: core or security-sensitive dependencies. Not first F3.
Class C - Vault
Procedure: if
VAULT_TOKENis provided and Vault is unsealed, usevault operator raft snapshot save; otherwise mount archive fallback.Candidates:
vaultNotes: sunset/security-sensitive. Not first F3.
Class D - MinIO/S3
Procedure: prefer
mc mirrorto backup path whenMINIO_MC_ALIASis configured; otherwise mount archive fallback.Candidates:
minioNotes: needs operator decision on
mccredentials/alias and retention before use.Class E - Filesystem/app state
Procedure: archive Docker mounts for the resolved live container.
Candidates:
audio-mcp-legacyaudio-mcpdeploy-controlforgejogit-mirrorgmail-private-mcpgmail-triage-mcphermes-agencyinfisicaljellyfinkanboardn8n-mainnp-meerkat-backendnp-memosnp-openhabittrackernp-radicalenp-silverbulletnp-tududinpntfyobsidian-headless-syncproducts-agent-eval-labsafe-session-apishelfmarksignal-bridge-legacysignal-bridge-mautrixsignal-clistorage-ro-mcpsynapseteamspeak3traefikuptime-kumavoice-transcriptionNotes: likely first F3 should come from this class, but not routing, identity, core state, or high-value services. Current recommendation: start with
uptime-kumaor another low-blast Class E service after a mount audit.Class F - Engine-specific state
Procedure: engine dump preferred, mount archive fallback until dump is proven.
Candidates:
karakeep-meilisearchsearxngNotes:
searxngmay be low-blast, but only after confirming its real mounted state and restore story.Class G - Agaria separate root
Procedure: separate compose/root backup outside canonical RS2000 compose path.
Candidates:
agaria-postgresagaria-redisNotes: out of canonical compose; requires Agaria-specific plan before F3.
Backup destination and size evidence
Default destination from PR #270:
Backup files must be mode 0600 because they can contain secrets/private data.
Read-only evidence from 2026-05-13:
Per-module expected backup sizes are not yet asserted. They need a dedicated read-only volume audit before first F3 smoke.
Operator decisions before F3
uptime-kumaor similar after mount audit.Restore contract to define before first F3
Minimum restore notes needed:
pg_dumpalloutput into the correct Postgres container/database context.Explicit non-goals
References
Codex Wave 1 M01 closeout — F3 design gate superseded by completed F3 migration
Role: executor
Action: closing as resolved/superseded
This issue was the pre-F3 design gate for stateful smoke + backup-before contract. The current roadmap now treats F3/F3 final bosses as complete verification waves, and Milestone 01 only needs stale F3 reconciliation plus legacy cleanup.
Relevant durable state:
state/roadmap/current-platform-roadmap.md: F1/F1.5/F2/F3 smokes are complete verification waves, no longer future phases.state/STATUS_NOW.md: F3 live-service migration is complete; final-boss services were migrated with backup-before evidence.No runtime mutation was performed. Future backup/restore work belongs to Milestone 02 (#45/#238), not this stale F3 gate.