feat(observability): PW-G007 + PW-G008 + PW-G015 — analytics loop + live dashboard + gap-intake ledger #109

Closed
opened 2026-06-23 07:23:01 +02:00 by claude · 2 comments
Collaborator

Materializes gaps PW-G007 (analytics feedback loop), PW-G008 (live dashboard), PW-G015 (vision-gap intake ledger) from docs/status.html. Inspiration for codex — not a rigid spec.

The vision

The system can't improve what it can't see. Three observability surfaces that close the feedback loop:

  • Analytics feedback loop (G007): post-merge value / incident / friction signals → automatically create follow-up issues or dispatch another worker loop. The merge isn't the end; it feeds the next cycle.
  • Live dashboard (G008): docs/status.html evolves from a static snapshot into a live view over layers, artifacts, stale states, and missing evidence — the operator sees real state at a glance (it already self-flags "static snapshot" as a gap).
  • Gap-intake ledger (G015): operator notes, audits, and red-team findings become tracked gaps (exactly the PW-G### list) instead of vanishing in prose. (This very issue set is the manual version of G015 — automate the intake.)

Why it matters

G007 turns Patchwarden from a gate into a learning loop. G008 makes trust legible (and is the operator's daily window). G015 is how the roadmap stays honest — every audit/red-team finding lands as a trackable item, not a forgotten paragraph.

Inspiration — possible shapes

  • G007: a post-merge hook/job that reads merge outcome + later signals (incident log docs/incidents.md, CI failures, revert rate) → opens a Forgejo follow-up issue or enqueues a controller job. Deterministic thresholds, not vibes.
  • G008: the operator_status.v1 data model already exists — a live renderer could read current Forgejo workflow/artifact state into the same shape (the static page stays the fallback / --no-network mode). Keep the static page working (its test forbids network in the static artifact).
  • G015: a lightweight ledger (the PW-G list) with a CLI to add/track gaps; could generate the status.html "What Is Still Missing" section from data so the ledger and the page never drift (a drift-guard, like we did for module-inventory).
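
One way to read "deterministic thresholds, not vibes" for G007 is the minimal sketch below. It assumes a hypothetical signal record per merged PR and hypothetical limits. None of the names or numbers come from the Patchwarden codebase. It only returns candidates and writes nothing to Forgejo, and a dedup key keeps repeated runs from proposing the same follow-up twice.

```python
from dataclasses import dataclass

# Hypothetical limits; the issue only asks that thresholds be deterministic.
MAX_REVERTS = 0
MAX_CI_FAILURES = 2
MAX_INCIDENTS = 0


@dataclass(frozen=True)
class MergeSignals:
    """Post-merge signals for one merged PR (illustrative shape)."""
    pr_number: int
    reverts: int
    ci_failures: int
    incidents: int


@dataclass(frozen=True)
class FollowUpCandidate:
    """A read-only suggestion; nothing is written to Forgejo here."""
    dedup_key: str
    title: str


def follow_up_candidates(signals, already_open):
    """Return one candidate per breached threshold, skipping known dedup keys."""
    checks = [
        ("reverts", signals.reverts, MAX_REVERTS),
        ("ci_failures", signals.ci_failures, MAX_CI_FAILURES),
        ("incidents", signals.incidents, MAX_INCIDENTS),
    ]
    candidates = []
    for name, value, limit in checks:
        if value <= limit:
            continue  # within the threshold, nothing to report
        key = f"pr-{signals.pr_number}:{name}"
        if key in already_open:
            continue  # a follow-up for this signal is already tracked
        title = f"Follow up on PR #{signals.pr_number}: {name}={value} exceeds {limit}"
        candidates.append(FollowUpCandidate(key, title))
    return candidates
```

Passing the set of already-open dedup keys in as an argument keeps the function pure, so the bounded and deduplicated behaviour can be unit-tested without a Forgejo instance.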

Hard boundaries (safety, not design)

  • Static status.html must stay network-free + script-free (its test asserts no http(s)://, no <script>, no src=). A live dashboard is a separate surface, not a rewrite that breaks the static artifact.
  • Analytics auto-creating issues/jobs is an action — keep it bounded, deduplicated, and operator-visible; never auto-merge or mutate based on analytics.
  • D20 holds; observability reads + reports, it doesn't decide merges.
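
The static-page boundary above can be expressed as a small check. This is a sketch under assumptions, not the contents of tests/test_status_html.py: the function name and the exact regular expressions are illustrative, and only the three forbidden patterns (http(s)://, <script>, src=) come from the issue text.

```python
import re

# Patterns mirror the three constraints named in the issue; the regexes are assumptions.
FORBIDDEN = {
    "network URL": re.compile(r"https?://", re.IGNORECASE),
    "script tag": re.compile(r"<script\b", re.IGNORECASE),
    "src attribute": re.compile(r"\bsrc\s*=", re.IGNORECASE),
}


def static_violations(html: str) -> list[str]:
    """Return the label of every forbidden pattern found in the static page."""
    return [label for label, pattern in FORBIDDEN.items() if pattern.search(html)]
```

A live dashboard would be rendered by a separate code path and never passed through this check, which keeps the static artifact's guarantee intact.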

The HOW is yours

Sequence freely — G015 ledger (cheap, makes the rest data-driven) is a natural first step; G008 live view and G007 loop are larger. Propose shapes; split into PRs.

Status / refs

  • D21/M2: new capability → M2-gated backlog; the G015 ledger / status-data drift-guard is the most M2-permittable (docs/observability hardening of an existing artifact).
  • Refs: PW-G007/G008/G015 · src/patchwarden/operator_status.py + docs/status.html · tests/test_status_html.py (static constraints) · docs/incidents.md (analytics input) · tests/test_docs_module_inventory.py (drift-guard pattern to mirror).
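
A drift-guard between the ledger and the page could take roughly the following shape. This is a hedged sketch: the function names, the entry fields (id, title), and the HTML layout are hypothetical and are not taken from tests/test_docs_module_inventory.py or docs/status.html.

```python
from html import escape


def render_missing_section(gaps):
    """Render ledger entries as the list a status page would embed."""
    items = "\n".join(
        f"  <li>{escape(g['id'])}: {escape(g['title'])}</li>" for g in gaps
    )
    return f"<ul>\n{items}\n</ul>"


def page_in_sync(page_html, gaps):
    """True when the page contains exactly the section the ledger would generate."""
    return render_missing_section(gaps) in page_html
```

A test built on `page_in_sync` fails as soon as a gap is added to the ledger without regenerating the page, or the page is edited by hand.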

Created by claude from the status.html gap ledger (2026-06-23). Executor: codex.

Collaborator

Addressed by #111 as the first Patchwarden-side/read-only slice.

Close basis: patchwarden feedback-intake-check now records value/incident/friction/vision-gap signals and emits read-only follow-up candidates; static status and the gap ledger are present. External issue/job writers and a live dashboard remain tracked as PW-G007/PW-G008/PW-G015 follow-up work in the status artifacts and docs/operations/vision-gap-issue-disposition.md.

Collaborator

Follow-up progress in #111: G015 now has a concrete read-only ledger module.

What landed:

  • src/patchwarden/vision_gap_ledger.py is the durable PW-G001..PW-G018 source consumed by patchwarden status;
  • operator_status.py no longer owns the hand-written gap list, so the backlog is no longer prose-only status data;
  • tests/test_vision_gap_ledger.py verifies contiguous IDs, required fields, allowed statuses, and that patchwarden status --format json uses the ledger exactly;
  • docs/status.html, docs/architecture.md, and docs/operations/vision-gap-issue-disposition.md now name the ledger explicitly.

Verification: PYTHONPATH=src:. python3 -m unittest discover -s tests -> 512 tests OK.

Boundary: this is still read-only. External issue/job writers and the live dashboard remain future work, as stated in #111.
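
For illustration, the ledger checks described in this comment (contiguous IDs, required fields, allowed statuses) might look like the sketch below. The field names and status values are assumptions, since the thread does not list them, and this is not the code in tests/test_vision_gap_ledger.py.

```python
import re

# Assumed values; the real required fields and allowed statuses are not given in this thread.
ALLOWED_STATUSES = {"open", "partial", "closed"}
REQUIRED_FIELDS = ("id", "title", "status")
GAP_ID = re.compile(r"PW-G(\d{3})")


def ledger_errors(entries):
    """Return human-readable problems found in a list of ledger entries."""
    errors = []
    numbers = []
    for index, entry in enumerate(entries):
        missing = [f for f in REQUIRED_FIELDS if not entry.get(f)]
        if missing:
            errors.append(f"entry {index}: missing {', '.join(missing)}")
            continue
        match = GAP_ID.fullmatch(entry["id"])
        if match is None:
            errors.append(f"entry {index}: malformed id {entry['id']!r}")
            continue
        numbers.append(int(match.group(1)))
        if entry["status"] not in ALLOWED_STATUSES:
            errors.append(f"{entry['id']}: status {entry['status']!r} not allowed")
    # IDs must run PW-G001, PW-G002, ... with no holes or reordering.
    if numbers != list(range(1, len(numbers) + 1)):
        errors.append("ids are not contiguous from PW-G001")
    return errors
```

Returning a list of messages instead of raising on the first problem lets one test run report every broken entry at once.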

Reference
pdurlej/patchwarden#109