Evaluate Ponytail as a possible Patchwarden simplicity-review dependency #123
Labels
No labels
agent/claude-code
agent/codex
agent/gemini
agent/hermes
agent/iskra
agent/ollama
agent/patchwarden
area:business-model
area:competitive
area:discovery
area:forgejo
area:metrics
area:product-strategy
area:v0-core
cagan-grade-approved
client:platform
dependency/blocked
dependency/blocks-others
dependency/cross-repo
dependency/needs-confirmation
domain:agents
domain:ci
domain:docs
domain:forgejo
domain:infra
domain:memory
domain:runtime
domain:signal
domain:ux
flow/architecture
flow/blocked
flow/deployed
flow/done
flow/implementation
flow/intake
flow/maintained
flow/observed
flow/ready
flow/refining
flow/retired
flow/review
judge/codex-candidate
judge/hermes-candidate
judge/low-confidence
judge/needs-refinement
judge/operator-needed
judge/p0
judge/p1
judge/p2
judge/p3
judge/park
judge/patchwarden-candidate
judge/stale-priority
kind/adr
kind/bug
kind/chore
kind/feature
kind/infra
kind/ops
kind/refactor
kind/research
kind:artifact
kind:decision
kind:dogfood
kind:epic
kind:implementation
kind:research
merge/auto
merge/manual
merge/manual-dependency-conflict
merge/manual-failing-tests
merge/manual-merge-conflict
merge/manual-missing-review
merge/manual-operator-preference
merge/manual-red-zone
merge/manual-security-sensitive
merge/manual-unclear-scope
merge/manual-unknown
mode:operator-only
mode:patchwarden-iskra-approved
mode:safe-auto
observed/erroring
observed/needs-followup
observed/pending
observed/retire-candidate
observed/unused
observed/used
priority:p0
priority:p1
priority:p2
priority:p3
ready-for-agent
review:claude-reviewed
review:codex-reviewed
review:dziadek-reviewed
review:needs-human
safety:external-write
safety:no-prod-mutation
safety:prod-impact
safety:secret-touch
size/large
size/medium
size/small
size/tiny
size/unknown
source/adr
source/agent-generated
source/manual
source/operator-chat
source/voice-note
status:blocked
status:blocked-on-discovery
status:cagan-grade-review-pending
status:codex-ready
status:merged:pending-evidence
status:needs-evidence
status:needs-operator-decision
status:operator-needed
status:parked
tier:0-anchor
tier:0-platform-substrate
tier:1-core
tier:1-iskra-value-layer
tier:2-supporting
tier:2-tools-products-modules
type:bug
type:chore
type:docs
type:feat
type:policy
type:research
wave:1-foundation
wave:2-positioning
wave:3-validation
wave:4-economics
wave:5-operating
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
pdurlej/patchwarden#123
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Context
Ponytail is an MIT-licensed “lazy senior dev” ruleset/tooling repo:
https://github.com/DietrichGebert/ponytail
It provides cross-agent instructions, hooks, skills, and MCP support around a simple code-minimization ladder:
Relevant Ponytail sources:
ponytailskill: https://raw.githubusercontent.com/DietrichGebert/ponytail/main/skills/ponytail/SKILL.mdponytail-reviewskill: https://raw.githubusercontent.com/DietrichGebert/ponytail/main/skills/ponytail-review/SKILL.mdWhy this may matter for Patchwarden
Patchwarden already has a clean architectural slot for this kind of signal:
This is adjacent to existing deterministic quality work: generated-artifact, security-path, dependency-risk, content/slop sensors, structural-code artifacts, and review-quorum.
Ponytail’s most interesting contribution is not its hooks/plugin runtime. It is the complexity taxonomy:
That maps naturally to a possible Patchwarden “simplicity / complexity-budget” review lane or later deterministic module.
Dependency question
Do not assume Ponytail should become a runtime dependency.
Evaluate these options explicitly:
Option A — Inspiration only
Lift the taxonomy into Patchwarden reviewer prompts/docs. No runtime dependency, no Node requirement, no hooks.
Likely best first step.
Option B — External workflow/tool input
Run Ponytail or a Ponytail-style review outside Patchwarden and map output into
patchwarden.review_artifact.v1.Useful only if Ponytail exposes a stable machine-readable contract.
Option C — MCP/plugin dependency
Probably not for Patchwarden core. Ponytail’s MCP/plugin story is useful for operator/agent context, but Patchwarden should not depend on always-on agent adapters.
Option D — Runtime dependency
Reject by default unless there is a strong reason. It would add Node/tooling surface and would collide with Patchwarden’s current dependency discipline unless the value is clearly proven.
Proposed first slice
Create a default-off reviewer lane, tentatively:
Expected finding types:
unnecessary_dependencystdlib_reimplementationnative_feature_availablespeculative_abstractiondead_flexibilityshrinkable_logicInitial behavior:
Only promote any finding to a blocker after dogfood calibration.
Questions to answer
review-runprompt plumbing, or does Patchwarden need lane-specific prompt schemas?content_slop_sensor,dependency_risk_sensor, or structural-code gate?Acceptance criteria
simplicity_reviewlane.