docs(specs): prebuild for #176 Hermes voice-clone feasibility spike v0 #351

Merged
pdurlej merged 1 commit from claude/h-batch/hermes-voice-clone-spike-v0-prebuild into main 2026-05-23 10:31:38 +02:00

BATCH H Pan Herbatka fork output. Greenfield spike: hermes-agency has no existing voice/TTS infrastructure. This PR prepares execution of a feasibility decision (SHIP/ITERATE/ABANDON) on cloning the operator's voice for Hermes audio deliverables.

Contents

  • docs/specs/hermes-voice-clone-spike-v0/ — 6 spec files (constitution with 8 principles, specify, plan, tasks, implement-notes, README)
  • prompts/codex-hermes-voice-clone-spike.md — companion execution prompt

8 non-negotiable principles

  • P1 Privacy-first (NO cloud-API for samples — biometric identity)
  • P2 On-device inference only
  • P3 Spike-not-ship (Phase 07 ticket post-SHIP)
  • P4 Operator A/B as quality oracle (NOT MOS/CER metrics)
  • P5 Polish-language binding (NOT English benchmarks)
  • P6 Operator-bandwidth-conscious (≤30 min sample session)
  • P7 Infrastructure-realistic (Mac M1 / RS2000 / VPS1000)
  • P8 Decision-forcing (SHIP/ITERATE/ABANDON, no maybe)
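P4 makes the operator's own A/B preference the quality oracle instead of MOS/CER. A minimal sketch of how a blind A/B session could be organized, assuming two candidate engines and a short list of Polish test sentences. The names `Trial`, `build_trials`, and `tally`, the engine labels, and the sentences are all hypothetical and are not taken from the spec files:

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Trial:
    sentence: str
    slot_a: str  # engine hidden behind label "A" (hypothetical field)
    slot_b: str  # engine hidden behind label "B" (hypothetical field)


def build_trials(sentences, engine_x, engine_y, seed=0):
    """Randomly assign the two engines to slots A/B for each sentence,
    so the operator cannot infer the engine from its position."""
    rng = random.Random(seed)
    trials = []
    for sentence in sentences:
        pair = [engine_x, engine_y]
        rng.shuffle(pair)
        trials.append(Trial(sentence, pair[0], pair[1]))
    return trials


def tally(trials, choices):
    """Count operator preferences per engine; each choice is 'A' or 'B'."""
    if len(trials) != len(choices):
        raise ValueError("one choice per trial is required")
    wins = {}
    for trial, choice in zip(trials, choices):
        if choice not in ("A", "B"):
            raise ValueError(f"choice must be 'A' or 'B', got {choice!r}")
        winner = trial.slot_a if choice == "A" else trial.slot_b
        wins[winner] = wins.get(winner, 0) + 1
    return wins


# Placeholder sentences and engine labels, for illustration only.
sentences = ["Zażółć gęślą jaźń.", "Dzień dobry, tu Hermes."]
trials = build_trials(sentences, "engine-x", "engine-y", seed=42)
print(tally(trials, ["A", "B"]))
```

The mapping from slot to engine stays in the `Trial` records and is only resolved in `tally`, so whoever plays the clips to the operator can work from the A/B labels alone.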

5-slice funnel

(a) Survey → (b) Mac M1 bench → (c) Host bench → (d) Sample protocol → (e) Feasibility report

Top-3 candidates: Coqui XTTS-v2, OpenVoice (MyShell), F5-TTS — explicit Polish-support filter applied.

Safety/production boundary

This PR prepares execution only. It does NOT authorize production deployment, cloud-API use for samples, Hermes runtime mutation, or an implicit ship without a separate Phase 07 ticket.

Tier: Trivial per ADR-0007 (docs-only).

Disjoint from active C/F forks (different file paths).

Refs #176

docs(specs): prebuild for #176 Hermes voice-clone feasibility spike v0
All checks were successful
base-is-main / guard (pull_request) Successful in 2s
canary-required / collect-diff (pull_request) Successful in 4s
patchwarden-pr-sanity / collect-diff (pull_request) Successful in 3s
canary-required / canary (pull_request) Successful in 12s
patchwarden-pr-sanity / sanity (pull_request) Successful in 20s
a37386ba9a
BATCH H Pan Herbatka fork output. Greenfield spike: hermes-agency has
no existing voice/TTS infrastructure. This prebuild prepares execution
of a feasibility decision (SHIP/ITERATE/ABANDON) on cloning the
operator's voice for Hermes audio deliverables.

What's in:

docs/specs/hermes-voice-clone-spike-v0/
- 00-constitution.md (8 non-negotiable principles: privacy-first,
  on-device only, spike-not-ship, operator A/B as quality oracle,
  Polish-language binding, operator-bandwidth-conscious, infra-realistic,
  decision-forcing)
- 01-specify.md (problem, MUST/SHOULD outcomes, acceptance criteria,
  90-min operator-bandwidth budget)
- 02-plan.md (5-slice funnel: survey -> Mac M1 bench -> host bench ->
  sample protocol -> feasibility report; top-3 candidates Coqui XTTS-v2,
  OpenVoice, F5-TTS with explicit Polish-support filter; Mac M1 primary
  benchmark host)
- 03-tasks.md (per-slice checklists with explicit verification steps;
  codex-executor vs operator-runnable mode marked per slice)
- 04-implement-notes.md (Polish phoneme gotchas, per-engine specifics,
  sample storage discipline, hermes-agency integration boundary)
- README.md (TL;DR + reading order + scope)

prompts/codex-hermes-voice-clone-spike.md
- Companion execution prompt per ADR-0018 + Codex feedback pattern
- Safety/production boundary: NO cloud-API for samples, NO Hermes
  runtime mutation, NO implicit ship without separate Phase 07 ticket
- 8 hard gates (privacy, locality, spike-not-ship, decision-forcing,
  Polish-binding, operator-bandwidth, sacred-path-adjacent, ADR-0018)
- 6 stop conditions including SECURITY INCIDENT path if sample leaks
- Reporting format + cousin coordination

Constitution P1+P2 (no cloud, no API for cloning OR inference) is the
absolute privacy hard-line — operator's voice is biometric identity.

Spike output = SHIP/ITERATE/ABANDON verdict. SHIP triggers separate
Phase 07 implementation ticket; ABANDON closes #176 with evidence;
ITERATE schedules next round.
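
The three-way verdict and its follow-ups described above can be sketched as a closed enum, so that anything outside SHIP/ITERATE/ABANDON is rejected (the "no maybe" rule). This is an illustrative sketch only. `Verdict`, `FOLLOW_UP`, `parse_verdict`, and `follow_up` are hypothetical names, not part of the spec files or the companion prompt:

```python
from enum import Enum


class Verdict(Enum):
    SHIP = "SHIP"
    ITERATE = "ITERATE"
    ABANDON = "ABANDON"


# Follow-up per verdict, paraphrasing the commit message above.
FOLLOW_UP = {
    Verdict.SHIP: "open a separate Phase 07 implementation ticket",
    Verdict.ITERATE: "schedule the next spike round",
    Verdict.ABANDON: "close #176 with evidence",
}


def parse_verdict(raw: str) -> Verdict:
    """Accept exactly one of the three verdicts; reject anything else."""
    try:
        return Verdict(raw.strip().upper())
    except ValueError:
        allowed = ", ".join(v.value for v in Verdict)
        raise ValueError(
            f"verdict must be one of {allowed}; got {raw!r}"
        ) from None


def follow_up(raw: str) -> str:
    """Return the follow-up action for a raw verdict string."""
    return FOLLOW_UP[parse_verdict(raw)]


print(follow_up("ship"))
```

Because the enum is closed, an undecided report fails loudly at parse time instead of leaving #176 open without a decision.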

Tier: Trivial per ADR-0007 (docs-only prebuild; no code/schema/runtime).

Refs #176
Reference
pdurlej/platform!351