fix(honcho): route memory LLM defaults to qwen #699
Reference
pdurlej/platform!699
Canary status: missing - fire canary 3+3 manually before merge
Summary
This PR takes over the repo-source slice from draft PR #690 and switches Honcho's default Ollama/OpenAI-compatible text model contract from gemma4:31b-cloud to qwen3.5.
It updates the source defaults, DR sandbox defaults, synthetic compatibility smoke, and active runbook/closeout docs. It does not restart production Honcho, does not edit RS2000 live override files, and does not write to Infisical.
Refs #293, #357, #363, #690.
Canary Context Pack
Product story
Iskra/OpenClaw memory derivation must stop depending on a model that has been observed returning Markdown prose where Honcho needs JSON-compatible structured output. The repo source of truth should point future deploys and DR restores at the model already proven in the local classifier lane.
What changed
compose/apps/compose.yaml: Honcho model defaults changed from gemma4:31b-cloud to qwen3.5.
Why it changed
Draft PR #690 documents a live P1 symptom: Honcho derivation has been failing because the current Gemma route returns non-JSON prose. The source-of-truth repo needs to stop recreating that route before any runtime reconcile/restart happens.
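The failure mode described above can be illustrated with a minimal sketch that accepts a model reply only if it parses as a JSON object. The function name `parse_structured_reply` and both sample replies are hypothetical and are not taken from Honcho or from this PR; Honcho's real derivation schema is not shown here.

```python
import json


def parse_structured_reply(reply: str) -> dict:
    """Return the reply as a dict, or raise ValueError if it is not a JSON object.

    Hypothetical helper for illustration; not Honcho's actual parser.
    """
    try:
        parsed = json.loads(reply)
    except json.JSONDecodeError as exc:
        # Markdown prose such as "## Observations ..." ends up here.
        raise ValueError(f"reply is not valid JSON: {exc.msg}") from exc
    if not isinstance(parsed, dict):
        # Valid JSON that is a list, string, or number is still not usable.
        raise ValueError("reply is JSON but not an object")
    return parsed


# Hypothetical sample replies: one structured, one Markdown prose.
good_reply = '{"observations": ["user prefers short replies"]}'
prose_reply = "## Observations\n\n- The user prefers short replies."
```

Under these assumptions, `parse_structured_reply(good_reply)` returns a dict, while `parse_structured_reply(prose_reply)` raises `ValueError`, which is the class of reply the Gemma route is reported to produce.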
Files touched
compose/apps/compose.yaml
scripts/dr/w3d-local-sandbox-drill.sh
scripts/honcho/ollama-gemma-compat-smoke.py
control-plane/platformctl/tests/test_honcho_ollama_contract.py
tests/test_honcho_log_privacy.py
runbooks/honcho-ollama-gemma-switch.md
state/cutover/honcho-closeout-plan.md
contexts/persona-bridge/MAP.md
contexts/observability/MAP.md
docs/incidents/2026-06-02-honcho-qwen-fix.md
Relevant context
Runtime evidence
Repo-only evidence in this PR:
PYTHONPATH=control-plane python3 -m pytest control-plane/platformctl/tests/test_honcho_ollama_contract.py tests/test_honcho_log_privacy.py passed.
git diff --check passed.
bash -n scripts/dr/w3d-local-sandbox-drill.sh passed.
PYTHONPATH=control-plane python3 -m platformctl.cli validate all --json passed with exitCode: 0.
No production restart, no live Infisical write, and no live Honcho derivation proof was performed in this PR.
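A repo-only check in the same spirit as the evidence above could scan config text for lines that still mention the old model tag. This is a sketch under stated assumptions: `find_legacy_refs` and the sample keys `HONCHO_MODEL` and `HONCHO_FALLBACK` are hypothetical and do not come from compose/apps/compose.yaml or from the contract tests in this PR.

```python
LEGACY_TAG = "gemma4:31b-cloud"  # tag this PR moves away from
NEW_TAG = "qwen3.5"              # tag this PR moves to


def find_legacy_refs(text: str, legacy: str = LEGACY_TAG) -> list[int]:
    """Return 1-based line numbers that still mention the legacy tag."""
    return [
        number
        for number, line in enumerate(text.splitlines(), start=1)
        if legacy in line
    ]


# Hypothetical config fragment; the key names are invented for illustration.
sample_config = (
    f"HONCHO_MODEL: {NEW_TAG}\n"
    f"HONCHO_FALLBACK: {LEGACY_TAG}\n"
)

stale_lines = find_legacy_refs(sample_config)
```

Such a scan would suit config defaults only; incident docs and runbooks may mention the old tag on purpose when describing the switch.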
Known constraints
The live acceptance criteria from #690 still require a gated runtime step: reconcile Infisical and RS2000 live overrides, restart only home-platform-honcho-deriver-1 and home-platform-honcho-api-1, then prove valid JSON derivation and backlog progress.
Explicit out-of-scope
Requested decision
Approve this as the source-of-truth repo fix needed before the runtime cutover/reconcile step.
Merge blockers
gemma4:31b-cloud.
qwen3.5 from the available classifier integration checkout.
Spec sources read
docs/incidents/2026-06-02-honcho-qwen-fix.md - incident packet and acceptance constraints.
compose/apps/compose.yaml - Honcho LLM defaults source of truth.
scripts/dr/w3d-local-sandbox-drill.sh - DR/sandbox restore default contract.
scripts/honcho/ollama-gemma-compat-smoke.py - synthetic provider compatibility smoke.
control-plane/platformctl/tests/test_honcho_ollama_contract.py - Honcho model/default tests.
tests/test_honcho_log_privacy.py - model metadata in log privacy fixture.
runbooks/honcho-ollama-gemma-switch.md - operator runbook.
state/cutover/honcho-closeout-plan.md - active closeout plan.
contexts/persona-bridge/MAP.md and contexts/observability/MAP.md - context navigation labels.
/Users/pd/Developer/books-for-iskra/iskra-openclaw-integration/scripts/iskra-fastmail-dmz-classifier.py - exact existing Qwen model tag evidence.
Patchwarden PR sanity
advisory_findings | 699 | deb5081804ccee2c3dbbbc04379d37ae4eba3fd2 | missing
Deterministic findings
No deterministic findings.
Model reviewers
global-glm/glm-5.1:cloud
Status: ok
Verdict: OK
medium: Legacy filenames may confuse future operators searching for Qwen references
scripts/honcho/ollama-gemma-compat-smoke.py retains 'gemma' in filename while testing qwen3.5; runbooks/honcho-ollama-gemma-switch.md retains 'gemma' in filename while documenting Qwen switch. Diff shows schema changed to 'honcho_ollama_qwe
global-deepseek/deepseek-v4-pro:cloud
Status: ok
Verdict: OK
low: Model tag qwen3.5 not verified against Ollama Cloud catalog
compose/apps/compose.yaml: all Honcho model defaults changed to qwen3.5; incident doc references local classifier checkout but no live Ollama Cloud tag confirmation.
qwen3.5 is the exact tag available on Ollama Cloud and that it supports the required structured output for all Honcho roles.
low: Smoke script filename still references Gemma
scripts/honcho/ollama-gemma-compat-smoke.py: filename unchanged while content now defaults to Qwen; runbooks still reference this filename.
ollama-qwen-compat-smoke.py after verifying no remote automation hardcodes the old name.
redteam/kimi-k2.6:cloud
Status: ok
Verdict: NOT_OK
high: Compatibility smoke falls back to plain prompt, masking JSON structured-output failures
scripts/honcho/ollama-gemma-compat-smoke.py chat_test only validates a plain string echo (HONCHO_QWEN_OK) and does not enforce JSON schema; control-plane/platformctl/tests/test_honcho_ollama_contract.py test_ollama_smoke_json_test_falls_bac
Policy notes
PLATFORMCTL_PR_SANITY_REDTEAM_MODEL is configured.
Operator-delegated approval via temporary admin lane: PR #699 is repo-source only, all required checks are green, no runtime restart, no Infisical write, no live override edit.