gemini(w5): apply projection partial-failure report tests #127

Closed
opened 2026-05-28 01:20:43 +02:00 by codex · 1 comment
Collaborator

Parent: #2
Agent lane: Gemini 3.5 Flash
Wave: 5 / Swarmheart / Leviathan
Risk class: medium

Goal

Make controlled projection apply recoverable when one task fails.

Context refs

  • docs/leviathan-projection-contract.md
  • _bmad-output/project-context.md

Scope

  • Find the result shape of projection apply.
  • Add a fixture where one task succeeds and one fails safely.
  • Assert that the output has partial results, failed task refs, and resume/idempotency guidance.
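
The fixture described in the scope could look roughly like the sketch below. Every name here (`TaskOutcome`, `ApplyReport`, `applyAll`, the task refs) is hypothetical and chosen for illustration only; the real apply result shape still has to be located in the codebase, and the real tests would run under the project's test runner rather than as a standalone script.

```typescript
// Hypothetical shapes for illustration; not the project's actual result type.
type TaskOutcome =
  | { taskRef: string; status: "applied" }
  | { taskRef: string; status: "failed"; reason: string };

interface ApplyReport {
  results: TaskOutcome[]; // one entry per task, successes included
  failedTaskRefs: string[]; // refs a caller would need to retry
  resumeGuidance: string | null; // null when nothing failed
}

type Task = { ref: string; run: () => void };

// Runs every task and records failures instead of aborting the batch.
function applyAll(tasks: Task[]): ApplyReport {
  const results = tasks.map((task): TaskOutcome => {
    try {
      task.run();
      return { taskRef: task.ref, status: "applied" };
    } catch (err) {
      const reason = err instanceof Error ? err.message : String(err);
      return { taskRef: task.ref, status: "failed", reason };
    }
  });
  const failedTaskRefs = results
    .filter((r) => r.status === "failed")
    .map((r) => r.taskRef);
  const resumeGuidance =
    failedTaskRefs.length > 0
      ? `Retry only: ${failedTaskRefs.join(", ")}. Applied tasks can be skipped.`
      : null;
  return { results, failedTaskRefs, resumeGuidance };
}

// Fixture: one task succeeds, one fails safely with a simulated error.
const report = applyAll([
  { ref: "task-ok", run: () => {} },
  {
    ref: "task-bad",
    run: () => {
      throw new Error("simulated failure");
    },
  },
]);
```

Recording a failure as data rather than letting the exception escape is what keeps the successful write visible in the report.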

Acceptance

  • Partial failure does not hide successful writes.
  • Batch audit summary records result without secrets.
  • No production apply feature flag changes.
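
One way to satisfy the "without secrets" criterion is to build the audit summary from an allowlist of fields, so anything not explicitly copied never reaches the record. The sketch below assumes a hypothetical `RawTaskResult` with an `authToken` field standing in for sensitive data; neither the field names nor `toAuditSummary` come from the project.

```typescript
// Hypothetical task result that may carry sensitive request details.
interface RawTaskResult {
  taskRef: string;
  status: "applied" | "failed";
  reason?: string;
  authToken?: string; // assumed sensitive field, for illustration only
}

interface AuditEntry {
  taskRef: string;
  status: "applied" | "failed";
  reason?: string;
}

interface AuditSummary {
  applied: number;
  failed: number;
  entries: AuditEntry[];
}

// Copies only allowlisted fields; everything else is dropped by construction.
function toAuditSummary(results: RawTaskResult[]): AuditSummary {
  const entries = results.map(({ taskRef, status, reason }): AuditEntry =>
    reason === undefined ? { taskRef, status } : { taskRef, status, reason },
  );
  return {
    applied: entries.filter((e) => e.status === "applied").length,
    failed: entries.filter((e) => e.status === "failed").length,
    entries,
  };
}

const summary = toAuditSummary([
  { taskRef: "task-ok", status: "applied", authToken: "s3cret" },
  { taskRef: "task-bad", status: "failed", reason: "simulated failure" },
]);
```

A test can then serialize the summary and assert that no known secret value appears in it, alongside the applied and failed counts.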

Suggested checks

  • Run projection apply tests in local/dev only.

Non-goals / fences

  • Do not deploy, restart production, rotate secrets, or run production migrations.
  • Do not widen MCP write authority or public exposure.
  • Keep the change small enough for one focused PR or one scouting report.

Expected output

A short PR or issue comment with findings, touched files, tests run, and remaining risks.

Author
Collaborator

Codex PR opened: #150 (https://git.pdurlej.com/pdurlej/kan-ductor/pulls/150)

Scope:

  • added task-level idempotencyKey and resumeHint to Leviathan apply results,
  • centralized apply result formatting,
  • kept old batch audit replay compatibility by making the new fields optional in the schema,
  • added integration coverage for one applied task plus one failed move caused by proposal_required,
  • asserted the successful write remains applied, failed card remains unmoved, and batch audit stores summary/results/recovery fields.
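
The replay-compatibility point can be illustrated with a minimal sketch. Only the field names `idempotencyKey` and `resumeHint` come from the PR summary; the surrounding `ApplyTaskResult` shape, the hand-written type guard (standing in for whatever schema library the project uses), and `formatResult` are assumptions, not the code in PR #150.

```typescript
// Hypothetical result shape; only idempotencyKey and resumeHint are named in the PR summary.
interface ApplyTaskResult {
  taskRef: string;
  status: "applied" | "failed";
  idempotencyKey?: string; // optional so older audit records still validate
  resumeHint?: string;
}

// Stand-in for the real schema: new fields may be absent, but must be strings if present.
function isApplyTaskResult(value: unknown): value is ApplyTaskResult {
  if (typeof value !== "object" || value === null) return false;
  const v = value as Record<string, unknown>;
  const optionalString = (x: unknown): boolean =>
    x === undefined || typeof x === "string";
  return (
    typeof v["taskRef"] === "string" &&
    (v["status"] === "applied" || v["status"] === "failed") &&
    optionalString(v["idempotencyKey"]) &&
    optionalString(v["resumeHint"])
  );
}

// Single place that renders a result line, so every caller formats results the same way.
function formatResult(r: ApplyTaskResult): string {
  const hint = r.resumeHint ? ` (resume: ${r.resumeHint})` : "";
  return `${r.taskRef}: ${r.status}${hint}`;
}

// An audit record written before the new fields existed, and one written after.
const legacyRecord: unknown = { taskRef: "card-1", status: "applied" };
const currentRecord: unknown = {
  taskRef: "card-2",
  status: "failed",
  idempotencyKey: "batch-1:card-2",
  resumeHint: "retry after the proposal is approved",
};
```

Because both fields are optional, `legacyRecord` passes the same check as `currentRecord`, which is the property that keeps old batch audits replayable.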

Checks:

  • pnpm --filter @kan/api exec vitest integration-tests/agent.integration.test.ts --run -t "reports partial Leviathan apply failures" — passed.
  • pnpm --filter @kan/api exec vitest integration-tests/agent.integration.test.ts --run — passed, 16 tests.
  • pnpm --filter @kan/api typecheck — passed.
  • pnpm exec prettier --check packages/api/src/routers/agent.ts packages/api/integration-tests/agent.integration.test.ts docs/leviathan-projection-contract.md — passed.

No production deploy, restart, DB migration, token/Infisical change, or projection apply enablement.

Reference
pdurlej/kan-ductor#127