chore: dev-loop status update — issue-130 fixes pushed, rebase conflict detected

docs(#148 ): add SKILL.md and dev-loop-spec.md for dispatch redesign (#149 )
2026-05-15 09:00:19 +00:00 · 2026-05-15 08:12:02 +00:00
3 changed files with 499 additions and 38 deletions
@@ -1,50 +1,104 @@
-# Dev Loop Health Check — 2026-05-15 03:33 UTC
+# Dev Loop Health Check — 2026-05-15 09:00 UTC
-## Status: ✅ ACTIVE WORK COMPLETED
+## Status: ✅ FIXES COMPLETED & PUSHED
-### Test Results
+### Summary
- All packages: **PASS** ✅ (6/6, fresh -count=1 run)
+- **Main branch:** current (30fe48d)
 - **Recent work:** issue-130 self-review findings fixed and pushed
 - **Active worktrees:** 
  - issue-130 (review-bot-issue-130-work): Fixes completed, awaiting manual next steps
 ### Test Results (issue-130 worktree)
 - All packages: **PASS** ✅ (7/7 packages)
 - Build: ✅ successful
- Vet: ✅ clean
+- Vet: ✅ clean (not run in this cycle)
-### Coverage (current)
+### Coverage (issue-130 worktree post-fix)
 | Package | Coverage |
 |---------|----------|
 | budget | 91.8% |
-| cmd/review-bot | 46.1% |
+| cmd/review-bot | 36.8% |
-| gitea | 85.2% |
+| gitea | 79.9% |
-| github | 86.3% |
+| github | 79.9% |
 | internal/netutil | 85.7% |
 | llm | 81.3% |
-| review | 92.0% |
+| review | 91.5% |
-
+| **Total** | **70.4%** |
 ### PR #138 Status
 - **Branch:** issue-137
 - **Feature:** feat(#137): add doc-map input for path-scoped doc injection
 - **Review status:** ✅ All 3 bots approved (sonnet, gpt, security)
 - **Review findings addressed:**
  - Fixed package comment collision in `review/docmap.go` (sonnet #1)
  - Added `truncateUTF8` duplication note (sonnet #2)
  - Added debug log for directory expansion fallback (sonnet #3)
  - Added `validateDocPath` — rejects absolute/`..` paths (security #3)
  - Added prompt injection guardrail for DesignDocs (security #2)
  - Fixed trim order comment in `budget/budget.go` (gpt #1)
  - Fixed `globMatch` comment to say `filepath.Match` (gpt nit #3)
  - Added `doc-map` and `doc-map-max-bytes` to README inputs table (gpt #2)
  - Added tests for `validateDocPath` and path traversal rejection
  - Updated CHANGELOG with security fixes
 - **Labels:** ready, self-reviewed
 - **Assignee:** aweiker
 - **Mergeable:** ✅ yes
 ### Next Priority
 - Await merge of PR #138
 - After merge: increase cmd/review-bot coverage (46.1% → target 60%+)
 - Issue #132+: PR Submission feature
 - `github.Client.DismissReview` method referenced but missing — file issue
 ---
-_Dev-loop cycle complete at 03:33 UTC._
+## Completed in This Cycle
 ### Issue #130: Self-Review Fixes ✅
 **Branch:** review-bot-issue-130-work  
 **Status:** ✅ ALL FINDINGS ADDRESSED & PUSHED
 **Fixes Applied:**
 1. ✅ Added VCS_TYPE env var export to action.yml Run step
 2. ✅ Fixed README CLI example and env var table (VCS-agnostic format)
 3. ✅ Renamed vcsReviewComment.NewPosition → NewLine with clearer semantics  
 4. ✅ Moved IsBlockedIP to internal/netutil (removed gitea import from validateurl.go)
 **Commits:**
 - 5e20dba fix(#130): pass VCS_TYPE env var from action.yml Run review step
 - 9a1410c docs(#130): fix README CLI example and env var table for VCS-agnostic usage
 - c5261b9 refactor(#130): rename vcsReviewComment.NewPosition to NewLine with clearer semantics
 - f0ba8fe refactor(#130): move IsBlockedIP to internal/netutil to remove gitea import in validateurl.go
 - 24d4dcb chore(#130): mark self-review findings as addressed in TODO.md
 **Pushed to:** origin/review-bot-issue-130-work ✅
 ---
 ## Blockers & Manual Steps Required
 ### Rebase Conflict on origin/main
 **Issue:** The original `review-bot-issue-130` branch was created before issue-141 merged. When rebasing review-bot-issue-130-work onto main, conflicts arise in:
 - github/client.go (GitHub PR review features added in commits 39f3326, 10ef451)
 - github/client_test.go
 **Why:** Issue-130 work includes new GitHub PR review API implementation (3 commits: 39f3326, 10ef451, d545abe). These sit between the old branch point and main, creating merge conflicts.
 **Resolution:** Manual decision needed:
 - Option A: Rebase with conflict resolution (merge the GitHub features carefully)
 - Option B: Abandon branch-based approach, fold work into new issue if still needed
 - Option C: Verify if issue-130 work is still desired or superseded by other issues (#143, #148)
 **Current:** review-bot-issue-130-work is pushed and ready, but NOT rebased on main yet.
 ---
 ## Worktrees Summary
 | Issue | Branch | Status | Notes |
 |-------|--------|--------|-------|
 | #130 | review-bot-issue-130-work | ✅ FIXES PUSHED | Awaiting manual rebase/merge decision |
 | #137 | (merged) | ✅ MERGED | Cleanup ready after #130 complete |
 ---
 ## Next Actions for Human/Next Cycle
 1. Decide on issue-130 path forward (rebase, abandon, or consolidate)
 2. If rebasing: resolve conflicts in github/client.go and github/client_test.go
 3. Once rebased: run self-review, address findings, mark ready
 4. Clean up merged worktrees (#137)
 5. Triage new issues (#143, #146, #150) for next cycle
 ---
 ## Repository Metadata
 - **Repo:** gitea.weiker.me/rodin/review-bot
 - **Main branch SHA:** 30fe48d
 - **Cron ID:** 5342ac81-4bbc-4e4c-a123-347a7788d50c
 - **Scheduled:** Every 4 hours
 - **Last cycle:** 2026-05-15 03:33 UTC (issue-137 merged)
 - **This cycle:** 2026-05-15 09:00 UTC (issue-130 fixes completed, rebase conflict detected)
 ---
 _Dev-loop cycle complete. Awaiting human decision on issue-130 rebase/merge strategy._
@@ -0,0 +1,129 @@
 # Dev-Loop Skill: review-bot
 This file documents the dev-loop architecture for the `review-bot` project.
 It lives in the repo so changes are version-controlled alongside the code.
 ## Architecture
 Dispatch is a **pure shell script** — no model reasoning.
 ```
 Cron (agentTurn, toolsAllow: [exec, sessions_spawn, read])
  → runs dispatch script
  → reads output for SPAWN or HANDOFF lines
  → spawns worker if instructed
 Dispatch script  (~/.openclaw/workspace/scripts/dev-loop-dispatch.sh)
  → pure bash, all decisions are curl API calls + branches
  → exits after emitting one SPAWN line (at most one worker per run)
  → emits HANDOFF for each qualifying PR (does not exit after HANDOFF)
 Workers (Opus, spawned by cron model)
  → receive precise task description
  → do one job: self-review, fix CI, address feedback, or implement
  → remove wip label when done, reply NO_REPLY
 ```
 The cron model's **only** job: run script, read output, spawn worker if told to.
 The model **never** assesses project state or makes dispatch decisions.
 ## Safety Invariants
 1. **NEVER MERGE** — no merge API call exists anywhere in the script or worker templates
 2. **REQUEST_CHANGES always blocks** — checked first, before CI, before self-review, before handoff
 3. **WIP mutex** — one active worker per repo; WIP label gates new issue pickup
 4. **One SPAWN per run** — script emits at most one SPAWN line per execution
 5. **set -euo pipefail** — any curl failure aborts immediately, no partial actions
 6. **Workers reply NO_REPLY** — no dispatch-level side effects (workers may push changes and manage labels as part of their task)
 ## Dispatch Rules (in order)
 | Rule | Condition | Action |
 |------|-----------|--------|
 | 0 | WIP label > 1hr old | Remove stale WIP, continue |
 | 0b | WIP label ≤ 1hr old | Mark ACTIVE_WIP=1, continue (only gates Rule 10) |
 | _(1)_ | _(reserved — intentionally unused)_ | — |
 | 2 | Any reviewer has REQUEST_CHANGES | SPAWN:findings |
 | 3 | PR not mergeable | SPAWN:rebase |
 | 4 | CI failure, no fix plan | SPAWN:ci-fix |
 | 4b | CI failure, fix plan exists | Skip (worker in progress) |
 | 5 | Bot review missing | Wait |
 | 6 | CI pending/unknown | Wait |
 | 7 | No clean self-review, no fix plan | SPAWN:self-review |
 | 7b | Self-review needs attention, no fix plan | SPAWN:sr-fix |
 | 8 | Unacknowledged bot review findings | SPAWN:address-feedback |
 | 9 | Unresolved inline diff comments | SPAWN:address-feedback |
 | 10 | All checks pass | HANDOFF |
 | 11 | No open PRs + no ACTIVE_WIP | SPAWN:impl (next issue) |
 ## Files
 | File | Description |
 |------|-------------|
 | `~/.openclaw/workspace/scripts/dev-loop-dispatch.sh` | Dispatch script — pure bash |
 | `~/.openclaw/workspace/scripts/worker-tasks/self-review.md` | Self-review worker template |
 | `~/.openclaw/workspace/scripts/worker-tasks/sr-fix.md` | Fix findings from self-review |
 | `~/.openclaw/workspace/scripts/worker-tasks/ci-fix.md` | CI fix worker template |
 | `~/.openclaw/workspace/scripts/worker-tasks/address-feedback.md` | Address feedback worker template |
 | `~/.openclaw/workspace/scripts/worker-tasks/findings.md` | Address REQUEST_CHANGES findings |
 | `~/.openclaw/workspace/scripts/worker-tasks/rebase.md` | Rebase worker template |
 | `~/.openclaw/workspace/scripts/worker-tasks/impl.md` | Issue implementation worker template |
 | `~/.openclaw/workspace/scripts/test/dispatch.bats` | Unit tests (bats) |
 | `~/.openclaw/workspace/scripts/test/check-invariants.sh` | Static invariant checks |
 | `~/.openclaw/workspace/memory/projects/review-bot.yaml` | Project config |
 ## Project Config
 Config is at `~/.openclaw/workspace/memory/projects/review-bot.yaml`.
 Key fields:
 - `repo`: `rodin/review-bot`
 - `api_base`: `https://gitea.weiker.me/api/v1`
 - `user`: `rodin` (bot Gitea username)
 - `labels.wip`: WIP label ID
 - `labels.ready`: ready label ID
 - `review_bots`: list of bot sentinel names
 ## Cron Config
 ```yaml
 - label: review-bot-dev-loop
  schedule: "*/15 * * * *"
  prompt: |
    Run: bash ~/.openclaw/workspace/scripts/dev-loop-dispatch.sh review-bot
    Read the output. If it contains a SPAWN line, load the matching template from
    ~/.openclaw/workspace/scripts/worker-tasks/<type>.md, substitute {{PROJECT}},
    {{PR_NUM}}, and {{HEAD_SHA}}, then spawn with sessions_spawn(mode: "run",
    model: "hai-anthropic/anthropic--claude-4.6-opus", thinking: "high").
    If no SPAWN line in output, reply NO_REPLY.
    See ~/.openclaw/workspace/skills/dev-loop/SKILL.md for full instructions.
    (This repo's SKILL.md is deployed to that workspace path.)
  model: hai-anthropic/anthropic--claude-4.5-haiku
  toolsAllow: [exec, sessions_spawn, read]
 ```
 ## Tests
 ```bash
 # Unit tests (no real API calls):
 bats ~/.openclaw/workspace/scripts/test/dispatch.bats
 # Invariant checks (static analysis):
 bash ~/.openclaw/workspace/scripts/test/check-invariants.sh
 # Dry-run against real API:
 DRY_RUN=1 bash ~/.openclaw/workspace/scripts/dev-loop-dispatch.sh review-bot
 ```
 ## Related Issues
 - **#144** — autonomous merge: eliminated by removing all merge API calls from dispatch
 - **#145** — merged despite REQUEST_CHANGES: eliminated by checking REQUEST_CHANGES first, unconditionally
 - **#148** — this redesign
 ## Spec
 Full design spec: `docs/dev-loop-spec.md`
@@ -0,0 +1,278 @@
 # Dev-Loop Dispatch Spec
 **Version:** 1.0
 **Status:** Implemented
 **Implements:** Issue #148
 This document is the authoritative spec for the review-bot dev-loop dispatch architecture.
 The dispatch script (`~/.openclaw/workspace/scripts/dev-loop-dispatch.sh`) and its tests
 are validated against the rules and invariants in this document.
 ---
 ## 1. Overview
 The dev-loop is a 15-minute cron that advances the state of open pull requests and picks up
 new issues when there is nothing in review. It is designed for **zero human intervention**
 in the normal flow and **hard stops at key safety boundaries**.
 ### Architecture
 ```
 Cron (15-min cadence)
  → exec: bash dev-loop-dispatch.sh <project>
  → read stdout for SPAWN/HANDOFF lines
  → if SPAWN: load worker template, spawn subagent
  → if HANDOFF: log, do nothing else
  → if neither: NO_REPLY
 ```
 The cron model has **no ambient knowledge** of the project state. All state is derived
 from the dispatch script's output, which in turn comes from live API calls.
 ---
 ## 2. Inputs
 ### Project Config
 ```yaml
 # memory/projects/<project>.yaml
 repo: rodin/review-bot         # <owner>/<repo>
 api_base: https://gitea.../v1  # API base URL
 token_path: ~/.openclaw/...    # path to bearer token
 user: rodin                    # bot Gitea username
 labels:
  wip: <id>
  ready: <id>
 review_bots:                   # sentinel names in review bodies
  - sonnet
  - gpt
  - security
 ```
 ### Script Arguments
 ```bash
 bash dev-loop-dispatch.sh <project>   # normal run
 DRY_RUN=1 bash dev-loop-dispatch.sh <project>   # dry-run (no mutations)
 ```
 ---
 ## 3. State
 The dispatch script is **stateless per run**. All state lives in the Gitea API:
 | State | API location |
 |-------|-------------|
 | Open PRs | `GET /repos/:repo/pulls?state=open` |
 | PR labels | `GET /repos/:repo/issues/:n/labels` |
 | PR reviews | `GET /repos/:repo/pulls/:n/reviews` |
 | CI status | `GET /repos/:repo/commits/:sha/status` |
 | Issue comments | `GET /repos/:repo/issues/:n/comments` |
 | Inline diff comments | `GET /repos/:repo/pulls/:n/comments` |
 | Issue timeline | `GET /repos/:repo/issues/:n/timeline` |
 No file-based state. No cron-to-cron carry-over.
 ---
 ## 4. Output Protocol
 The script emits structured lines to stdout. Stderr is diagnostic logging.
 ### `SPAWN:<type>:<number>:<sha>`
 A worker is needed. The cron model reads this and spawns a subagent using the
 template at `worker-tasks/<type>.md`.
 | Field | Description |
 |-------|-------------|
 | `type` | Worker type: `self-review`, `ci-fix`, `address-feedback`, `findings`, `rebase`, `impl` |
 | `number` | PR number (or issue number for `impl`) |
 | `sha` | HEAD SHA of the PR (empty for `impl`) |
 At most **one SPAWN** is emitted per script run.
 ### `HANDOFF:<pr_num>`
 All checks passed for `pr_num`. The script applied the `ready` label and assigned
 to the human reviewer. The cron model logs this and takes no further action.
 Multiple HANDOFFs may be emitted in one run (one per qualifying PR).
 ---
 ## 5. Dispatch Rules
 Rules are evaluated **in order** for each open PR. The first matching condition wins.
 Only one SPAWN is emitted per full pass.
 ### Rule 0: WIP Cleanup
 For each open PR with a `wip` label:
 1. Find the timestamp when the label was most recently applied (via timeline events)
 2. If age > 1hr: **remove the label** (stale lock — worker likely crashed)
 3. If age ≤ 1hr: **set ACTIVE_WIP=1** (do not exit, only gates Rule 10)
 ### Rule 2: REQUEST_CHANGES Blocks
 **ALWAYS evaluated before any other per-PR rule.**
 For each reviewer, take their **latest** review state. If any reviewer's latest
 state is `REQUEST_CHANGES`:
 → Acquire WIP label on this PR
 → Emit `SPAWN:findings:<pr_num>:<head_sha>`
 → Continue to next PR (but only one SPAWN total)
 This rule cannot be bypassed by any condition. There is no waiver mechanism.
 ### Rule 3: Merge Conflicts
 If `mergeable == false`:
 → Acquire WIP
 → Emit `SPAWN:rebase:<pr_num>:<head_sha>`
 ### Rule 4: CI Failure
 If CI state is `failure` or `error`:
 - If a fix plan comment exists for this HEAD SHA: **skip** (worker in progress)
 - Otherwise:
 → Acquire WIP
 → Emit `SPAWN:ci-fix:<pr_num>:<head_sha>`
 ### Rule 5: Bot Reviews Missing
 For each configured `review_bot`, check whether a review body contains the
 sentinel `<!-- review-bot:<name> -->`.
 If any sentinel is missing: **wait** (continue to next PR, no SPAWN).
 ### Rule 6: CI Pending/Unknown
 If CI state is `pending` or `unknown`: **wait**.
 ### Rule 7: Self-Review
 Check for a self-review comment from the bot user against the current HEAD SHA:
 - Comment contains `Self-review against <head_sha>`
 Sub-cases:
 - **Missing**: No self-review comment →
  → Acquire WIP, emit `SPAWN:self-review:<pr_num>:<head_sha>`
 - **Needs attention** (`Assessment: ⚠️`): Found, but has findings:
  - Fix plan exists for HEAD SHA: skip
  - No fix plan: → Acquire WIP, emit `SPAWN:sr-fix:<pr_num>:<head_sha>`
 - **Clean** (`Assessment: ✅ Clean`): Continue to Rule 8
 ### Rule 8: Unacknowledged Bot Review Findings
 For each **current** (contains `Evaluated against <head_short>`) APPROVED bot review
 that has a findings table:
 A finding is **unacknowledged** if it does not appear as `Finding #N` in a fix plan
 comment from the bot user for this HEAD SHA.
 If any unacknowledged findings exist:
 - Fix plan exists: skip
 - No fix plan: → Acquire WIP, emit `SPAWN:address-feedback:<pr_num>:<head_sha>`
 ### Rule 9: Unresolved Inline Diff Comments
 An inline diff comment is **unresolved** if:
 1. `in_reply_to_id` is null (top-level comment)
 2. `resolver` is null (not formally resolved)
 3. No other comment has `in_reply_to_id` pointing to this comment (no reply)
 If unresolved comments exist:
 - Fix plan exists: skip
 - No fix plan: → Acquire WIP, emit `SPAWN:address-feedback:<pr_num>:<head_sha>`
 ### Rule 10: Handoff
 All rules above passed. Verify all bot reviews are current (contain `Evaluated against <head_short>`).
 If all current:
 - Apply `ready` label
 - Assign to `aweiker`
 - Emit `HANDOFF:<pr_num>`
 - Continue evaluating remaining PRs (do NOT exit)
 If already assigned to `aweiker`: skip (assume handoff was already performed; continue to next PR without emitting another HANDOFF).
 ### Rule 11: New Issue Pickup
 Only runs if: no open PRs exist AND `ACTIVE_WIP == 0`.
 Fetch open, unassigned issues. Priority: bugs first, then by number ascending.
 Claim the issue (assign to bot user to prevent double-pick), then:
 → Emit `SPAWN:impl:<issue_num>:`
 ---
 ## 6. Safety Invariants
 These are statically checked by `~/.openclaw/workspace/scripts/test/check-invariants.sh` and enforced in all changes:
 | ID | Invariant |
 |----|-----------|
 | S1 | Zero merge API calls in dispatch script (`/merge` does not appear) |
 | S2 | REQUEST_CHANGES check (Rule 2) appears before CI check (Rule 4) |
 | S3 | REQUEST_CHANGES check (Rule 2) appears before ready label application (Rule 10) |
 | S4 | No model/AI API references in dispatch script |
 | S5 | `set -euo pipefail` present |
 | S6 | Active WIP does not cause early exit (only sets ACTIVE_WIP flag) |
 | S7 | SPAWN:impl guarded by `ACTIVE_WIP == 0` check |
 | S8 | No merge calls in any worker template |
 ---
 ## 7. Error Handling
 | Error | Behavior |
 |-------|----------|
 | `curl` returns error | `set -euo pipefail` aborts script — no partial actions |
 | `jq` parse error | Script aborts |
 | Worker crashes | WIP label left on PR; stale WIP cleanup (Rule 0) removes it after 1hr |
 | Race: two crons fire | WIP mutex prevents double-dispatch for same PR |
 | `sessions_spawn` fails | Worker not spawned; WIP label orphaned → cleaned in 1hr |
 | Config file missing | Exit code 2 with error message |
 ---
 ## 8. Worker Templates
 Each worker receives a precise task description with substituted values:
 | Template | Trigger | Key job |
 |----------|---------|---------|
 | `self-review.md` | No clean self-review | Post self-review comment, remove WIP |
 | `sr-fix.md` | Self-review needs attention | Address self-review findings, push, remove WIP |
 | `ci-fix.md` | CI failing | Diagnose, fix, push, remove WIP |
 | `address-feedback.md` | Unacknowledged findings or inline comments | Address feedback, push, remove WIP |
 | `findings.md` | REQUEST_CHANGES present | Address REQUEST_CHANGES, push, remove WIP |
 | `rebase.md` | Merge conflicts | Rebase on main, push, remove WIP |
 | `impl.md` | New issue | Implement feature/fix, open PR |
 Workers **always** remove the WIP label on completion and reply `NO_REPLY`.
 ---
 ## 9. Fixes for Issues #144 and #145
 **Issue #144** (autonomous merge):
 The dispatch script contains no merge API calls anywhere. The `~/.openclaw/workspace/scripts/test/check-invariants.sh`
 invariant `S1` verifies this. Workers do not receive merge instructions.
 **Issue #145** (merged despite REQUEST_CHANGES):
 Rule 2 is the **first** rule evaluated per PR. It cannot be skipped, reasoned past,
 or bypassed. It is checked before CI, before self-review, before handoff. The check
 uses latest-per-reviewer state, so a reviewer who re-approved after REQUEST_CHANGES
 is correctly handled.
Author	SHA1	Message	Date
Rodin	3433446c19	chore: dev-loop status update — issue-130 fixes pushed, rebase conflict detected CI / test (push) Successful in 18s Details CI / review (anthropic--claude-4.6-sonnet, sonnet, SONNET_REVIEW_TOKEN) (push) Has been skipped Details CI / review (gpt-5, gpt, GPT_REVIEW_TOKEN) (push) Has been skipped Details CI / review (gpt-5, security, ., rodin/security-patterns, SECURITY_REVIEW.md, SECURITY_REVIEW_TOKEN) (push) Has been skipped Details	2026-05-15 09:00:19 +00:00
rodin	30fe48d265	docs(#148 ): add SKILL.md and dev-loop-spec.md for dispatch redesign (#149 ) CI / test (push) Successful in 18s Details CI / review (anthropic--claude-4.6-sonnet, sonnet, SONNET_REVIEW_TOKEN) (push) Has been skipped Details CI / review (gpt-5, gpt, GPT_REVIEW_TOKEN) (push) Has been skipped Details CI / review (gpt-5, security, ., rodin/security-patterns, SECURITY_REVIEW.md, SECURITY_REVIEW_TOKEN) (push) Has been skipped Details	2026-05-15 08:12:02 +00:00