review-bot/docs/dev-loop-spec.md

# Dev-Loop Dispatch Spec

**Version:** 1.0
**Status:** Implemented
**Implements:** Issue #148

This document is the authoritative spec for the review-bot dev-loop dispatch architecture.
The dispatch script (`~/.openclaw/workspace/scripts/dev-loop-dispatch.sh`) and its tests
are validated against the rules and invariants in this document.

---

## 1. Overview

The dev-loop is a 15-minute cron that advances the state of open pull requests and picks up
new issues when there is nothing in review. It is designed for **zero human intervention**
in the normal flow and **hard stops at key safety boundaries**.

### Architecture

```
Cron (15-min cadence)
  → exec: bash dev-loop-dispatch.sh <project>
  → read stdout for SPAWN/HANDOFF lines
  → if SPAWN: load worker template, spawn subagent
  → if HANDOFF: log, do nothing else
  → if neither: NO_REPLY
```

The cron model has **no ambient knowledge** of the project state. All state is derived
from the dispatch script's output, which in turn comes from live API calls.

---

## 2. Inputs

### Project Config

```yaml
# memory/projects/<project>.yaml
repo: rodin/review-bot         # <owner>/<repo>
api_base: https://gitea.../v1  # API base URL
token_path: ~/.openclaw/...    # path to bearer token
user: rodin                    # bot Gitea username
labels:
  wip: <id>
  ready: <id>
review_bots:                   # sentinel names in review bodies
  - sonnet
  - gpt
  - security
```

### Script Arguments

```bash
bash dev-loop-dispatch.sh <project>   # normal run
DRY_RUN=1 bash dev-loop-dispatch.sh <project>   # dry-run (no mutations)
```

---

## 3. State

The dispatch script is **stateless per run**. All state lives in the Gitea API:

| State | API location |
|-------|-------------|
| Open PRs | `GET /repos/:repo/pulls?state=open` |
| PR labels | `GET /repos/:repo/issues/:n/labels` |
| PR reviews | `GET /repos/:repo/pulls/:n/reviews` |
| CI status | `GET /repos/:repo/commits/:sha/status` |
| Issue comments | `GET /repos/:repo/issues/:n/comments` |
| Inline diff comments | `GET /repos/:repo/pulls/:n/comments` |
| Issue timeline | `GET /repos/:repo/issues/:n/timeline` |

No file-based state. No cron-to-cron carry-over.

---

## 4. Output Protocol

The script emits structured lines to stdout. Stderr is diagnostic logging.

### `SPAWN:<type>:<number>:<sha>`

A worker is needed. The cron model reads this and spawns a subagent using the
template at `worker-tasks/<type>.md`.

| Field | Description |
|-------|-------------|
| `type` | Worker type: `self-review`, `ci-fix`, `address-feedback`, `findings`, `rebase`, `impl` |
| `number` | PR number (or issue number for `impl`) |
| `sha` | HEAD SHA of the PR (empty for `impl`) |

At most **one SPAWN** is emitted per script run.

### `HANDOFF:<pr_num>`

All checks passed for `pr_num`. The script applied the `ready` label and assigned
to the human reviewer. The cron model logs this and takes no further action.

Multiple HANDOFFs may be emitted in one run (one per qualifying PR).

---

## 5. Dispatch Rules

Rules are evaluated **in order** for each open PR. The first matching condition wins.
Only one SPAWN is emitted per full pass.

### Rule 0: WIP Cleanup

For each open PR with a `wip` label:

1. Find the timestamp when the label was most recently applied (via timeline events)
2. If age > 1hr: **remove the label** (stale lock — worker likely crashed)
3. If age ≤ 1hr: **set ACTIVE_WIP=1** (do not exit, only gates Rule 10)

### Rule 2: REQUEST_CHANGES Blocks

**ALWAYS evaluated before any other per-PR rule.**

For each reviewer, take their **latest** review state. If any reviewer's latest
state is `REQUEST_CHANGES`:

→ Acquire WIP label on this PR
→ Emit `SPAWN:findings:<pr_num>:<head_sha>`
→ Continue to next PR (but only one SPAWN total)

This rule cannot be bypassed by any condition. There is no waiver mechanism.

### Rule 3: Merge Conflicts

If `mergeable == false`:

→ Acquire WIP
→ Emit `SPAWN:rebase:<pr_num>:<head_sha>`

### Rule 4: CI Failure

If CI state is `failure` or `error`:

- If a fix plan comment exists for this HEAD SHA: **skip** (worker in progress)
- Otherwise:

→ Acquire WIP
→ Emit `SPAWN:ci-fix:<pr_num>:<head_sha>`

### Rule 5: Bot Reviews Missing

For each configured `review_bot`, check whether a review body contains the
sentinel `<!-- review-bot:<name> -->`.

If any sentinel is missing: **wait** (continue to next PR, no SPAWN).

### Rule 6: CI Pending/Unknown

If CI state is `pending` or `unknown`: **wait**.

### Rule 7: Self-Review

Check for a self-review comment from the bot user against the current HEAD SHA:
- Comment contains `Self-review against <head_sha>`

Sub-cases:
- **Missing**: No self-review comment →
  → Acquire WIP, emit `SPAWN:self-review:<pr_num>:<head_sha>`
- **Needs attention** (`Assessment: ⚠️`): Found, but has findings:
  - Fix plan exists for HEAD SHA: skip
  - No fix plan: → Acquire WIP, emit `SPAWN:sr-fix:<pr_num>:<head_sha>`
- **Clean** (`Assessment: ✅ Clean`): Continue to Rule 8

### Rule 8: Unacknowledged Bot Review Findings

For each **current** (contains `Evaluated against <head_short>`) APPROVED bot review
that has a findings table:

A finding is **unacknowledged** if it does not appear as `Finding #N` in a fix plan
comment from the bot user for this HEAD SHA.

If any unacknowledged findings exist:
- Fix plan exists: skip
- No fix plan: → Acquire WIP, emit `SPAWN:address-feedback:<pr_num>:<head_sha>`

### Rule 9: Unresolved Inline Diff Comments

An inline diff comment is **unresolved** if:
1. `in_reply_to_id` is null (top-level comment)
2. `resolver` is null (not formally resolved)
3. No other comment has `in_reply_to_id` pointing to this comment (no reply)

If unresolved comments exist:
- Fix plan exists: skip
- No fix plan: → Acquire WIP, emit `SPAWN:address-feedback:<pr_num>:<head_sha>`

### Rule 10: Handoff

All rules above passed. Verify all bot reviews are current (contain `Evaluated against <head_short>`).

If all current:
- Apply `ready` label
- Assign to `aweiker`
- Emit `HANDOFF:<pr_num>`
- Continue evaluating remaining PRs (do NOT exit)

If already assigned to `aweiker`: skip (assume handoff was already performed; continue to next PR without emitting another HANDOFF).

### Rule 11: New Issue Pickup

Only runs if: no open PRs exist AND `ACTIVE_WIP == 0`.

Fetch open, unassigned issues. Priority: bugs first, then by number ascending.

Claim the issue (assign to bot user to prevent double-pick), then:
→ Emit `SPAWN:impl:<issue_num>:`

---

## 6. Safety Invariants

These are statically checked by `~/.openclaw/workspace/scripts/test/check-invariants.sh` and enforced in all changes:

| ID | Invariant |
|----|-----------|
| S1 | Zero merge API calls in dispatch script (`/merge` does not appear) |
| S2 | REQUEST_CHANGES check (Rule 2) appears before CI check (Rule 4) |
| S3 | REQUEST_CHANGES check (Rule 2) appears before ready label application (Rule 10) |
| S4 | No model/AI API references in dispatch script |
| S5 | `set -euo pipefail` present |
| S6 | Active WIP does not cause early exit (only sets ACTIVE_WIP flag) |
| S7 | SPAWN:impl guarded by `ACTIVE_WIP == 0` check |
| S8 | No merge calls in any worker template |
| S9 | Zero close-PR API calls in dispatch script (`state=closed` does not appear) |
| S10 | No close-PR API calls in any worker template; every worker template contains `NEVER close a PR` |

---

## 7. Error Handling

| Error | Behavior |
|-------|----------|
| `curl` returns error | `set -euo pipefail` aborts script — no partial actions |
| `jq` parse error | Script aborts |
| Worker crashes | WIP label left on PR; stale WIP cleanup (Rule 0) removes it after 1hr |
| Race: two crons fire | WIP mutex prevents double-dispatch for same PR |
| `sessions_spawn` fails | Worker not spawned; WIP label orphaned → cleaned in 1hr |
| Config file missing | Exit code 2 with error message |

---

## 8. Worker Templates

Each worker receives a precise task description with substituted values:

| Template | Trigger | Key job |
|----------|---------|---------|
| `self-review.md` | No clean self-review | Post self-review comment, remove WIP |
| `sr-fix.md` | Self-review needs attention | Address self-review findings, push, remove WIP |
| `ci-fix.md` | CI failing | Diagnose, fix, push, remove WIP |
| `address-feedback.md` | Unacknowledged findings or inline comments | Address feedback, push, remove WIP |
| `findings.md` | REQUEST_CHANGES present | Address REQUEST_CHANGES, push, remove WIP |
| `rebase.md` | Merge conflicts | Rebase on main, push, remove WIP |
| `impl.md` | New issue | Implement feature/fix, open PR |

Workers **always** remove the WIP label on completion and reply `NO_REPLY`.

### Worker Absolute Constraints

Every worker template begins with an `⛔ ABSOLUTE CONSTRAINTS` section containing these rules:

- **NEVER close a PR.** Never call `PATCH /pulls/{id}` with `state=closed`. Closing a PR requires human action. "Duplicate", "superseded", or "already done" are never a worker's call.
- **NEVER merge a PR.** Never call the merge API. Merging requires human approval.
- **NEVER use the gitea-aweiker token.** All API calls use the gitea-rodin token only.
- **NEVER act on a PR with active REQUEST_CHANGES.** Fix the findings first.

The first two constraints are statically enforced by `check-invariants.sh`: S1 and S9 cover the dispatch script (no merge, no close); S8 covers worker templates (no merge calls); S10 covers worker templates (no close calls, with NEVER-close text verified present in each). The remaining two constraints (token usage and REQUEST_CHANGES gate) are enforced by runtime logic.

---

## 9. Fixes for Issues #144, #145, and #157

**Issue #144** (autonomous merge):
The dispatch script contains no merge API calls anywhere. The `~/.openclaw/workspace/scripts/test/check-invariants.sh`
invariant `S1` verifies this. Workers do not receive merge instructions.

**Issue #145** (merged despite REQUEST_CHANGES):
Rule 2 is the **first** rule evaluated per PR. It cannot be skipped, reasoned past,
or bypassed. It is checked before CI, before self-review, before handoff. The check
uses latest-per-reviewer state, so a reviewer who re-approved after REQUEST_CHANGES
is correctly handled.

**Issue #157** (autonomous PR close):
Worker templates were missing an explicit constraint against closing PRs. The dispatch
script never had a close call, but workers could reason their way into calling
`PATCH /pulls/{id}` with `state=closed`. All worker templates now include
`NEVER close a PR` in their ABSOLUTE CONSTRAINTS section. Invariant S9 verifies
the dispatch script contains no close calls. Invariant S10 verifies
worker templates contain no close calls and each contains the NEVER-close text.

Regression tests in `dispatch.bats` statically verify all of these constraints.