rodin/review-bot

Fork 0

Files

T

Rodin 9c7538cafa

PR Ready Gate / clear-labels (pull_request) Successful in 1s

Details

CI / test (pull_request) Successful in 12s

Details

CI / review (/anthropic/v1, anthropic--claude-4.6-sonnet, sonnet, anthropic, SONNET_REVIEW_TOKEN) (pull_request) Successful in 34s

Details

CI / review (/openai/v1, gpt-5, security, openai, SECURITY_REVIEW.md, SECURITY_REVIEW_TOKEN) (pull_request) Successful in 1m15s

Details

CI / review (/openai/v1, gpt-5, gpt, openai, GPT_REVIEW_TOKEN) (pull_request) Successful in 1m43s

Details

feat(persona): add role-based review personas

Add persona system for specialized review roles. Each persona defines:
- A specific review focus (security, architecture, documentation)
- Custom system prompt additions
- Personality/tone adjustments

Built-in personas: security, architect, docs
Custom personas: load from JSON via persona-file flag

Includes workspace validation to prevent path traversal attacks.

Closes #51

2026-05-10 08:44:53 -07:00

12 KiB

Raw Blame History

Design: Role-based Review Personas (Issue #51)

Problem

Current review-bot performs generic code review. Every reviewer (regardless of reviewer-name) uses the same base prompt and evaluates the same concerns. This leads to:

Redundancy — Two reviewers (e.g., GPT + Claude twins) often flag identical issues
Gaps — Generic reviewers miss specialized concerns (security, domain logic, architecture)
Noise — NITs about style mixed with critical security findings
No ownership — Findings lack clear domain attribution

Constraints

Must work with existing CLI flags and CI workflow patterns
Must not break backwards compatibility (existing configs still work)
Must integrate cleanly with the budget system (personas add to context)
Multiple personas running in parallel must not interfere with each other
Each persona must have clear scope boundaries (no duplication)

Proposed Approach

1. Persona Definition

A persona is a named review role with:

Identity — Who am I? What's my expertise?
Focus — What do I look for?
Scope boundaries — What do I explicitly NOT comment on?
Severity calibration — What counts as MAJOR/MINOR/NIT for MY domain?

Personas are defined in YAML files that can live:

In the pattern repos (shared across projects)
In the target repo (project-specific personas)
Inline via a new --persona-file flag

2. Persona File Format

# .review/personas/security.yaml
name: security
display_name: Security Specialist
model_preference: opus  # optional hint for expensive analysis

identity: |
  You are a security specialist reviewing code for vulnerabilities.
  Your expertise: OWASP Top 10, injection attacks, auth/authz, secrets management,
  event sourcing security (replay attacks, event injection).

focus:
  - Injection attacks (SQL, command, path traversal, template)
  - Authentication and authorization gaps
  - Secrets exposure (hardcoded credentials, tokens in logs)
  - Input validation (unsanitized input, unsafe deserialization)
  - Race conditions with security implications
  - Event sourcing attack vectors

ignore:
  - Code style and naming conventions
  - Performance (unless security-related)
  - Documentation
  - General code quality
  - Test coverage

severity:
  critical: "Remote code execution, auth bypass, data exfiltration"
  major: "Privilege escalation, information disclosure, DoS"
  minor: "Missing rate limiting, verbose errors"
  nit: "Theoretical risk with low exploitability"

output_format: |
  For each finding:
  - Severity: [CRITICAL|MAJOR|MINOR|NIT]
  - Attack vector: How could this be exploited?
  - Evidence: Code snippet showing the vulnerability
  - Recommendation: Specific fix

3. New CLI Flags

--persona-file PATH      Path to persona YAML file (local or in repo)
--persona NAME           Built-in persona name (security, architect, domain)

Either flag sets the persona. If neither is provided, behavior is unchanged (generic review).

4. Prompt Assembly

Current flow:

SystemBase → Patterns → Conventions → [LLM]

New flow with persona:

PersonaPrompt (from YAML) → Patterns (filtered?) → Conventions → [LLM]

The persona's identity/focus/ignore/severity sections become the system prompt, replacing the generic "You are an expert code reviewer" base.

5. Built-in Personas

Ship with these built-in personas (loadable via --persona NAME):

Name	Focus
`security`	Vulnerabilities, auth, secrets
`architect`	Patterns, consistency, design
`domain`	Business logic (requires repo-specific config)
`docs`	Documentation, API clarity

Built-in personas live in review/personas/ as embedded Go assets or YAML shipped with the binary.

6. CI Workflow Integration

Single persona:

- uses: rodin/review-bot/.gitea/actions/review@v1
  with:
    reviewer-name: security
    persona: security
    ...

Multiple personas (parallel jobs):

jobs:
  review:
    strategy:
      matrix:
        include:
          - name: security
            persona: security
          - name: architect
            persona: architect
    steps:
      - uses: rodin/review-bot/.gitea/actions/review@v1
        with:
          reviewer-name: ${{ matrix.name }}
          persona: ${{ matrix.persona }}

Custom persona from repo:

- uses: rodin/review-bot/.gitea/actions/review@v1
  with:
    reviewer-name: trading
    persona-file: .review/personas/trading.yaml

7. Persona + Patterns Interaction

Some personas benefit from filtered patterns:

Security → only security-related patterns
Architect → all patterns (structural focus)
Domain → domain docs, not language patterns

For v1, keep it simple: all patterns are included regardless of persona. Future enhancement could add patterns_filter to persona YAML.

8. Output Format Changes

Persona name appears in the review header:

# Security Review

## Summary
No critical vulnerabilities found in this change.

## Findings
| # | Severity | File | Line | Finding |
...

## Recommendation
**APPROVE** — No security-relevant issues detected.

---
*Review by security*
<!-- review-bot:security -->

State/Data Model

Persona struct

// review/persona.go
type Persona struct {
    Name          string   `yaml:"name"`
    DisplayName   string   `yaml:"display_name"`
    ModelPref     string   `yaml:"model_preference,omitempty"`
    Identity      string   `yaml:"identity"`
    Focus         []string `yaml:"focus"`
    Ignore        []string `yaml:"ignore"`
    Severity      Severity `yaml:"severity"`
    OutputFormat  string   `yaml:"output_format,omitempty"`
}

type Severity struct {
    Critical string `yaml:"critical"`
    Major    string `yaml:"major"`
    Minor    string `yaml:"minor"`
    Nit      string `yaml:"nit"`
}

Loading precedence

--persona-file PATH → load from local file system
--persona NAME → load from embedded built-ins
Neither → use generic system prompt (current behavior)

Error Cases

Error	Handling
Persona file not found	Fatal exit with clear message
Invalid YAML in persona file	Fatal exit with parse error
Both `--persona` and `--persona-file` specified	Fatal exit: mutually exclusive
Unknown built-in persona name	Fatal exit with list of valid names
Empty identity in persona	Warning, fall back to generic prompt

Edge Cases

Empty focus list: Valid — persona relies on identity alone
Empty ignore list: Valid — no explicit scope exclusions
No severity section: Use default MAJOR/MINOR/NIT definitions
Model preference set but budget insufficient: Ignore preference, log warning
Persona file in pattern repo: Fetch like other pattern files

Testing Strategy

Unit tests

persona_test.go: Parse valid/invalid YAML, validate required fields
prompt_test.go: Verify persona prompt assembly
Integration with budget: persona prompts count toward token limit

Integration tests

End-to-end with --persona security (built-in)
End-to-end with --persona-file custom.yaml
Backwards compatibility: no flags = generic behavior

Manual verification

Run security persona on a PR with obvious vulnerability
Verify security persona ignores style issues
Verify non-security persona doesn't flag security issues

Implementation Phases

Phase 1: Persona types and loading

review/persona.go: Persona struct + YAML parsing
review/persona_test.go: Unit tests
Embed built-in personas in binary
Compiles clean, tests pass

Phase 2: Prompt generation

review/prompt.go: BuildPersonaPrompt(p Persona) string
Modify BuildSystemBase() to accept optional persona
Integrate persona prompt with budget system
Tests for prompt assembly

Phase 3: CLI integration

Add --persona and --persona-file flags
Flag validation (mutually exclusive, valid names)
Load persona based on flags
Pass persona to prompt builder

Phase 4: Action integration

Add persona and persona-file inputs to action.yml
Update README with persona examples
End-to-end CI test

Phase 5: Built-in personas

security.yaml built-in
architect.yaml built-in
docs.yaml built-in
Document each persona's focus

Open Questions

Persona file location in repo: Should we support --persona-file .review/security.yaml where the file is fetched from the PR's repo (like conventions)? This adds complexity but enables project-specific personas without action changes.
Model preference enforcement: If persona specifies model_preference: opus but the action uses a different model, should we warn? Override? Ignore? Current thinking: log warning, use the specified model (user controls model via action input).
Severity override output: If persona defines custom severity levels (CRITICAL), should the JSON output include them, or map back to standard MAJOR/MINOR/NIT? Current thinking: keep standard output format, use severity calibration only for prompt guidance.

Completion Checklist

Persona struct matches YAML schema exactly?
Built-in personas embedded in binary (not external files)?
--persona and --persona-file are mutually exclusive?
Unknown persona name produces clear error with valid options?
Empty persona file fields have sensible defaults?
Persona prompt integrates with budget system (token counting)?
Backwards compatibility: no flags = current behavior?
Review header shows persona display name?
Sentinel still uses reviewer-name (not persona name)?
Unit tests cover parse errors, missing fields, valid YAML?

Design Review Findings (Self-Review)

Finding 1: Severity Mapping

The persona YAML allows critical severity, but the LLM output parser (review/parser.go) only accepts MAJOR/MINOR/NIT.

Resolution: Keep standard output format. Persona severity section is ONLY for calibrating the LLM's judgment (prompt guidance). Output must still use MAJOR/MINOR/NIT. Document this clearly in persona format docs.

Finding 2: Embedding Built-in Personas

Go doesn't natively embed YAML. Must use //go:embed directive (Go 1.16+).

Resolution: Create review/personas/ directory with YAML files and use:

//go:embed personas/*.yaml
var embeddedPersonas embed.FS

Finding 3: display_name vs reviewer-name

Design says header shows "persona display name" but sentinel uses "reviewer-name". This is correct - they serve different purposes:

display_name → human-readable header ("Security Specialist Review")
reviewer-name → machine sentinel for cleanup ()

When persona is used, display_name takes precedence for the header title, but reviewer-name (CLI flag) is still used for the sentinel.

Design Revision: JSON Instead of YAML

Reason: Project convention is "Go standard library only — no external dependencies."

YAML requires gopkg.in/yaml.v3 or similar. To maintain zero dependencies, persona files will use JSON instead.

Updated Persona File Format

{
  "name": "security",
  "display_name": "Security Specialist",
  "model_preference": "opus",
  "identity": "You are a security specialist reviewing code for vulnerabilities.\nYour expertise: OWASP Top 10, injection attacks, auth/authz, secrets management.",
  "focus": [
    "Injection attacks (SQL, command, path traversal, template)",
    "Authentication and authorization gaps",
    "Secrets exposure (hardcoded credentials, tokens in logs)"
  ],
  "ignore": [
    "Code style and naming conventions",
    "Performance (unless security-related)",
    "Documentation"
  ],
  "severity": {
    "major": "Privilege escalation, information disclosure, DoS",
    "minor": "Missing rate limiting, verbose errors",
    "nit": "Theoretical risk with low exploitability"
  }
}

This maintains all the same fields but uses JSON encoding, which Go handles natively via encoding/json.

12 KiB Raw Blame History

Design: Role-based Review Personas (Issue #51)

Problem

Constraints

Proposed Approach

1. Persona Definition

2. Persona File Format

3. New CLI Flags

4. Prompt Assembly

5. Built-in Personas

6. CI Workflow Integration

7. Persona + Patterns Interaction

8. Output Format Changes

State/Data Model

Persona struct

Loading precedence

Error Cases

Edge Cases

Testing Strategy

Unit tests

Integration tests

Manual verification

Implementation Phases

Phase 1: Persona types and loading

Phase 2: Prompt generation

Phase 3: CLI integration

Phase 4: Action integration

Phase 5: Built-in personas

Open Questions

Completion Checklist

Design Review Findings (Self-Review)

Finding 1: Severity Mapping

Finding 2: Embedding Built-in Personas

Finding 3: display_name vs reviewer-name

Design Revision: JSON Instead of YAML

Updated Persona File Format

12 KiB

Raw Blame History