feat(#137): add doc-map input for path-scoped doc injection #138

First-time contributor

[NIT] Committing a PLAN-137.md file to the repository is unusual — planning/design documents are typically kept in issue trackers or wikis. This couples the implementation repository to an issue-tracking artifact. Consider removing it before merge or moving it to a docs/ or .design/ directory if architecture decision records are desired.

**[NIT]** Committing a PLAN-137.md file to the repository is unusual — planning/design documents are typically kept in issue trackers or wikis. This couples the implementation repository to an issue-tracking artifact. Consider removing it before merge or moving it to a `docs/` or `.design/` directory if architecture decision records are desired.

review/docmap.go

						
				@@ -0,0 +54,4 @@

						return nil, fmt.Errorf("read doc-map file %q: %w", localPath, err)

					}

					var cfg DocMapConfig

sonnet-review-bot commented

First-time contributor

[MINOR] The package comment says 'Package review provides doc-map parsing and doc injection...' but the review package already exists and has a broader purpose. This package-level comment in docmap.go will either be ignored (not the first file) or will conflict with the package doc in other files. The file-level comment should just be a regular comment, not a package doc comment, or should be omitted since the package is already documented elsewhere.

**[MINOR]** The package comment says 'Package review provides doc-map parsing and doc injection...' but the `review` package already exists and has a broader purpose. This package-level comment in docmap.go will either be ignored (not the first file) or will conflict with the package doc in other files. The file-level comment should just be a regular comment, not a `package` doc comment, or should be omitted since the package is already documented elsewhere.

review/docmap.go

						
				@@ -0,0 +198,4 @@

							continue

						}

						for _, entry := range entries {

sonnet-review-bot commented

First-time contributor

[MINOR] The truncateUTF8 function is duplicated — an identical implementation already exists in budget/budget.go. Per the DRY principle and the project's own patterns, this should be extracted to a shared utility. Since review and budget are separate packages, the options are: (1) move it to a shared internal package, or (2) accept the duplication with a comment. As-is, future changes to the truncation logic must be made in two places.

**[MINOR]** The `truncateUTF8` function is duplicated — an identical implementation already exists in `budget/budget.go`. Per the DRY principle and the project's own patterns, this should be extracted to a shared utility. Since `review` and `budget` are separate packages, the options are: (1) move it to a shared internal package, or (2) accept the duplication with a comment. As-is, future changes to the truncation logic must be made in two places.

review/docmap.go

						
				@@ -0,0 +242,4 @@

					content string

				}

				// loadDocEntries returns the doc content for a given path.

sonnet-review-bot commented

First-time contributor

[MINOR] In loadDocEntries, when GetAllFilesInPath returns an error (rather than empty results), the code silently falls through to try GetFileContent. The plan's decision explicitly states 'If GetAllFilesInPath returns an error, try GetFileContent', which this does — but the directory error is swallowed without logging. If both calls fail, only the file error is returned. Consider logging a debug message for the directory error before the fallback, which would help diagnose unexpected behavior.

**[MINOR]** In `loadDocEntries`, when `GetAllFilesInPath` returns an error (rather than empty results), the code silently falls through to try `GetFileContent`. The plan's decision explicitly states 'If GetAllFilesInPath returns an error, try GetFileContent', which this does — but the directory error is swallowed without logging. If both calls fail, only the file error is returned. Consider logging a debug message for the directory error before the fallback, which would help diagnose unexpected behavior.

review/docmap_test.go

						
				@@ -0,0 +11,4 @@

				// fakeDocFetcher is a mock DocFetcher for tests.

				type fakeDocFetcher struct {

					files map[string]string // path -> content

sonnet-review-bot commented

First-time contributor

[NIT] The fakeDocFetcher struct fields have inconsistent comment styles (inline comments on the struct fields are fine), but the dirs field type map[string]map[string]string could benefit from a brief comment explaining the structure, similar to the files field comment. Minor readability nit.

**[NIT]** The `fakeDocFetcher` struct fields have inconsistent comment styles (inline comments on the struct fields are fine), but the `dirs` field type `map[string]map[string]string` could benefit from a brief comment explaining the structure, similar to the `files` field comment. Minor readability nit.

review/docmap_test.go

						
				@@ -0,0 +335,4 @@

					}

				}

				func TestLoadMatchingDocs_ContextSizeGuard(t *testing.T) {

sonnet-review-bot commented

2026-05-15 03:27:28 +00:00

First-time contributor

[NIT] The test TestLoadMatchingDocs_ContextSizeGuard uses MaxBytes: 350 with content of 200 bytes each and checks len(content) > 600. The content after metadata (headings, newlines) will be well above 350 bytes due to formatting overhead. The test is validating the right behavior (truncation occurs), but the boundary check (> 600) is loose. Not a bug, just slightly confusing as the actual formatted content with headings for doc a alone is ~220+ bytes.

**[NIT]** The test `TestLoadMatchingDocs_ContextSizeGuard` uses `MaxBytes: 350` with content of 200 bytes each and checks `len(content) > 600`. The content after metadata (headings, newlines) will be well above 350 bytes due to formatting overhead. The test is validating the right behavior (truncation occurs), but the boundary check (`> 600`) is loose. Not a bug, just slightly confusing as the actual formatted content with headings for doc a alone is ~220+ bytes.

rodin commented

Self-review against 9670a5fda3

Assessment: ✅ Clean

No issues found — ready for human review.

Checks:

Error handling: All errors wrapped/propagated. Nil files slice from Step 3 failure safely handled. ✅
Coherence: Single purpose (doc-map injection). All changes necessary. ✅
Fits codebase: VCS-fetched docs match conventions-file pattern. Interface in consumer package. ✅
Solution: VCS API approach correct. ** glob without new dependency. Size guard prevents context overflow. ✅
Completeness: All 10 acceptance criteria met. Full test coverage. CHANGELOG created. ✅

Self-review against 9670a5fda358a4d46b8ccd72b61eaea601d6e020 Assessment: ✅ Clean No issues found — ready for human review. **Checks:** - Error handling: All errors wrapped/propagated. Nil `files` slice from Step 3 failure safely handled. ✅ - Coherence: Single purpose (doc-map injection). All changes necessary. ✅ - Fits codebase: VCS-fetched docs match `conventions-file` pattern. Interface in consumer package. ✅ - Solution: VCS API approach correct. `**` glob without new dependency. Size guard prevents context overflow. ✅ - Completeness: All 10 acceptance criteria met. Full test coverage. CHANGELOG created. ✅

rodin added the self-reviewed needs-review labels 2026-05-15 03:27:40 +00:00

security-review-bot requested review from security-review-bot 2026-05-15 03:28:03 +00:00

security-review-bot approved these changes 2026-05-15 03:28:03 +00:00

security-review-bot left a comment

~~Original review~~

Superseded — see current review for up-to-date findings.

Previous findings (commit 9670a5fd)

Security Review

Summary

The changes add a path-scoped design doc injection feature with careful workspace path validation and use of the approved YAML parser. CI passed and the implementation avoids obvious SSRF, injection, or auth issues. I identified a few defense-in-depth hardening opportunities around resource limits and LLM prompt injection handling.

Findings

#	Severity	File	Line	Finding
1	[MINOR]	`review/docmap.go`	168	Potential denial-of-service via unbounded directory fetch: LoadMatchingDocs calls GetAllFilesInPath which returns full contents of all files under a path before applying the MaxBytes cap. A malicious or misconfigured doc-map could point to a very large directory, causing excessive memory/network usage prior to truncation.
2	[MINOR]	`budget/budget.go`	187	Prompt-injection hardening: Design docs (which are ultimately repository-controlled data) are injected into the system prompt without explicit instruction separation. While content is fetched from the default branch (reducing attacker control via PR), best practice is to clearly treat docs as data and direct the model not to follow any instructions contained within them to mitigate prompt injection.
3	[MINOR]	`review/docmap.go`	229	Path hardening: doc paths from the local YAML are passed directly to GetAllFilesInPath/GetFileContent. Although the VCS API should constrain paths to the repo, adding local validation (e.g., reject absolute paths, clean and deny any '..' segments) would provide defense-in-depth against potential backend path handling quirks.

Recommendation

APPROVE — Overall, the feature is implemented securely with good input validation for the local doc-map file and appropriate use of the approved YAML library. To strengthen the security posture: (1) Add resource-exhaustion guards to doc fetching by avoiding full directory content retrieval before enforcing limits—prefer a two-phase approach (list metadata first, then fetch files incrementally until the byte budget is reached), and consider caps on number of files and per-file size. (2) When embedding design docs, include explicit prompt-guardrails in the system message that instruct the model to treat design-doc content as reference data and to ignore any instructions found within that content, or move the docs to the user prompt with clear data boundaries to mitigate prompt injection risks. (3) Sanitize doc paths from the YAML (reject absolute paths and any path traversal components) before invoking VCS API calls for defense-in-depth. With these adjustments, the changes would further reduce DoS and LLM prompt manipulation risks while maintaining the intended functionality.

Review by security

Evaluated against 9670a5fd

~~Original review~~ **Superseded** — [see current review](https://gitea.weiker.me/rodin/review-bot/pulls/138#pullrequestreview-3809) for up-to-date findings. <details><summary>Previous findings (commit 9670a5fd)</summary> # Security Review ## Summary The changes add a path-scoped design doc injection feature with careful workspace path validation and use of the approved YAML parser. CI passed and the implementation avoids obvious SSRF, injection, or auth issues. I identified a few defense-in-depth hardening opportunities around resource limits and LLM prompt injection handling. ## Findings | # | Severity | File | Line | Finding | |---|----------|------|------|--------| | 1 | [MINOR] | `review/docmap.go` | 168 | Potential denial-of-service via unbounded directory fetch: LoadMatchingDocs calls GetAllFilesInPath which returns full contents of all files under a path before applying the MaxBytes cap. A malicious or misconfigured doc-map could point to a very large directory, causing excessive memory/network usage prior to truncation. | | 2 | [MINOR] | `budget/budget.go` | 187 | Prompt-injection hardening: Design docs (which are ultimately repository-controlled data) are injected into the system prompt without explicit instruction separation. While content is fetched from the default branch (reducing attacker control via PR), best practice is to clearly treat docs as data and direct the model not to follow any instructions contained within them to mitigate prompt injection. | | 3 | [MINOR] | `review/docmap.go` | 229 | Path hardening: doc paths from the local YAML are passed directly to GetAllFilesInPath/GetFileContent. Although the VCS API should constrain paths to the repo, adding local validation (e.g., reject absolute paths, clean and deny any '..' segments) would provide defense-in-depth against potential backend path handling quirks. | ## Recommendation **APPROVE** — Overall, the feature is implemented securely with good input validation for the local doc-map file and appropriate use of the approved YAML library. To strengthen the security posture: (1) Add resource-exhaustion guards to doc fetching by avoiding full directory content retrieval before enforcing limits—prefer a two-phase approach (list metadata first, then fetch files incrementally until the byte budget is reached), and consider caps on number of files and per-file size. (2) When embedding design docs, include explicit prompt-guardrails in the system message that instruct the model to treat design-doc content as reference data and to ignore any instructions found within that content, or move the docs to the user prompt with clear data boundaries to mitigate prompt injection risks. (3) Sanitize doc paths from the YAML (reject absolute paths and any path traversal components) before invoking VCS API calls for defense-in-depth. With these adjustments, the changes would further reduce DoS and LLM prompt manipulation risks while maintaining the intended functionality. --- *Review by security*  --- *Evaluated against 9670a5fd* </details>

budget/budget.go

						
				@@ -185,6 +187,10 @@ func buildResult(s Sections, trimmed []string, estTokens int) Result {

						sys.WriteString("\n\n## Repository Conventions\n\nThe repository has the following coding conventions that must be respected:\n\n")

security-review-bot commented

[MINOR] Prompt-injection hardening: Design docs (which are ultimately repository-controlled data) are injected into the system prompt without explicit instruction separation. While content is fetched from the default branch (reducing attacker control via PR), best practice is to clearly treat docs as data and direct the model not to follow any instructions contained within them to mitigate prompt injection.

**[MINOR]** Prompt-injection hardening: Design docs (which are ultimately repository-controlled data) are injected into the system prompt without explicit instruction separation. While content is fetched from the default branch (reducing attacker control via PR), best practice is to clearly treat docs as data and direct the model not to follow any instructions contained within them to mitigate prompt injection.

security-review-bot marked this conversation as resolved

review/docmap.go

						
				@@ -0,0 +165,4 @@

				// a formatted string suitable for injection into the system prompt.

				//

				// Behavior:

				//   - Paths that look like directories (end with /, or GetAllFilesInPath returns files)

security-review-bot commented

[MINOR] Potential denial-of-service via unbounded directory fetch: LoadMatchingDocs calls GetAllFilesInPath which returns full contents of all files under a path before applying the MaxBytes cap. A malicious or misconfigured doc-map could point to a very large directory, causing excessive memory/network usage prior to truncation.

**[MINOR]** Potential denial-of-service via unbounded directory fetch: LoadMatchingDocs calls GetAllFilesInPath which returns full contents of all files under a path before applying the MaxBytes cap. A malicious or misconfigured doc-map could point to a very large directory, causing excessive memory/network usage prior to truncation.

security-review-bot marked this conversation as resolved

review/docmap.go

						
				@@ -0,0 +226,4 @@

								sb.WriteString("\n> ⚠️ (truncated — size limit reached)\n")

							}

							totalBytes += len(content)

							slog.Debug("doc-map: injected doc", "path", entry.path, "bytes", len(content))

security-review-bot commented

[MINOR] Path hardening: doc paths from the local YAML are passed directly to GetAllFilesInPath/GetFileContent. Although the VCS API should constrain paths to the repo, adding local validation (e.g., reject absolute paths, clean and deny any '..' segments) would provide defense-in-depth against potential backend path handling quirks.

**[MINOR]** Path hardening: doc paths from the local YAML are passed directly to GetAllFilesInPath/GetFileContent. Although the VCS API should constrain paths to the repo, adding local validation (e.g., reject absolute paths, clean and deny any '..' segments) would provide defense-in-depth against potential backend path handling quirks.

security-review-bot marked this conversation as resolved

gpt-review-bot approved these changes 2026-05-15 03:28:03 +00:00

gpt-review-bot left a comment

First-time contributor

~~Original review~~

Superseded — see current review for up-to-date findings.

Previous findings (commit 9670a5fd)

Gpt Review

Summary

Solid implementation of path-scoped design doc injection with clear wiring through flags, action inputs, budgeting, and comprehensive tests. Minor documentation and comment consistency issues noted, but functionally and idiomatically sound.

Findings

#	Severity	File	Line	Finding
1	[MINOR]	`budget/budget.go`	1	Top-of-file comment still states trim order as 'patterns → conventions → file context → diff truncation' but code now includes 'design docs' between conventions and file context. Update the comment to reflect the new order to avoid confusion.
2	[MINOR]	`README.md`	1	The 'Action Inputs' table does not list the new 'doc-map' and 'doc-map-max-bytes' inputs, even though they're mentioned elsewhere. Consider adding them to keep the reference table complete.
3	[NIT]	`review/docmap.go`	1	Comment for globMatch mentions 'Standard path.Match patterns' while code uses filepath.Match. Consider aligning the comment to say 'filepath.Match patterns' for accuracy.
4	[NIT]	`review/docmap.go`	53	In ParseDocMapConfig fallback, the returned error on second unmarshal failure wraps the original strict-mode error (err), potentially obscuring the relaxed unmarshal error (err2). Consider returning or including the second error for clearer diagnostics.
5	[NIT]	`review/docmap.go`	232	sortDocEntries uses a manual insertion sort; consider using sort.Slice for simplicity and readability since the standard library provides it.

Recommendation

APPROVE — The feature is well-implemented: action inputs are correctly added and passed via environment variables, CLI flags are introduced and validated, documents are matched and loaded via VCS with robust glob matching and a sensible size cap, and the budgeting layer integrates the new 'DesignDocs' section in the appropriate trim order. Tests are comprehensive and cover parsing, matching, directory expansion, size guards, and error handling, aligning with the repository's testing conventions. CI is green. Please address the minor documentation inconsistencies (trim order comment and README inputs table) and consider the small comment and error-reporting nits for clarity. Otherwise, this is good to merge.

Review by gpt

Evaluated against 9670a5fd

~~Original review~~ **Superseded** — [see current review](https://gitea.weiker.me/rodin/review-bot/pulls/138#pullrequestreview-3810) for up-to-date findings. <details><summary>Previous findings (commit 9670a5fd)</summary> # Gpt Review ## Summary Solid implementation of path-scoped design doc injection with clear wiring through flags, action inputs, budgeting, and comprehensive tests. Minor documentation and comment consistency issues noted, but functionally and idiomatically sound. ## Findings | # | Severity | File | Line | Finding | |---|----------|------|------|--------| | 1 | [MINOR] | `budget/budget.go` | 1 | Top-of-file comment still states trim order as 'patterns → conventions → file context → diff truncation' but code now includes 'design docs' between conventions and file context. Update the comment to reflect the new order to avoid confusion. | | 2 | [MINOR] | `README.md` | 1 | The 'Action Inputs' table does not list the new 'doc-map' and 'doc-map-max-bytes' inputs, even though they're mentioned elsewhere. Consider adding them to keep the reference table complete. | | 3 | [NIT] | `review/docmap.go` | 1 | Comment for globMatch mentions 'Standard path.Match patterns' while code uses filepath.Match. Consider aligning the comment to say 'filepath.Match patterns' for accuracy. | | 4 | [NIT] | `review/docmap.go` | 53 | In ParseDocMapConfig fallback, the returned error on second unmarshal failure wraps the original strict-mode error (err), potentially obscuring the relaxed unmarshal error (err2). Consider returning or including the second error for clearer diagnostics. | | 5 | [NIT] | `review/docmap.go` | 232 | sortDocEntries uses a manual insertion sort; consider using sort.Slice for simplicity and readability since the standard library provides it. | ## Recommendation **APPROVE** — The feature is well-implemented: action inputs are correctly added and passed via environment variables, CLI flags are introduced and validated, documents are matched and loaded via VCS with robust glob matching and a sensible size cap, and the budgeting layer integrates the new 'DesignDocs' section in the appropriate trim order. Tests are comprehensive and cover parsing, matching, directory expansion, size guards, and error handling, aligning with the repository's testing conventions. CI is green. Please address the minor documentation inconsistencies (trim order comment and README inputs table) and consider the small comment and error-reporting nits for clarity. Otherwise, this is good to merge. --- *Review by gpt*  --- *Evaluated against 9670a5fd* </details>

review/docmap.go Outdated

						
				@@ -0,0 +1,303 @@

				// Package review provides doc-map parsing and doc injection for path-scoped

gpt-review-bot commented

First-time contributor

[NIT] Comment for globMatch mentions 'Standard path.Match patterns' while code uses filepath.Match. Consider aligning the comment to say 'filepath.Match patterns' for accuracy.

**[NIT]** Comment for globMatch mentions 'Standard path.Match patterns' while code uses filepath.Match. Consider aligning the comment to say 'filepath.Match patterns' for accuracy.

review/docmap.go

						
				@@ -0,0 +50,4 @@

				// Unknown top-level keys produce a warning but are not fatal.

				func ParseDocMapConfig(localPath string) (*DocMapConfig, error) {

					data, err := readFileBytes(localPath)

					if err != nil {

gpt-review-bot commented

First-time contributor

[NIT] In ParseDocMapConfig fallback, the returned error on second unmarshal failure wraps the original strict-mode error (err), potentially obscuring the relaxed unmarshal error (err2). Consider returning or including the second error for clearer diagnostics.

**[NIT]** In ParseDocMapConfig fallback, the returned error on second unmarshal failure wraps the original strict-mode error (err), potentially obscuring the relaxed unmarshal error (err2). Consider returning or including the second error for clearer diagnostics.

review/docmap.go

						
				@@ -0,0 +229,4 @@

							slog.Debug("doc-map: injected doc", "path", entry.path, "bytes", len(content))

						}

					}

gpt-review-bot commented

2026-05-15 03:33:21 +00:00

First-time contributor

[NIT] sortDocEntries uses a manual insertion sort; consider using sort.Slice for simplicity and readability since the standard library provides it.

**[NIT]** sortDocEntries uses a manual insertion sort; consider using sort.Slice for simplicity and readability since the standard library provides it.

rodin added 5 commits 2026-05-15 03:32:32 +00:00

fix(#137 ): address review findings in docmap.go a24edeee89

- Fix package comment collision: convert to file comment (not package doc)
- Add debug log for directory expansion failure before single-file fallback
- Add validateDocPath: reject absolute paths and '..' segments (security #3)
- Update globMatch comment to say 'filepath.Match' not 'path.Match' (gpt nit #3)
- Add duplication note to truncateUTF8 explaining why it's kept separate (sonnet #2)

test(#137 ): add tests for validateDocPath and path traversal rejection a80c12355b

fix(#137 ): address review findings in budget.go 860dd98415

- Update package comment trim order to include design docs (gpt #1)
- Add prompt injection guardrail for DesignDocs section (security #2)

docs(#137 ): add doc-map and doc-map-max-bytes to action inputs table (gpt #2 ) 45e2f5fc1c

chore(#137 ): update CHANGELOG with security fixes from review

PR Ready Gate / clear-labels (pull_request) Successful in 2s

Details

CI / test (pull_request) Successful in 17s

Details

CI / review (anthropic--claude-4.6-sonnet, sonnet, SONNET_REVIEW_TOKEN) (pull_request) Successful in 44s

Details

CI / review (gpt-5, security, ., rodin/security-patterns, SECURITY_REVIEW.md, SECURITY_REVIEW_TOKEN) (pull_request) Successful in 1m31s

Details

CI / review (gpt-5, gpt, GPT_REVIEW_TOKEN) (pull_request) Successful in 1m48s

Details

f7815b8778

rodin removed the needs-review self-reviewed labels 2026-05-15 03:32:44 +00:00

aweiker was assigned by rodin

2026-05-15 03:32:48 +00:00

rodin added the ready self-reviewed labels 2026-05-15 03:32:56 +00:00

rodin referenced this issue from a commit

chore: dev-loop health check — PR #138 ready for merge at 2026-05-15 03:33 UTC

sonnet-review-bot approved these changes 2026-05-15 03:33:32 +00:00

sonnet-review-bot left a comment

First-time contributor

~~Original review~~

Superseded — see current review for up-to-date findings.

Previous findings (commit f7815b87)

Sonnet Review

Summary

A well-structured feature addition implementing path-scoped design document injection. The code follows established Go patterns, is well-tested, and handles security concerns (path traversal) and error cases correctly. A few minor issues worth noting but none are blockers.

Findings

#	Severity	File	Line	Finding
1	[MINOR]	`review/docmap.go`	57	ParseDocMapConfig uses yaml.Strict() first, then re-parses with yaml.Unmarshal on failure to get the 'unknown keys' warning. This double-parse is slightly wasteful but more importantly, it means the error message logged is the strict-mode error (which describes the unknown key) rather than a custom message. The current approach works correctly but could be simplified: just always use non-strict mode and rely on the YAML library's warning mechanisms if available, or accept that users won't get per-key detail. The current approach is acceptable given the go-yaml library's API.
2	[MINOR]	`review/docmap.go`	170	LoadMatchingDocs writes a truncation notice ("⚠️ Design document context truncated — size limit reached.") to the string builder when limitReached is set inside the outer loop over entries, but this notice can be written multiple times if the outer loop continues processing additional docPaths after limitReached is already set. The outer loop does break on limitReached, so the notice in the inner loop (line 229) is the concern. Looking more carefully: the inner loop only writes the notice once then breaks, but if the same doc triggers truncation, the outer notice at line 229 fires then the outer loop also has a break due to limitReached. The flow is actually correct — the inner notice fires at most once (truncated flag set, limitReached set, inner break), and the outer loop breaks at the top. No double-write occurs. However, this is subtle enough to warrant a clarifying comment.
3	[MINOR]	`review/docmap.go`	302	sortDocEntries uses an insertion sort implemented manually. The standard library's sort.Slice would be idiomatic and more readable: `sort.Slice(entries, func(i, j int) bool { return entries[i].path < entries[j].path })`. The comment says "doc lists are small" which justifies the O(n²) complexity, but the standard library sort is both clearer and handles all sizes correctly. This is a NIT-level style issue per project conventions.
4	[NIT]	`review/docmap.go`	1	The package doc comment is a single-line imperative description rather than the conventional `// Package review ...` format documented in the Go patterns (documentation.md pattern #3). The comment reads `// doc-map parsing and doc injection for path-scoped design document context in AI code reviews.` — but this is a file-level comment, not a package comment. The review package presumably has a package comment elsewhere; this file comment is fine as a file-level description, though it's atypical (most Go files don't have pre-package file comments).
5	[NIT]	`review/docmap_test.go`	14	The fakeDocFetcher struct fields lack alignment/formatting: `dirs map[string]map[string]string // dir path -> (file path -> content)` — the comment is useful but gofmt may reformat the spacing. Not a correctness issue.
6	[NIT]	`PLAN-137.md`	1	Committing a PLAN file to the repository is unusual and will persist in the repo history as a planning artifact. Consider whether this file should be in the PR description or a wiki instead. That said, some projects intentionally keep ADRs/plans in the repo, so this is a process note rather than a code issue.

Recommendation

APPROVE — Approve. The implementation is solid: the DocFetcher interface keeps the review package decoupled from cmd-level dependencies (correct application of the interface pattern), the glob matching correctly handles **, path traversal is defended at both the config-reading level and before VCS calls, the budget integration is clean with the right trimming priority, and test coverage is thorough (parse, match, load, dir expansion, truncation, path traversal rejection). The duplicated truncateUTF8 is explicitly acknowledged with a justification comment, which is the right approach. The minor findings are cosmetic or informational and do not affect correctness or security.

Review by sonnet

Evaluated against f7815b87

~~Original review~~ **Superseded** — [see current review](https://gitea.weiker.me/rodin/review-bot/pulls/138#pullrequestreview-3811) for up-to-date findings. <details><summary>Previous findings (commit f7815b87)</summary> # Sonnet Review ## Summary A well-structured feature addition implementing path-scoped design document injection. The code follows established Go patterns, is well-tested, and handles security concerns (path traversal) and error cases correctly. A few minor issues worth noting but none are blockers. ## Findings | # | Severity | File | Line | Finding | |---|----------|------|------|--------| | 1 | [MINOR] | `review/docmap.go` | 57 | ParseDocMapConfig uses yaml.Strict() first, then re-parses with yaml.Unmarshal on failure to get the 'unknown keys' warning. This double-parse is slightly wasteful but more importantly, it means the error message logged is the strict-mode error (which describes the unknown key) rather than a custom message. The current approach works correctly but could be simplified: just always use non-strict mode and rely on the YAML library's warning mechanisms if available, or accept that users won't get per-key detail. The current approach is acceptable given the go-yaml library's API. | | 2 | [MINOR] | `review/docmap.go` | 170 | LoadMatchingDocs writes a truncation notice ("⚠️ Design document context truncated — size limit reached.") to the string builder when limitReached is set inside the outer loop over entries, but this notice can be written multiple times if the outer loop continues processing additional docPaths after limitReached is already set. The outer loop does break on limitReached, so the notice in the inner loop (line 229) is the concern. Looking more carefully: the inner loop only writes the notice once then breaks, but if the same doc triggers truncation, the outer notice at line 229 fires then the outer loop also has a break due to limitReached. The flow is actually correct — the inner notice fires at most once (truncated flag set, limitReached set, inner break), and the outer loop breaks at the top. No double-write occurs. However, this is subtle enough to warrant a clarifying comment. | | 3 | [MINOR] | `review/docmap.go` | 302 | sortDocEntries uses an insertion sort implemented manually. The standard library's sort.Slice would be idiomatic and more readable: `sort.Slice(entries, func(i, j int) bool { return entries[i].path < entries[j].path })`. The comment says "doc lists are small" which justifies the O(n²) complexity, but the standard library sort is both clearer and handles all sizes correctly. This is a NIT-level style issue per project conventions. | | 4 | [NIT] | `review/docmap.go` | 1 | The package doc comment is a single-line imperative description rather than the conventional `// Package review ...` format documented in the Go patterns (documentation.md pattern #3). The comment reads `// doc-map parsing and doc injection for path-scoped design document context in AI code reviews.` — but this is a file-level comment, not a package comment. The review package presumably has a package comment elsewhere; this file comment is fine as a file-level description, though it's atypical (most Go files don't have pre-package file comments). | | 5 | [NIT] | `review/docmap_test.go` | 14 | The fakeDocFetcher struct fields lack alignment/formatting: `dirs map[string]map[string]string // dir path -> (file path -> content)` — the comment is useful but gofmt may reformat the spacing. Not a correctness issue. | | 6 | [NIT] | `PLAN-137.md` | 1 | Committing a PLAN file to the repository is unusual and will persist in the repo history as a planning artifact. Consider whether this file should be in the PR description or a wiki instead. That said, some projects intentionally keep ADRs/plans in the repo, so this is a process note rather than a code issue. | ## Recommendation **APPROVE** — Approve. The implementation is solid: the DocFetcher interface keeps the review package decoupled from cmd-level dependencies (correct application of the interface pattern), the glob matching correctly handles `**`, path traversal is defended at both the config-reading level and before VCS calls, the budget integration is clean with the right trimming priority, and test coverage is thorough (parse, match, load, dir expansion, truncation, path traversal rejection). The duplicated truncateUTF8 is explicitly acknowledged with a justification comment, which is the right approach. The minor findings are cosmetic or informational and do not affect correctness or security. --- *Review by sonnet*  --- *Evaluated against f7815b87* </details>

PLAN-137.md Outdated

						
				@@ -0,0 +1,194 @@

				# Plan: Issue #137 — doc-map input for path-scoped doc injection

sonnet-review-bot commented

First-time contributor

[NIT] Committing a PLAN file to the repository is unusual and will persist in the repo history as a planning artifact. Consider whether this file should be in the PR description or a wiki instead. That said, some projects intentionally keep ADRs/plans in the repo, so this is a process note rather than a code issue.

**[NIT]** Committing a PLAN file to the repository is unusual and will persist in the repo history as a planning artifact. Consider whether this file should be in the PR description or a wiki instead. That said, some projects intentionally keep ADRs/plans in the repo, so this is a process note rather than a code issue.

review/docmap.go

						
				@@ -0,0 +1,332 @@

				// doc-map parsing and doc injection for path-scoped design document context in AI code reviews.

sonnet-review-bot commented

First-time contributor

[NIT] The package doc comment is a single-line imperative description rather than the conventional // Package review ... format documented in the Go patterns (documentation.md pattern #3). The comment reads // doc-map parsing and doc injection for path-scoped design document context in AI code reviews. — but this is a file-level comment, not a package comment. The review package presumably has a package comment elsewhere; this file comment is fine as a file-level description, though it's atypical (most Go files don't have pre-package file comments).

**[NIT]** The package doc comment is a single-line imperative description rather than the conventional `// Package review ...` format documented in the Go patterns (documentation.md pattern #3). The comment reads `// doc-map parsing and doc injection for path-scoped design document context in AI code reviews.` — but this is a file-level comment, not a package comment. The review package presumably has a package comment elsewhere; this file comment is fine as a file-level description, though it's atypical (most Go files don't have pre-package file comments).

review/docmap.go

						
				@@ -0,0 +54,4 @@

					}

					var cfg DocMapConfig

					if err := yaml.UnmarshalWithOptions(data, &cfg, yaml.Strict()); err != nil {

sonnet-review-bot commented

First-time contributor

[MINOR] ParseDocMapConfig uses yaml.Strict() first, then re-parses with yaml.Unmarshal on failure to get the 'unknown keys' warning. This double-parse is slightly wasteful but more importantly, it means the error message logged is the strict-mode error (which describes the unknown key) rather than a custom message. The current approach works correctly but could be simplified: just always use non-strict mode and rely on the YAML library's warning mechanisms if available, or accept that users won't get per-key detail. The current approach is acceptable given the go-yaml library's API.

**[MINOR]** ParseDocMapConfig uses yaml.Strict() first, then re-parses with yaml.Unmarshal on failure to get the 'unknown keys' warning. This double-parse is slightly wasteful but more importantly, it means the error message logged is the strict-mode error (which describes the unknown key) rather than a custom message. The current approach works correctly but could be simplified: just always use non-strict mode and rely on the YAML library's warning mechanisms if available, or accept that users won't get per-key detail. The current approach is acceptable given the go-yaml library's API.

review/docmap.go

						
				@@ -0,0 +167,4 @@

				//   - Paths that look like directories (end with /, or GetAllFilesInPath returns files)

				//     are expanded to all .md files under them.

				//   - Missing files are logged as warnings and skipped.

				//   - Total content is capped at opts.MaxBytes; truncation is noted inline.

sonnet-review-bot commented

First-time contributor

[MINOR] LoadMatchingDocs writes a truncation notice ("⚠️ Design document context truncated — size limit reached.") to the string builder when limitReached is set inside the outer loop over entries, but this notice can be written multiple times if the outer loop continues processing additional docPaths after limitReached is already set. The outer loop does break on limitReached, so the notice in the inner loop (line 229) is the concern. Looking more carefully: the inner loop only writes the notice once then breaks, but if the same doc triggers truncation, the outer notice at line 229 fires then the outer loop also has a break due to limitReached. The flow is actually correct — the inner notice fires at most once (truncated flag set, limitReached set, inner break), and the outer loop breaks at the top. No double-write occurs. However, this is subtle enough to warrant a clarifying comment.

**[MINOR]** LoadMatchingDocs writes a truncation notice ("⚠️ Design document context truncated — size limit reached.") to the string builder when limitReached is set inside the outer loop over entries, but this notice can be written multiple times if the outer loop continues processing additional docPaths after limitReached is already set. The outer loop does break on limitReached, so the notice in the inner loop (line 229) is the concern. Looking more carefully: the inner loop only writes the notice once then breaks, but if the same doc triggers truncation, the outer notice at line 229 fires then the outer loop also has a break due to limitReached. The flow is actually correct — the inner notice fires at most once (truncated flag set, limitReached set, inner break), and the outer loop breaks at the top. No double-write occurs. However, this is subtle enough to warrant a clarifying comment.

review/docmap.go

						
				@@ -0,0 +299,4 @@

				}

				// validateDocPath rejects doc paths that could cause path traversal via the

				// VCS API (absolute paths, any ".." segment). Defense-in-depth: the VCS API

sonnet-review-bot commented

First-time contributor

[MINOR] sortDocEntries uses an insertion sort implemented manually. The standard library's sort.Slice would be idiomatic and more readable: sort.Slice(entries, func(i, j int) bool { return entries[i].path < entries[j].path }). The comment says "doc lists are small" which justifies the O(n²) complexity, but the standard library sort is both clearer and handles all sizes correctly. This is a NIT-level style issue per project conventions.

**[MINOR]** sortDocEntries uses an insertion sort implemented manually. The standard library's sort.Slice would be idiomatic and more readable: `sort.Slice(entries, func(i, j int) bool { return entries[i].path < entries[j].path })`. The comment says "doc lists are small" which justifies the O(n²) complexity, but the standard library sort is both clearer and handles all sizes correctly. This is a NIT-level style issue per project conventions.

review/docmap_test.go

						
				@@ -0,0 +11,4 @@

				// fakeDocFetcher is a mock DocFetcher for tests.

				type fakeDocFetcher struct {

					files map[string]string // path -> content

sonnet-review-bot commented