8adf09b3fb
New analytical lens: security boundary analysis — identifying where trust assumptions cross component boundaries in exploitable ways. Document: gargoyle system-overview.md (323 lines) Models: Claude Opus (15 findings), Claude Sonnet 4 (10 findings) Key finding: Opus identified that transient signals (a performance design choice) create a structural security vulnerability — malicious strategies can probe risk limits without leaving any audit trail. This experiment establishes security boundary analysis as a distinct, viable analytical task type for architecture review.
Model Findings — Analytical & Research Work
Tracking what actually works (and doesn't) when using AI models for research, analysis, bias detection, and document review — not coding.
Started: 2026-04-26
Context
We use multiple models in different roles: Claude Code (Opus/Sonnet) for generation, Sonnet + GPT-5 for independent dual review, smaller models for focused analytical tasks. Most public discussion is about coding. We found almost no published methodology for using models in analytical research tasks (searched 2026-04-26). That gap is why we're tracking this.
Each experiment lives in its own file. See individual finding files below.