Files

T

Rodin 8adf09b3fb Add security boundary analysis experiment (2026-05-10)

New analytical lens: security boundary analysis — identifying where trust
assumptions cross component boundaries in exploitable ways.

Document: gargoyle system-overview.md (323 lines)
Models: Claude Opus (15 findings), Claude Sonnet 4 (10 findings)

Key finding: Opus identified that transient signals (a performance design
choice) create a structural security vulnerability — malicious strategies
can probe risk limits without leaving any audit trail.

This experiment establishes security boundary analysis as a distinct,
viable analytical task type for architecture review.

2026-05-10 16:05:45 -07:00

8.5 KiB

Raw Blame History

Security Boundary Analysis: A New Analytical Lens

Date: 2026-05-10 Task: Identify security boundary violations and trust assumption gaps in gargoyle's system-overview.md (323 lines) — a high-level architecture document describing component interactions, ports, bounded contexts, and domain invariants. How we used them: Same document (full text) + same focused analytical question to both models via HAI proxy. Highly structured prompt specifying 5 categories of security analysis: trust boundary crossings, privilege escalation paths, data integrity assumptions, authentication/authorization gaps, and audit trail exploitability. Required specific output format per finding. No tools, no project context beyond the document itself.

Model	Time	Output tokens	Findings	Critical	High	Medium	Low
Claude Sonnet 4	~18s	1,343	10	2	4	4	0
Claude Opus	~180s*	3,370	15	3	9	3	0

*Opus time approximate due to timeout during collection

What They Found — Common Ground (both identified)

Both models independently identified these core security concerns:

MarketDataIngestion trust (Finding 1 in both): The shared tick distribution mechanism lacks integrity verification — downstream components trust that ticks are authentic without validation. A compromised adapter could inject false prices affecting all users simultaneously.
BrokerAdapter fill trust: Fill events are treated as authoritative ("ground truth") without authentication. A compromised adapter could inject fabricated fills, corrupting position state across all downstream components.
Instrument ID manipulation: The resolution from raw ticker to instrument_id at the ingestion boundary could be manipulated to cause trades for wrong securities.
User isolation enforcement: User instances are described as "isolated" but the enforcement mechanism isn't specified. Both models questioned whether process-level isolation prevents cross-user message injection.
Kill switch authorization: The kill switch can be engaged by "Operator/System" but authentication/authorization requirements aren't specified.
Audit trail integrity: The decision_id/signal_id linking is referential rather than cryptographically bound, allowing potential falsification.
PortfolioMonitor privilege: PortfolioMonitor can issue "close-only" orders directly to OrderManager without described authorization verification.
Reconciliation trust: The reconciliation process trusts broker-provided data without verification of authenticity.

Opus Unique Findings (not in Sonnet)

Signal reconnaissance via audit blindspot: Since "signals are never persisted" and only "relevant" signals are logged, a malicious strategy could emit thousands of probing signals that never reach rejection or aggregation thresholds — leaving no audit trail while conducting reconnaissance on risk limits and portfolio state. This is architecturally significant — it identifies that the audit model has a structural blind spot by design, not by oversight.
Paper trading state leakage: The design's explicit "substitutability" principle means the system cannot distinguish paper simulator from production broker. Any configuration error or migration bug could cause simulated fills to pollute production records.
Corporate action injection: The "Lot adjusted" event for corporate actions lacks specified authorization — an attacker with access could inject fake splits/dividends to manipulate positions and cost basis.
Administrative action audit gap: The domain events table lists trading events but not administrative operations (risk limit changes, kill switch state, reconciliation overrides). Operator actions could be modified without any audit trail.
Cross-user event observation: Although users have "isolated" instances, they all subscribe to the same shared tick event stream. Depending on implementation, a compromised instance might observe other users' tick consumption patterns through timing attacks.

Sonnet Unique Findings (not in Opus)

Cross-user fill routing: Sonnet explicitly identified that BrokerAdapter must correctly route fills to the right user's OrderManager, but there's no verification mechanism described. A compromised adapter could send User A's fills to User B. Opus mentioned adapter compromise for injection but didn't specifically address the routing problem.

Quality Assessment

Claude Opus produced the most findings (15 vs 10) with notably deeper reasoning about second-order effects. The signal reconnaissance finding (#5 in Opus) is the most architecturally significant discovery — it identifies that a core design choice (transient signals for performance) creates a structural security vulnerability that can't be fixed without changing the persistence model. The paper trading state leakage finding shows reasoning about how substitutability principles can backfire. Opus also uniquely identified the administrative audit gap — a compliance-critical finding for a trading system.
Claude Sonnet 4 was remarkably fast (18s vs ~180s) and found 10 solid issues. The cross-user fill routing finding shows good boundary reasoning. Sonnet's findings were well-structured and actionable. However, Sonnet stayed closer to the explicit trust boundaries in the document, while Opus reasoned about implications of design principles (like substitutability and signal transience).

Key Insight — Security Boundary Analysis as a Task Type

This is a genuinely NEW analytical lens not previously tested. Unlike:

Assumption-finding: What must be true for this to work?
Race condition identification: What timing interleavings cause problems?
Design coherence: Does the document contradict itself?

Security boundary analysis asks: Where can a malicious or compromised component affect things beyond its intended scope?

This requires reasoning about:

What each component can see (data visibility)
What each component can do (action scope)
How trust is established (or not) at boundaries
What happens when trust assumptions are violated

The models performed well because the document explicitly describes component boundaries and interactions. Both models successfully identified that "isolation" is claimed but not specified, that "ground truth" status doesn't mean "verified authentic," and that audit coverage has structural gaps.

Comparison to Previous Task Types

Analytical lens	Primary reasoning mode	Opus strength?	Sonnet strength?
Assumption-finding	What's unstated	✓	✓
Race conditions	Temporal interleavings	✓	✗ (per Finding #13)
Design coherence	Self-contradiction	✓	Mixed
Security boundaries	Adversarial scope	✓✓	✓

Security boundary analysis appears to favor reasoning models (Opus) because it requires modeling adversarial behavior and reasoning about trust transitivity. However, Sonnet performed better here than on race conditions — suggesting that security analysis is more about boundary enumeration than temporal reasoning.

Practical Implications

Security boundary analysis is viable for architecture review. Both models produced actionable findings that would matter for a real trading system.
The audit blindspot finding is worth pursuing. Opus's insight that transient signals create a reconnaissance capability is a genuine security design flaw. This should be added to gargoyle's security review backlog.
Run this lens on other architecture docs. The technique worked well on a system overview. Would it work on more detailed component docs? Would findings overlap with assumption-finding, or remain distinct?
Opus's depth justifies the time cost for security-critical analysis. The 10x time difference produced 50% more findings AND higher-order insights. For security review, the extra investment is worth it.

Next Experiments

Run security boundary analysis on a detailed component doc (e.g., order-execution.md) to see if findings overlap with Finding #12's assumption analysis.
Test whether adding an explicit "adversarial actor model" to the prompt changes the findings (e.g., "assume the Strategy Worker author is malicious").
Compare against GPT-5 when available — does reasoning-token-heavy analysis produce different security insights than Opus's internal reasoning?

8.5 KiB Raw Blame History