Rodin faaa6d9c11 Finding #57: Event flow correctness analysis - new analytical lens
Tests a novel lens for event-sourced architectures: can all state be
reconstructed from documented events alone?
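
A minimal sketch of the reconstruction check this lens asks about, under stated assumptions: the `Event` type, the event names, and every field (`order_id`, `broker_ref`, and so on) are hypothetical and not taken from the analyzed system. It replays the documented events into a state and reports any observed field that the replay cannot reproduce.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    """Hypothetical documented event: a kind plus the fields it sets."""
    kind: str
    payload: dict


def replay(events):
    """Fold events in order into a state dict; later events overwrite earlier fields."""
    state = {}
    for event in events:
        state.update(event.payload)
    return state


def unreconstructable_fields(events, observed_state):
    """Return the observed fields that replaying the events does not reproduce."""
    rebuilt = replay(events)
    return sorted(
        key
        for key, value in observed_state.items()
        if key not in rebuilt or rebuilt[key] != value
    )


# Illustrative data only (assumed, not from the findings).
events = [
    Event("OrderPlaced", {"order_id": "o-1", "status": "placed"}),
    Event("OrderFilled", {"status": "filled", "fill_price": 101.5}),
]
observed = {
    "order_id": "o-1",
    "status": "filled",
    "fill_price": 101.5,
    "broker_ref": "b-9",  # no documented event sets this field
}

print(unreconstructable_fields(events, observed))  # ['broker_ref']
```

A non-empty result marks state that the documented events alone cannot explain, which is the kind of gap the lens is meant to surface.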

Key findings:
- GPT-5 brings external domain knowledge (broker APIs, compliance)
- Opus reasons through failure modes systematically (crash boundaries)
- Sonnet does rapid structural analysis (missing pieces)

21 unique findings across three models with only 5 in common.
Each model's reasoning style reveals different issue categories.

New pattern: event flow analysis exposes model reasoning styles
that gap-finding and contradiction detection don't surface.
2026-05-09 13:29:58 -07:00

Model Findings — Analytical & Research Work

Tracking what actually works (and doesn't) when using AI models for research, analysis, bias detection, and document review — not coding.

Started: 2026-04-26

Context

We use multiple models in different roles: Claude Code (Opus/Sonnet) for generation, Sonnet + GPT-5 for independent dual review, smaller models for focused analytical tasks. Most public discussion is about coding. We found almost no published methodology for using models in analytical research tasks (searched 2026-04-26). That gap is why we're tracking this.

Each experiment lives in its own file. See individual finding files below.