Rodin
|
0f43934cb8
|
Add finding #67: Inter-document contradiction analysis
Sonnet 4 outperforms GPT-5 on inter-document contradiction analysis:
- More findings (5 vs 4)
- Faster (14s vs 136s)
- Better severity calibration (3 Critical vs 0 Critical)
Key insight: GPT-5's extended reasoning (9.7K tokens) doesn't pay off
for this task type. Inter-document comparison requires parallel pattern
matching, not serial verification.
|
2026-05-10 18:32:45 -07:00 |
|