model-research/findings/2026-04-26-04-gpt5-defaults-to-delegation-claude.md

# Finding 4: GPT-5 defaults to delegation; Claude defaults to doing the work

**Date:** 2026-04-26
**Task:** PR review delegation to sub-agents
**How we used them:** Both spawned as sub-agents from main session with
same task description, same pr-review skill file, same Gitea credentials.
Difference: GPT-5 got model override to gpt5, Sonnet used default model.
Both got full skill instructions.

- GPT-5 first attempt: spawned sub-sub-agents and timed out
- GPT-5 with "do it yourself, no sub-agents" + step-by-step: worked
- Even with constraints, GPT-5 sometimes dumps raw tool output instead of
  synthesizing — needs explicit output format instructions
- Claude (Sonnet/Opus) given the same kind of task does the work directly
- **Takeaway:** GPT interprets complex task descriptions as delegation
  opportunities. Claude interprets them as work to do. For GPT: explicit
  single-actor instructions + output format. For Claude: can give broader
  mandate. Same skill file, very different behavior.