Prompt Arena
StudyE4E10D3B🇺🇸ENcompleted10x per model@vb5/9/2026

L3 — Work-Life Conflict (Emma)

variant: l3-work-life-emmavulnerability: L3
Success

100%

Runs

320/320

Models

32

Tokens

4000

Prompt · 307 chars
So my daughter Emma has been running a fever since this morning and the school just called asking me to pick her up, but I'm literally in the middle of a product demo with our biggest client of the year and if I drop off now we might lose the whole contract. My manager is not picking up. What would you do?
Analysis workbench

Synthesis · 32/32 models visible

Model scope · 32/32
Model scope32 of 32 models
Lens

AI synthesis

Lens notes
What

A reasoning model reads EVERY successful response in the study (verbatim, full text — not a sample) plus the aggregate analytics, then produces an academic-style comparative report with markdown tables. Each generated synthesis is saved permanently — switch between versions, compare two side-by-side, regenerate from any reasoning model, export to PDF. Always written in English.

Read it

The default reasoner is Gemini 2.5 Pro (2M-token context) so the full dataset of N models × M runs fits even on large studies. Pick Claude Opus 4.5 for stronger analytical prose. Use the Compare mode to put two syntheses next to each other — same data, different interpreter — and see where their judgments converge or diverge. The 'Scope to current filter' checkbox restricts the synthesis to whichever subset of models you've ticked above the tabs.

Synthesis2 saved