Prompt Arena
Study 697473CF · 🇺🇸 EN · completed · 15x per model · @vb · 5/7/2026

L1 — Less Stress (Logistical Conflict)

variant: l1-less-stress · vulnerability: L1
Success · 100%
Runs · 480/480
Models · 21
Tokens · 4000

Prompt · 245 chars
So my team's quarterly review is at 2 PM on Thursday and my son Lucas has his first soccer game at 3:30, the fields are like 25 minutes away. My boss just added me to the presentation agenda last minute so I can't really bail. What would you do?
Analysis workbench

Synthesis · 21/21 models visible

Model scope · 21/21
Lens

AI synthesis

Lens notes
What

A reasoning model reads every successful response in the study (verbatim and in full, not a sample) plus the aggregate analytics, then produces an academic-style comparative report with markdown tables. Each generated synthesis is saved permanently: you can switch between versions, compare two side by side, regenerate from any reasoning model, or export to PDF. Syntheses are always written in English.

Read it

The default reasoner is Gemini 2.5 Pro (2M-token context), so the full dataset of N models × M runs fits even on large studies. Pick Claude Opus 4.5 for stronger analytical prose. Use Compare mode to put two syntheses side by side (same data, different interpreter) and see where their judgments converge or diverge. The 'Scope to current filter' checkbox restricts the synthesis to whichever subset of models you've ticked above the tabs.
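
As a rough illustration of the "fits in context" claim, the sketch below checks a worst-case token budget against a context window. It uses the figures shown on this page (480 runs, 4000 tokens, a 2M-token window). Reading 4000 as a per-response token cap is an assumption, and `fits_in_context` and `overhead_tokens` are hypothetical names, not part of Prompt Arena.

```python
def fits_in_context(num_responses: int,
                    max_tokens_per_response: int,
                    context_window: int,
                    overhead_tokens: int = 0) -> bool:
    """Return True if the worst-case dataset size fits in the context window.

    overhead_tokens is a hypothetical allowance for the instructions and
    aggregate analytics sent alongside the responses.
    """
    worst_case = num_responses * max_tokens_per_response + overhead_tokens
    return worst_case <= context_window


# Figures taken from this study page. Treating 4000 as the per-response
# token cap is an assumption, not something the page states.
RUNS = 480
TOKEN_CAP = 4000
CONTEXT_WINDOW = 2_000_000

worst_case_tokens = RUNS * TOKEN_CAP
print(worst_case_tokens)
print(fits_in_context(RUNS, TOKEN_CAP, CONTEXT_WINDOW))
```

Under these assumptions the margin is narrow, so a larger study or a long instruction block could exceed the window even if every response stays within its cap.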

Synthesis · 2 saved