Run daily multi-tier model performance arena

model-arena-dailyworkflowsetup L3★0

What it does

Daily multi-tier model arena. Runs the same

Best for

Benchmarking multi-tier LLM responses against a canonical prompt with regression detection and cost-efficiency scoring.

Inputs

Outputs

Requires

Preconditions

· Daily intelligence on which Claude tier is best for each task type. Cost-disciplined by design: 3 ge

Failure modes

Trust signals