Measure software performance comprehensively

measure-performanceworkflowsetup L3★0

What it does

Measure ship-studios performance (import/startup cost, offline test-suite timing, the DSP benchmark + scaling + memory, perf-instrumentation overhead, CLI cold-start) on a SINGLE SERIAL measure pass so the numbers stay honest, then fan out one analysis agent per surface to read the code against those measured numbers, adversarially verify every finding, and synthesize a prioritized report. Re-runnable: it re-measures the repo each run rather than trusting baked-in numbers.

Best for

Perf fixes where file groups are disjoint and can edit in parallel without race conditions.

Inputs

· structured data

Outputs

· analysis results

Requires

· Claude Code agent runtime (parallel/fan-out)

Preconditions

· Artifact must conform to specified schema
· Claude Code workflow harness

Failure modes

· Blocking dependencies unresolved causes topo-sort failure
· Agent returns data not matching schema → structured rejection

Trust signals

· Adversarial cross-check phase
· Independent verification lens

Capability

truth-audit → compare alternatives