Measure software performance comprehensively
measure-performanceworkflowsetup L3★0
Blankenship-Daniel/ship-studios ↗What it does
Measure ship-studios performance (import/startup cost, offline test-suite timing, the DSP benchmark + scaling + memory, perf-instrumentation overhead, CLI cold-start) on a SINGLE SERIAL measure pass so the numbers stay honest, then fan out one analysis agent per surface to read the code against those measured numbers, adversarially verify every finding, and synthesize a prioritized report. Re-runnable: it re-measures the repo each run rather than trusting baked-in numbers.
Best for
Perf fixes where file groups are disjoint and can edit in parallel without race conditions.
Inputs
- · structured data
Outputs
- · analysis results
Requires
- · Claude Code agent runtime (parallel/fan-out)
Preconditions
- · Artifact must conform to specified schema
- · Claude Code workflow harness
Failure modes
- · Blocking dependencies unresolved causes topo-sort failure
- · Agent returns data not matching schema → structured rejection
Trust signals
- · Adversarial cross-check phase
- · Independent verification lens