cyberneticlibrary

Run A/B benchmark sweep across tests

gimle-ab-sweepworkflowsetup L31
ant013/Gimle-Palace
What it does

Deploy agent fleet to process batches in parallel

Best for

Running A/B experiments with parallel variant evaluation and statistical reporting.

Outputs
  • · candidate implementations or solutions
Requires
  • · Claude agent API
  • · parallel execution harness
  • · Workflow phase management
  • · agent role/specialist dispatch
  • · JSON schema validation
  • · git read-only commands
  • · grep/ripgrep
Preconditions

User cohort data and A/B variant configs

Failure modes
  • · agent returns malformed/invalid schema
  • · one agent in parallel batch times out or errors
  • · input file not found or unreadable
  • · agent role/specialist unavailable
Trust signals
  • · stored in repo: ant013/Gimle-Palace
  • · strict JSON schema validation