Design and score marketing experiments

growth-engineskillsetup L1★2,362

Causal-lift measurements

ab-experimentation-27pp vs no-skill baselinewith-skill 73% · baseline 100%

Measured by running the task with and without this artifact, K=5, graded by deterministic checks — no LLM judging.

What it does

Design and score marketing experiments with statistical analysis

Best for

Teams running 5+ simultaneous A/B tests across channels who need automated winner detection with statistical rigor

Inputs

Outputs

Preconditions

Python 3, experiment data directory, telemetry consent

Failure modes

small sample sizes (<15 samples/variant) reduce statistical power; manual min-samples override needed for low-volume channels

Trust signals