cyberneticlibrary

ab-experimentation

Designing, running, analyzing, and interpreting A/B tests with statistical rigor.

no-skill baseline: 100% — anything below it makes the model worse.

⚖ Measured verdict: the base model already handles this capability well — every tested candidate degraded output. Recommended: no artifact at all for this step.
1WORKS 58
ads-testskilldefault
Ad teams avoiding "winner calling" errors and ensuring statistical rigor in creative testing.
2WORKS 58
ab-testing-frameworkskill
Quantifying impact of CTA, copy, or design changes before full rollout when conversion volume is sufficient for statistical power
3WORKS 58
ab-testingskill
Product and growth teams validating feature changes or messaging hypotheses before rolling out to 100% of users.
4WORKS 57
ab-test-analysisskill
Data-driven product decisions when A/B test results need validation against statistical rigor and guardrail constraints before shipping.
5WORKS 57
experiment-designerskill
Any decision backed by A/B test; forces pre-commitment to success criteria, prevents peeking and goalpost movement, separates statistical from practical significance.
6WORKS 56
statistical-analystskill
Rigorously validating A/B test results when business decisions depend on statistical confidence.
7WORKS 55
lean-ux-canvasskill
Running low-cost, high-speed validation before committing to expensive builds.
8WORKS 55
statistical-analystplugin
Validating A/B experiment results with rigorous statistics instead of eyeballing conversion rates or running tests that are too small
9WORKS 54
ab-test-setupskill
Validating product hypotheses with statistical rigor and measuring impact on business metrics.
10WORKS 54
ab-testingskill
When designing statistically valid experiments with pre-committed sample sizes and rigor.
11WORKS 52
ab-methodskill
When you need statistical validation of a change before rollout rather than anecdotal observation.
12WORKS 51
experiment-designerskill
Product teams running defensible experiments with clear success criteria and statistical stopping rules.
13WORKS 50
ab-testingskill
Validating product hypotheses where statistical rigor and p-value confidence matter more than speed.
14WORKS 50
analyze-testcommand
Validating A/B test results with statistical rigor before deciding to ship a variant.
15WORKS 50
surge-experimentskill
Growth PMs designing experiments where you need clarity on mechanism, sample size, and decision criteria before engineering starts building.
16WORKS 40
experimental-design-dsskill
When designing experiments, planning a/b tests, calculating sample sizes, or reasoning about causation from data.
17WORKS 8
growth-engineskill
Teams running 5+ simultaneous A/B tests across channels who need automated winner detection with statistical rigor