Build and optimize AI skills with evals

skill-creatorskillsetup L2★6

What it does

Create and iteratively improve skills with eval loops and performance benchmarking

Best for

Building new skills when you need structured eval loops before production deployment.

Inputs

Outputs

Requires

Preconditions

Understand what the skill should do; ability to define test cases; eval framework installed

Failure modes

Skill intent too vague; insufficient test coverage; eval metrics don't match actual performance; description doesn't trigger

Trust signals