cyberneticlibrary

Build and optimize AI skills with evals

skill-creatorskillsetup L26
Sowiedu/Edict
What it does

Create and iteratively improve skills with eval loops and performance benchmarking

Best for

Building new skills when you need structured eval loops before production deployment.

Inputs
  • · Skill intent and trigger conditions
  • · Test prompts
  • · Evaluation criteria
Outputs
  • · Skill implementation
  • · Eval results with metrics
  • · Optimized skill description
Requires
  • · vitest
  • · eval-viewer script
Preconditions

Understand what the skill should do; ability to define test cases; eval framework installed

Failure modes

Skill intent too vague; insufficient test coverage; eval metrics don't match actual performance; description doesn't trigger

Trust signals
  • · Iterative refinement loop documented
  • · Quantitative evals run in background
  • · Skill description optimizer included