cyberneticlibrary

Evaluate AI work quality

self-eval skill setup L117,464
alirezarezvani/claude-skills
What it does

Evaluate agent capability gaps and suggest skill improvements

Best for

When analyzing agent performance to identify missing capabilities and plan skill development.

Inputs
  • agent performance logs
  • failed task samples
  • capability matrix
Outputs
  • gap analysis report
  • skill recommendations
  • priority-ordered skill plan
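
The mapping from these inputs to these outputs can be illustrated with a minimal sketch. It is not the skill's actual implementation: the `TaskRecord` type, the `gap_analysis` and `skill_plan` functions, the `min_samples` guard, and the 0.5 failure-rate threshold are all hypothetical names and values chosen for illustration. The sketch assumes each log entry lists the capabilities a task exercised and whether the task succeeded, and that the capability matrix can be reduced to a list of capability names.

```python
from __future__ import annotations

from collections import defaultdict
from dataclasses import dataclass


@dataclass
class TaskRecord:
    """One entry from the agent performance logs (hypothetical shape)."""
    task_id: str
    capabilities: list[str]  # capabilities the task exercised
    succeeded: bool


def gap_analysis(records, capability_matrix, min_samples=3, threshold=0.5):
    """Return one report row per capability in the matrix.

    Capabilities with fewer than `min_samples` attempts are marked
    "insufficient data" rather than scored, so sparse logs do not
    produce confident-looking conclusions.
    """
    attempts = defaultdict(int)
    failures = defaultdict(int)
    for record in records:
        for cap in record.capabilities:
            attempts[cap] += 1
            if not record.succeeded:
                failures[cap] += 1

    report = []
    for cap in capability_matrix:
        n = attempts[cap]
        if n < min_samples:
            report.append({"capability": cap, "attempts": n,
                           "failure_rate": None, "status": "insufficient data"})
            continue
        rate = failures[cap] / n
        status = "gap" if rate >= threshold else "ok"
        report.append({"capability": cap, "attempts": n,
                       "failure_rate": rate, "status": status})
    return report


def skill_plan(report):
    """Order confirmed gaps by failure rate, highest first (ties by name)."""
    gaps = [row for row in report if row["status"] == "gap"]
    gaps.sort(key=lambda row: (-row["failure_rate"], row["capability"]))
    return [row["capability"] for row in gaps]


if __name__ == "__main__":
    logs = [
        TaskRecord("t1", ["sql"], False),
        TaskRecord("t2", ["sql"], False),
        TaskRecord("t3", ["sql", "parsing"], True),
        TaskRecord("t4", ["parsing"], True),
        TaskRecord("t5", ["parsing"], False),
        TaskRecord("t6", ["planning"], False),
    ]
    report = gap_analysis(logs, ["sql", "parsing", "planning"])
    for row in report:
        print(row)
    print("plan:", skill_plan(report))
```

A failure on a task that exercised several capabilities is counted against each of them here, which is one way the "false causation" failure mode can creep in. A fuller analysis would cross-check against the failed task samples before recommending a skill.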
Preconditions

Requires an agent performance history and clearly defined capabilities.

Failure modes
  • sparse logs
  • false causation
  • overfit to recent tasks
Trust signals
  • metrics-driven analysis
  • multi-source evidence
  • actionable recommendations