cyberneticlibrary

Judge platform answer quality with KB on

analyze-platform-faithfulnessworkflowsetup L30
guli20001221/compshare-agent
What it does

Judge KB system drift via multi-run comparison

Best for

Verify knowledge-base integration does not degrade platform-FAQ answers.

Inputs
  • · test probes or findings array
Outputs
  • · verdict object (refuted/confirmed)
  • · result object
Requires
  • · agent()
  • · pipeline()
Preconditions
  • · Agent runtime available (Claude Code Workflow)
Failure modes
  • · Retry exhaustion on complex multi-step reasoning
Trust signals
  • · StructuredOutput schema validation
  • · Verification phase cross-validates results