cyberneticlibrary

Evaluate chatbot prompts with consensus panel

ga-chatbot-qa-panelworkflowsetup L31
GuitarAlchemist/ga
What it does

Judge semantic correctness via multi-lens consensus panel

Best for

Semantic quality assurance using a multi-lens consensus panel.

Inputs
  • · topic: string
  • · category: string
  • · host: string
  • · limit: number
Outputs
  • · verdict: object (scores, winner, rationale)
Requires
  • · GaChatbot.Api
  • · YAML parsing
  • · bash/CLI
Preconditions
  • · Backend service running
  • · Corpus/test data available
Failure modes
  • · Non-deterministic output fails bit-identity gate
  • · Backend endpoint unreachable or malformed response
Trust signals
  • · Determinism/reproducibility gates
  • · Multi-judge consensus model with dissent tracking
  • · Frozen reference for regression detection