cyberneticlibrary

Test chat mode quality

test-chat-modesworkflowsetup L30
yatharthk2/odyssey
What it does

Test and judge chat routing modes

Best for

Comparing LLM retrieval strategies (KG vs vector, HyDE vs plain) with factual accuracy judges.

Inputs
  • · Question set
  • · Routing mode parameters
Outputs
  • · Quality report
  • · Mode comparison
Requires
  • · Python
  • · WebSocket
Preconditions

Required crates/files exist; Git repo initialized

Failure modes
  • · Agent timeout or output validation failure
  • · Merge/conflict during parallel work
  • · Blocking dependency unmet
Trust signals
  • · 2-vote or n-vote verification
  • · Spec-driven phase boundaries