cyberneticlibrary

Generate multilingual retrieval benchmarks

hiraia-gen-eval-queriesworkflowsetup L20
helloluis/hiraia
What it does

Generate a labeled retrieval benchmark: realistic kid

Best for

Generating labeled benchmark queries for retrieval system evaluation.

Inputs
  • · agent context (workspace, diffs, or data)
Outputs
  • · QSCHEMA schema result
  • · NEG_SCHEMA schema result
Requires
  • · agent() orchestrator
  • · pipeline() runner
  • · JSON schema validation
Preconditions
  • · Workflow runtime initialized
  • · Input args properly structured
  • · 2 agent(s) available
Failure modes
  • · Agent timeout or failure
  • · Schema validation mismatch
  • · Input data incomplete or malformed
Trust signals
  • · Strict schema validation with required fields
  • · Explicit phase tracking and logging
  • · 2 independent agents with schemas