cyberneticlibrary

The library

Everything we index — ranked by what works, never by stars.

Artifacts Capabilities Recipes

kindall skill workflow mcp_server command subagent plugin|✓ proven ✗ harmful measured only65 artifacts

forSales Marketing HR Finance Legal Ops Product Engineering Data Productivity Supportsetup≤ plug & play ≤ + a key ≤ multi-tool

Score items with adversarial votingworkflowDataOpsL3

wf-swarm-score · Verifying adversarial score spreads and detecting hallucinating ants via 3-role consensus.

★367 WORKS 55

Synthesize multi-source research with verificationworkflowOpsDataL3

research · Exploratory research with parallel hypothesis agents and cross-check synthesis.

Grade adversarial test corpus with consensusworkflowEngineeringDataL3

ix-adversarial-llm-panel · Complex workflows requiring parallel agents, synthesized judgment, or multi-phase triage.

Measure software performance comprehensivelyworkflowEngineeringDataL3

measure-performance · Perf fixes where file groups are disjoint and can edit in parallel without race conditions.

Analyze speech synthesis gaps per languageworkflowEngineeringDataL3

espeak-audit-all-langs · Finding subtle bugs or policy violations across large codebases that require consistent multi-agent consensus.

Compare TTS voice quality with human audioworkflowEngineeringDataL3

espeak-audit-tts-vs-human · Finding subtle bugs or policy violations across large codebases that require consistent multi-agent consensus.

Research topics with citation verificationworkflowDataL3

investigate · Orchestrate multi-agent investigate across parallel branches and synthesis.

Detect and classify unknown platformsworkflowDataL3

detect-platforms · workflow automation in specialized problem domain; check artifact name and description.

Extract evidence from research papersworkflowDataL2

fieldatlas-deepread · Literature review requiring faithful extraction with verbatim evidence from long academic papers.

Tag and discover benchmark tasksworkflowProductDataL2

enrich-task-tags · Batch tagging homogeneous content (e.g., all AL benchmark tasks) against controlled vocabulary.

Extract structured data from Japanese PDF vocabularyworkflowDataL2

vocab-extract · Best for automating complex multi-phase vocab-extract processes at scale.

Extract and deduplicate reusable judgment patternsworkflowDataL3

compound-extract · Mining reusable patterns from code changes while deduplicating existing knowledge.

Run daily multi-tier model performance arenaworkflowDataL3

model-arena-daily · Benchmarking multi-tier LLM responses against a canonical prompt with regression detection and cost-efficiency scoring.

Generate state-of-field research reportworkflowProductDataL3

fieldatlas-synth · Grounding research idea generation in corpus citations with novelty-checking and feasibility critique.

Compare AI models dailyworkflowDataProductL3

model-arena-daily · Daily intelligence on which Claude tier is best for each task type. Cost-disciplined by design: 3 generators + 1 judge =

Audit research findings for completenessworkflowDataL3

grounding-audit · Parallel read-only audits against design claims with completeness criticism.

Generate daily research briefworkflowDataMarketingL3

research-pulse-daily · Daily lightweight research pulse: 3 parallel domain scans + synthesis.

Build company research source catalogworkflowDataL3

driver-menu-build · Interactive menu-driven workflows with dynamic branching.

Search from multiple angles in parallelworkflowDataL3

multi-modal-sweep · Orchestrate multi-phase multi-modal-sweep tasks with structured verification and recovery.

Debug trading indicator efficacyworkflowDataL3

indicator-why-believed · Orchestrate multi-phase indicator-why-believed tasks with structured verification and recovery.

Review papers across model tiersworkflowEngineeringDataL3

paper-review-fanout · Parallel review and triage workflows across multiple criteria or models.

Validate field atlas extractionsworkflowDataEngineeringL2

fieldatlas-validate · Environment or artifact verification with mechanical pass/fail detection against ground truth.

Sweep screenspot parameters single-variableworkflowEngineeringDataL3

screenspot-param-sweep · Best for adversarial evaluation across agent personas.

Active-learn classification prompt rulesworkflowDataEngineeringL3

classify-improve · improving classifiers via data-driven signal mining from real failures

Scout and combine research algorithmsworkflowDataL3

scout-and-combine · fan-out parallel research with synthesized results

Survey binary behavior across reposworkflowDataL3

rebench-binary-survey · Comprehensive multi-phase scan across heterogeneous source corpus with parallel validation.

Benchmark model conversion performanceworkflowDataL3

model-convert-benchmark · Multi-phase model-convert-benchmark orchestration with structured phase control and receipts.

Classify reference taxonomyworkflowDataL2

references-audit-classify · Comprehensive multi-phase scan across heterogeneous source corpus with parallel validation.

Load bulk contract data into BigQueryworkflowDataL3

cf-parallel-harvester-rawload · Append-loading UK Contracts Finder historical data (2016-2026) into BigQuery in resumable 2-month shards.

Extract entities from news articlesworkflowDataL2

news-graph-extract · Building knowledge graphs from news corpora where entities and relationships must be structured for Neo4j ingestion.

Catalog object-detection datasetsworkflowDataL3

find-kitchen33-datasets · When you need systematic discovery and verification of specialized datasets across multiple sources.

Verify retention tournament metricsworkflowDataL3

alberta-retention-tournament-verify · When you need multi-phase orchestration with parallel agents across a complex workflow.

★18,530 WORKS 60

Build driver catalogs by industryworkflowOpsDataL3

driver-menu-build · Building curated driver catalogs from event sources with blind naming.

Analyze with adversarial score verificationworkflowDataL3

wf-analyze · Go/no-go decisions on architecture changes when you want scored evidence from multiple dimensions plus adversarial challenge built in.

★368 WORKS 51

Research topics across multiple sourcesworkflowOpsDataL3

research · Synthesizing evidence on academic/technical topics when you need multi-modality (text+code+data) sweep plus two-stage extraction-enrichment pipeline.

Autonomous ML research loopworkflowDataEngineeringL3

{PROJECT_NAME}-autoresearch · Use when workflow operations need {PROJECT_NAME} autoresearch.

Extract and merge literature reviewworkflowDataProductL3

paper-review-literature · Use when workflow operations need paper review literature.

Adversarially review research paperworkflowDataOpsL3

review-paper · Use when workflow operations need review paper.

Fan-out deep research with citationsworkflowDataMarketingL3

deep-research · Use when workflow operations need deep research.

Research and verify specificity audit methodologyworkflowDataL3

step3-specificity-methodology · Auditing bispecific therapeutic window via parallel research, synthesis, and adversarial critique.

Ingest and index large books in parallelworkflowDataL3

ingest-mcluhan-understanding-media · Ingesting large multi-chapter books with deduplication and cross-chunk synthesis via chunked planners.

Cross-check research claims with sourcesworkflowDataL2

deep-research · Running multi-source research with web fetching, fact verification, and cited synthesis.

Fan-out research with adversarial fact-checkworkflowOpsDataL3

research · Fan-out web research, dedup sources, synthesize with citations, and fact-check claims.

Multi-panel refute findings with consensusworkflowDataOpsL3

verify-findings · Verify code audit findings using 3 independent skeptic lenses.

Execute deterministic research pipelineworkflowOpsDataL3

research-pipeline · Execute research through fan-out collection, dedup, and analysis phases.

★393 WORKS 51

Extract and verify factual claimsworkflowOpsDataL2

claim_verifier · Verify document claims are grounded in cited sources.

Scan conformal prediction literatureworkflowDataEngineeringL3

cp-lit-scan · Systematically surveying a research frontier across multiple sub-topics to identify novel research gaps.

Curate tech community research digestworkflowOpsDataL2

research-digest · Aggregating research signals from 7 heterogeneous sources with real-time relevance ranking and dedup.

Run ablation study on hardest problemsworkflowDataL3

mhpp-10-ablation · Ablation studies where solver agents invoke external tools themselves (agentic-tool pattern, not pre-generation).

Backfill public contracts data to BigQueryworkflowDataL4

contracts-finder-backfill · Large-scale procurement time-series ingestion where idempotency and resumability are critical.

page 1 / 2→