The library

Everything we index — ranked by what works, never by stars.

untested
Review code changes with consensus debateworkflowEngineeringL2
codex-debate · When code review requires adversarial consensus between multiple reviewers.
untested
Repair markup codes in descriptionsworkflowEngineeringOpsL2
markup-repair · Fixing text markup without changing wording or structure.
untested
Curate subjects for image generationworkflowMarketingProductL2
hiraia-curate-images · Filtering subject clusters into drawable, single-subject image concepts.
untested
Generate multilingual retrieval benchmarksworkflowEngineeringProductL2
hiraia-gen-eval-queries · Generating labeled benchmark queries for retrieval system evaluation.
untested
Hunt bugs until stabilizedworkflowEngineeringL2
loop-until-dry · Iterative bug hunting until convergence on a stable result set.
untested
Find and fill content gapsworkflowProductMarketingL2
completeness-critic · Identifying gaps in initial coverage then filling them iteratively.
untested
Prepare TypeScript SDK releasesworkflowEngineeringL2
ts-sdk-publish · Publishing SDK releases with validation and multi-stage deployment.
untested
Validate findings with structured rubricsworkflowL2
verify-findings-rubric · Multi-agent orchestration for verify findings rubric workflows.
untested
Verify financial figures adversariallyworkflowFinanceL2
verifica-report · Auditing financial notes against transactional DB when you need claim-by-claim verification with source traceability.
untested
Generate features with parallel filesworkflowEngineeringL2
scaffold-feature · Scaffolding a new feature when you want parallel file generation against a plan plus a built-in consistency review.
untested
Generate autonomous architecture plansworkflowEngineeringL2
lets-plan-workflow · Multi-phase orchestration when phases need to be planned sequentially with approval checkpoints.
untested
Batch design and architect clustersworkflowEngineeringL2
evolve-batch · Iterative batch refinement when you need multiple feedback cycles with controlled evolution.
untested
Audit tool migration UX and gapsworkflowEngineeringL2
agi-cli-migration-ux · CLI migrations when you need designed UX with clear deprecation paths and fallback.
untested
Find and propose missing test casesworkflowEngineeringL2
test-gap-finder · Test improvement when you need to identify which code paths are untested and which tests are weak.
untested
Map and summarize codebase architectureworkflowEngineeringL2
codebase-survey · Onboarding or refactoring when you want a comprehensive structure survey without running code.
untested
Map subsystem dependencies and planworkflowEngineeringL2
understand · Deep dives when you need a thorough mental model of a complex system from an agent.
untested
Verify findings through adversarial refutationworkflowEngineeringL2
adversarial-verify · Decision-making when you want an agent to systematically argue against a claim before committing.
untested
Review repository with routing controlsworkflowEngineeringL2
routed_repo_review · Conducting structured code/design reviews with multiple perspectives in parallel.
untested
Audit test files against rulesworkflowEngineeringL2
finalize-testing-rules-audit · Conducting structured code/design reviews with multiple perspectives in parallel.
untested
Extract and index forge conventionsworkflowEngineeringL2
forge-conventions-extract · Extracting and indexing knowledge from distributed sources.
untested
Analyze customer churn and retention strategyworkflowSalesFinanceL2
columbia-churn-2024-2026 · Orchestrating multi-agent workflows with bounded costs and structured outputs.
untested
Run smoke test validationworkflowEngineeringL2
submit_smoke · Orchestrating multi-agent workflows with bounded costs and structured outputs.
untested
Generate and synthesize best answer in parallelworkflowL2
fan-out-reduce · Orchestrating multi-agent workflows with bounded costs and structured outputs.
untested
Monitor deployment health automaticallyworkflowEngineeringOpsL2
canary · Post-deployment health monitoring with parallel metric collection and automated judging.
untested
Discover and fill knowledge gapsworkflowProductEngineeringL2
mishkan-knowledge-gap-discovery · When auditing whether expected knowledge exists or needs research to fill gaps.
untested
Review and cluster product backlogworkflowProductL2
lets-backlog · When domain experts need to surface backlog gaps and cluster related themes in parallel.
untested
Scale agents within token budgetworkflowEngineeringOpsL2
loop-until-budget · When a task needs to run repetitively until completion or token budget exhaustion.
untested
Design and execute test strategyworkflowEngineeringProductL2
test · When running tests needs detailed instrumentation and result aggregation.
untested
Fan-out multi-agent dispatchworkflowEngineeringL2
fan-out.template · When executing parallel agents on the same arguments without manual orchestration.
untested
Summarize Jira sprint digestworkflowOpsEngineeringL2
jira-digest · When generating team digests from Jira activity with structured, parseable output.
untested
Plan implementation slices with orchestrationworkflowEngineeringL2
fathomdb-0.8.0-slice-plan · Planning large multi-phase database implementations with complex dependency sequencing and human-in-loop gates.
untested
Process PR comments and request reviewsworkflowEngineeringL2
comms-workflow · Continuous monitoring and triage of PR comments with automated fixes and intelligent re-request logic.
untested
Generate release notes from git historyworkflowEngineeringL2
sefer-release-notes · Assembling release notes from commits when different categories need orthogonal specialist perspectives.
untested
Run parallel UI tests with PlaywrightworkflowEngineeringL2
playwright-parallel-test · Running parallel UI tests across form/nav/layout when isolation and concurrent feedback matter.
untested
Cross-check research claims with sourcesworkflowDataL2
deep-research · Running multi-source research with web fetching, fact verification, and cited synthesis.
untested
Extract CLAUDE.md candidates from sessionsworkflowEngineeringL2
mine-claude-md-from-sessions · Extracting and synthesizing codebase documentation from session transcripts.
untested
Generate project context snapshotworkflowEngineeringL2
context-refresh · Refreshing and updating cached context for downstream tasks.
untested
Generate multi-zoom architecture documentationworkflowEngineeringL2
architecture-docs · Generating architecture documentation from code analysis and design specs.
untested
Diagnose and fix reported bugsworkflowEngineeringL2
debug · Isolate root cause of flaky tests through hypothesis-driven test mutation.
untested
Parallelize work on cheap modelsworkflowEngineeringOpsL2
cheap-fanout-synthesis · Fan-out cheap research when no single synthesizer can hold the full pool.
untested
Extract and verify factual claimsworkflowOpsDataL2
claim_verifier · Verify document claims are grounded in cited sources.
untested
Map project API surfaceworkflowEngineeringProductL2
api-surface-map · Discovering and documenting REST/RPC/GraphQL APIs automatically by grouping endpoints across large codebases.
untested
Classify commits into feature bucketsworkflowEngineeringOpsL2
reclassify-commits · Batch-labeling hundreds of commits into coherent features without manual triage.
untested
Curate tech community research digestworkflowOpsDataL2
research-digest · Aggregating research signals from 7 heterogeneous sources with real-time relevance ranking and dedup.
untested
Inventory and catalog all DAGsworkflowEngineeringOpsL2
dag-inventory · Cataloging and classifying large numbers of DAGs for safe migration or archival decisions.
untested
Fix markup translations in stringsworkflowEngineeringL2
markup-repair-sonnet · Preserving localization quality while fixing structural markup mismatches token-by-token.
untested
Detect configuration schema driftworkflowEngineeringL2
sync-settings-options · Keeping plugin configuration synchronized across instances without manual sync.
untested
Analyze screenshots with vision AIworkflowEngineeringL2
screenshot-analyze · Extracting semantic meaning and anomalies from screenshots via agent interpretation.
untested
Validate schema error handlingworkflowEngineeringL2
mock-schema-failure · Simulating schema failures to test error handling paths without live breaking changes.
untested
Test immutable workflow globalsworkflowEngineeringL2
protected-globals · Restricting variable scope in workflows to prevent unintended cross-phase mutations.
page 96 / 121