The library

Everything we index — ranked by what works, never by stars.

untested
Research Bicep language feature gapssubagentEngineeringL2
bicep-researcher · Identifying undocumented Bicep features (new decorators, type system, assertions, extensions) that should be added to bicep-docs to keep documentation current with language releases.
untested
Package and audit contest submissionssubagentL2
mathodology-submission-packager · Final contest submission quality gate—verifies that no secrets/caches exist, all figures are reproducible from source, outside users can run the package, and submission complies with all rules.
untested
Build macOS automation with ShortcutssubagentOpsL2
system-engineer · Integrating native macOS system capabilities (Spotlight search via mdfind, file metadata via mdls, Finder tags via xattr, Shortcuts automation) into Swift tools via test-driven patterns.
untested
Diagnose frontend build and TypeScript errorssubagentEngineeringL2
build-validator · Rapidly fixing TypeScript/ESLint/Vite build breakages in a strict React 19 + Express 5 monorepo by identifying the exact root cause (type narrowing, decorator syntax, postinstall failure) and applying minimal fixes.
untested
Implement frontend from design specssubagentEngineeringL2
frank · Production Next.js component implementation from PRDs that faithfully follows design specs, validated in real browser across mobile/tablet/desktop viewports with zero console warnings.
untested
Update project documentationsubagentEngineeringL1
writer · Producing original long-form thought leadership and explainers on emerging topics when subject-matter expertise is available and citations are verifiable.
untested
Verify development plans achieve goalssubagentEngineeringL2
gsd-plan-checker · Catching plan incompleteness and hidden assumptions before executing a complex multi-month project that would otherwise fail due to missing coordination or unresolved dependencies.
untested
Validate implementations against specsubagentEngineeringL2
critic · Independent quality gate after delivery that identifies whether work actually meets the brief (not self-assessed), separates critical fixes from polish, and provides actionable rework guidance.
untested
Write and validate test coveragesubagentEngineeringL2
team-tester · Verifying that frontend, backend, and infrastructure pieces actually work together in realistic scenarios before handing off to production ops.
untested
Manage quiz questions in CSVsubagentProductL2
add-quiz-questions · Managing multi-language quiz CSV with AI generation and duplicate detection without manual scripts.
untested
Audit code for security vulnerabilitiessubagentEngineeringL2
security · Auditing code for authorization, RLS, and authentication vulnerabilities before launch.
untested
Detect accessibility patterns across pagessubagentEngineeringL2
cross-page-analyzer · Computing accessibility severity scores and cross-page patterns faster than manual auditing.
untested
Evaluate AI behavior with LLM evalssubagentEngineeringL2
ai-eval-engineer · Validating LLM behavior against strict criteria (structure, safety, cost) that string assertions cannot verify.
untested
Audit Astro site for SEOsubagentMarketingL2
seo-audit · Identifying crawlability and metadata issues systematically across static Astro sites.
untested
Design system architecture and decisionssubagentEngineeringL2
architect · Auditing code for authorization, RLS, and authentication vulnerabilities before launch.
untested
Review code and flag violationssubagentEngineeringL1
code-inline-reviewer · Validating LLM behavior against strict criteria (structure, safety, cost) that string assertions cannot verify.
untested
Resolve build and type errorssubagentEngineeringL2
build-error-resolver · Auditing code for authorization, RLS, and authentication vulnerabilities before launch.
untested
Document technical architecture decisionssubagentEngineeringL2
adr-author · Auditing code for authorization, RLS, and authentication vulnerabilities before launch.
untested
Refactor code without changing behaviorsubagentEngineeringL2
refactorer · Improving code design while guaranteeing behavior preservation through automated validation.
untested
Validate high-risk planssubagentEngineeringL2
emma-frost · Flagging specific coding-standard violations with inline docs links at scale.
untested
Access AI ecosystem and regulatory datapluginDataL2
tensorfeed · Auditing code for authorization, RLS, and authentication vulnerabilities before launch.
untested
Compare pricing across developer toolspluginFinanceL2
agentdeals · Handling specialized tasks that generic agents cannot perform.
untested
Generate images and videos with AIpluginMarketingL2
fal-ai · Handling specialized tasks that generic agents cannot perform.
untested
Automate Firefox testing and scrapingpluginEngineeringL2
firefox-devtools · Handling specialized tasks that generic agents cannot perform.
untested
Debug iOS Safari with WebKit InspectorpluginEngineeringL2
iwdp-mcp · Handling specialized tasks that generic agents cannot perform.
untested
Browse pages at 97% fewer tokenspluginEngineeringL2
pagemap · Browsing high-token pages (PDFs, documentation, dense paywalls) where 97% compression outweighs loss of formatting.
untested
Persist planning and progress across AI editorspluginOpsProductivityL2
planning-with-files · Multi-session planning where Markdown context outlives any single LLM conversation, across teams using different coding IDEs.
untested
Deploy React/Vite web apps on Tencent CloudBasepluginEngineeringL2
cloudbase-sites · Two-stage save→deploy workflow for iterative React/Vite front-end development on CloudBase, inspired by Codex Sites pattern.
untested
Access Tencent CloudBase models, auth, and backend servicespluginEngineeringL2
cloudbase · Tencent stack full-stack apps needing unified auth, NoSQL+SQL, blob storage, serverless, and WeChat Mini Program integration.
untested
Delegate work to Codex, Gemini, and OpenCode agentspluginOpsL2
owlex · Multi-agent workflows requiring load-balancing or task-specific agents (Codex for code, Gemini for breadth, OpenCode for spec-driven generation).
untested
Match tasks to cognitive operations from 679-item librarypluginL2
ejentum · Complex reasoning, code generation, or deception-resistant verification where 679 pre-verified cognitive topologies beat ad-hoc reasoning.
untested
Run plan-execute-validate loops with parallel work and memorypluginOpsEngineeringL2
forge · Multi-file refactors, full-stack features, or spec-driven work where parallel validation and retry beat sequential manual oversight.
untested
Execute terminal commands and manage files across formatspluginEngineeringOpsL2
desktop-commander · Local-only automation across text, code, PDFs, spreadsheets where client-side execution avoids cloud egress and latency.
untested
Access 5 business growth skills with sales and revenue focuspluginSalesOpsL2
business-growth-skills · Startup GTM (customer success manager hiring, sales engineering processes, RevOps setup, contract/proposal templates) via 5 pre-trained skills.
untested
Map processes, manage vendors, plan capacity, lead changepluginOpsL2
business-operations-skills · BizOps teams needing BPMN→bottleneck→fix workflows, vendor risk management, call-center capacity math, and process documentation with embedded validation.
untested
Access 33 C-level advisory skills and virtual board agentspluginOpsL2
c-level-skills · Founder, CEO, or board seeking 33-skill C-suite advisory (CEO, CTO, COO, CPO, CMO, CFO, CRO, CISO, CHRO, counsel, data, AI, customer officers, VPE) with forcing-question discipline.
untested
Run 13 C-suite agents for executive decision-makingpluginOpsL2
c-level-agents · Executive team or board needing multi-role deliberation (CEO, CFO, CRO, CMO, CPO, COO, CHRO, CISO, Counsel, CDO, CAO, CCO, VPE) before major decisions.
untested
Calculate AI build-vs-buy and assess regulatory riskpluginProductFinanceL2
chief-ai-officer-advisor · CAO or VP Product deciding between API models (Claude, Gemini), fine-tuning, or self-hosted infrastructure when capital and regulatory trade-offs matter.
untested
Analyze retention, design segments, size CSM hiringpluginOpsSupportL2
chief-customer-officer-advisor · Chief Customer Officer or VP Customer Success sizing the CS org and kill list when GRR is stalling and NRR delta needs targeting.
untested
Audit training data origins and design data productspluginDataLegalL2
chief-data-officer-advisor · Chief Data Officer or CPO deciding on data moat strategy, training data sourcing, and infrastructure architecture when legal + product + financial trade-offs collide.
untested
Scan contracts for founder-killer patterns and riskspluginLegalFinanceL2
general-counsel-advisor · Founder or CEO reviewing term sheet or vendor contract pre-signature when red-flag speed matters more than perfect legal interpretation.
untested
Analyze delivery metrics, hiring funnel, team structurepluginEngineeringOpsL2
vpe-advisor · VP of Engineering or Director of Eng optimizing delivery throughput (DORA), hiring funnel conversion, and org structure when execution velocity stalls.
untested
Design pricing, route deals, structure partnershipspluginSalesProductL2
commercial-skills · When pricing, channel, partnership, or deal-approval decisions require principled frameworks rather than ad-hoc judgment.
untested
Access 32 production-ready engineering and security skillspluginEngineeringL2
engineering-skills · When shipping production-grade code requires discipline across architecture, testing, deployment, and security domains rather than isolated linting.
untested
Audit and fix WCAG 2.2 accessibility violations in frontendpluginEngineeringProductL2
a11y-audit · When shipping accessible UI requires automated detection of common WCAG violations before manual testing.
untested
Automate Gmail, Drive, Calendar, Sheets, and TaskspluginOpsProductivityL2
google-workspace-cli · When administering Google Workspace across mail, drive, calendar, docs, and tasks requires CLI automation instead of manual UI clicking.
untested
Generate, fix, and run production Playwright testspluginEngineeringL2
pw · When shipping e2e tests across multiple browsers and platforms requires templates, flaky-failure fixes, and reporter integrations rather than starting from scratch.
untested
Auto-improve agent memory and rulespluginProductivityL2
si · When an agent must stay sharp by surfacing what it has learned and permanently upgrading its own instructions and tool library.
untested
Build Snowflake pipelines and SQL queriespluginEngineeringDataL2
snowflake-development · When building production data pipelines in Snowflake requires serverless compute, AI-native SQL functions, and dbt workflow management.
untested
Master 40 advanced engineering disciplinespluginEngineeringL2
engineering-advanced-skills · When shipping complex agent systems requires principled design across agent logic, retrieval, data layers, operations, and pre-production compliance gates.
page 118 / 121