The library

Everything we index — ranked by what works, never by stars.

untested
Update project documentationsubagentEngineeringL1
writer · Producing original long-form thought leadership and explainers on emerging topics when subject-matter expertise is available and citations are verifiable.
untested
Verify development plans achieve goalssubagentEngineeringL2
gsd-plan-checker · Catching plan incompleteness and hidden assumptions before executing a complex multi-month project that would otherwise fail due to missing coordination or unresolved dependencies.
untested
Validate implementations against specsubagentEngineeringL2
critic · Independent quality gate after delivery that identifies whether work actually meets the brief (not self-assessed), separates critical fixes from polish, and provides actionable rework guidance.
untested
Write and validate test coveragesubagentEngineeringL2
team-tester · Verifying that frontend, backend, and infrastructure pieces actually work together in realistic scenarios before handing off to production ops.
untested
Build frontend with Vite and LeafletsubagentEngineeringL3
frontend-engineer · Building production Node.js backends with proper query optimization, transaction handling, authorization checks, and comprehensive error responses that match frontend API contracts.
untested
Develop ESP32 firmware with ZigbeesubagentEngineeringL3
firmware-engineer · Implementing resource-constrained embedded code with deterministic timing, memory safety, and robust error recovery for hardware control and sensor integration.
untested
Manage quiz questions in CSVsubagentProductL2
add-quiz-questions · Managing multi-language quiz CSV with AI generation and duplicate detection without manual scripts.
untested
Audit code for security vulnerabilitiessubagentEngineeringL2
security · Auditing code for authorization, RLS, and authentication vulnerabilities before launch.
untested
Debug retail replenishment rulessubagentOpsL3
rule_ars · Validating complex business logic rules against SQL Server data when refactoring allocation algorithms.
untested
Detect accessibility patterns across pagessubagentEngineeringL2
cross-page-analyzer · Computing accessibility severity scores and cross-page patterns faster than manual auditing.
untested
Evaluate AI behavior with LLM evalssubagentEngineeringL2
ai-eval-engineer · Validating LLM behavior against strict criteria (structure, safety, cost) that string assertions cannot verify.
untested
Audit Astro site for SEOsubagentMarketingL2
seo-audit · Identifying crawlability and metadata issues systematically across static Astro sites.
untested
Design system architecture and decisionssubagentEngineeringL2
architect · Auditing code for authorization, RLS, and authentication vulnerabilities before launch.
untested
Review code and flag violationssubagentEngineeringL1
code-inline-reviewer · Validating LLM behavior against strict criteria (structure, safety, cost) that string assertions cannot verify.
untested
Resolve build and type errorssubagentEngineeringL2
build-error-resolver · Auditing code for authorization, RLS, and authentication vulnerabilities before launch.
untested
Document technical architecture decisionssubagentEngineeringL2
adr-author · Auditing code for authorization, RLS, and authentication vulnerabilities before launch.
untested
Refactor code without changing behaviorsubagentEngineeringL2
refactorer · Improving code design while guaranteeing behavior preservation through automated validation.
untested
Validate high-risk planssubagentEngineeringL2
emma-frost · Flagging specific coding-standard violations with inline docs links at scale.
untested
Access AI ecosystem and regulatory datapluginDataL2
tensorfeed · Auditing code for authorization, RLS, and authentication vulnerabilities before launch.
untested
Compare pricing across developer toolspluginFinanceL2
agentdeals · Handling specialized tasks that generic agents cannot perform.
untested
Persist context across sessionspluginEngineeringL3
claude-mem · Handling specialized tasks that generic agents cannot perform.
untested
Hire and pay specialized AI agentspluginOpsL3
swarmwage · Handling specialized tasks that generic agents cannot perform.
untested
Generate images and videos with AIpluginMarketingL2
fal-ai · Handling specialized tasks that generic agents cannot perform.
untested
Run genomics pipelines and experimentspluginDataL3
encode-toolkit · Flagging specific coding-standard violations with inline docs links at scale.
untested
Automate Firefox testing and scrapingpluginEngineeringL2
firefox-devtools · Handling specialized tasks that generic agents cannot perform.
untested
Debug iOS Safari with WebKit InspectorpluginEngineeringL2
iwdp-mcp · Handling specialized tasks that generic agents cannot perform.
untested
Browse pages at 97% fewer tokenspluginEngineeringL2
pagemap · Browsing high-token pages (PDFs, documentation, dense paywalls) where 97% compression outweighs loss of formatting.
untested
Persist planning and progress across AI editorspluginOpsProductivityL2
planning-with-files · Multi-session planning where Markdown context outlives any single LLM conversation, across teams using different coding IDEs.
untested
Build and deploy full-stack apps to Tencent CloudBasepluginEngineeringL3
cloudbase-ai-toolkit · Tencent-native development where CloudBase platform provides cost-effective Alibaba-competitive serverless infrastructure for China-market apps.
untested
Deploy React/Vite web apps on Tencent CloudBasepluginEngineeringL2
cloudbase-sites · Two-stage save→deploy workflow for iterative React/Vite front-end development on CloudBase, inspired by Codex Sites pattern.
untested
Access Tencent CloudBase models, auth, and backend servicespluginEngineeringL2
cloudbase · Tencent stack full-stack apps needing unified auth, NoSQL+SQL, blob storage, serverless, and WeChat Mini Program integration.
untested
Delegate work to Codex, Gemini, and OpenCode agentspluginOpsL2
owlex · Multi-agent workflows requiring load-balancing or task-specific agents (Codex for code, Gemini for breadth, OpenCode for spec-driven generation).
untested
Match tasks to cognitive operations from 679-item librarypluginL2
ejentum · Complex reasoning, code generation, or deception-resistant verification where 679 pre-verified cognitive topologies beat ad-hoc reasoning.
untested
Run plan-execute-validate loops with parallel work and memorypluginOpsEngineeringL2
forge · Multi-file refactors, full-stack features, or spec-driven work where parallel validation and retry beat sequential manual oversight.
untested
Execute terminal commands and manage files across formatspluginEngineeringOpsL2
desktop-commander · Local-only automation across text, code, PDFs, spreadsheets where client-side execution avoids cloud egress and latency.
untested
Run interactive shell, SSH, and serial sessions for agentspluginEngineeringOpsL3
pty-mcp · Long-lived interactive debugging, DevOps orchestration, or system administration where persistent PTY state beats stateless shell commands.
untested
Give agents real email addresses for multi-agent coordinationpluginOpsL3
agenticmail · Multi-agent workflows coordinating via email threads (human-readable audit trail) instead of internal queues or cloud APIs.
untested
Access 5 business growth skills with sales and revenue focuspluginSalesOpsL2
business-growth-skills · Startup GTM (customer success manager hiring, sales engineering processes, RevOps setup, contract/proposal templates) via 5 pre-trained skills.
untested
Map processes, manage vendors, plan capacity, lead changepluginOpsL2
business-operations-skills · BizOps teams needing BPMN→bottleneck→fix workflows, vendor risk management, call-center capacity math, and process documentation with embedded validation.
untested
Access 33 C-level advisory skills and virtual board agentspluginOpsL2
c-level-skills · Founder, CEO, or board seeking 33-skill C-suite advisory (CEO, CTO, COO, CPO, CMO, CFO, CRO, CISO, CHRO, counsel, data, AI, customer officers, VPE) with forcing-question discipline.
untested
Run 13 C-suite agents for executive decision-makingpluginOpsL2
c-level-agents · Executive team or board needing multi-role deliberation (CEO, CFO, CRO, CMO, CPO, COO, CHRO, CISO, Counsel, CDO, CAO, CCO, VPE) before major decisions.
untested
Calculate AI build-vs-buy and assess regulatory riskpluginProductFinanceL2
chief-ai-officer-advisor · CAO or VP Product deciding between API models (Claude, Gemini), fine-tuning, or self-hosted infrastructure when capital and regulatory trade-offs matter.
untested
Analyze retention, design segments, size CSM hiringpluginOpsSupportL2
chief-customer-officer-advisor · Chief Customer Officer or VP Customer Success sizing the CS org and kill list when GRR is stalling and NRR delta needs targeting.
untested
Audit training data origins and design data productspluginDataLegalL2
chief-data-officer-advisor · Chief Data Officer or CPO deciding on data moat strategy, training data sourcing, and infrastructure architecture when legal + product + financial trade-offs collide.
untested
Scan contracts for founder-killer patterns and riskspluginLegalFinanceL2
general-counsel-advisor · Founder or CEO reviewing term sheet or vendor contract pre-signature when red-flag speed matters more than perfect legal interpretation.
untested
Analyze delivery metrics, hiring funnel, team structurepluginEngineeringOpsL2
vpe-advisor · VP of Engineering or Director of Eng optimizing delivery throughput (DORA), hiring funnel conversion, and org structure when execution velocity stalls.
untested
Design pricing, route deals, structure partnershipspluginSalesProductL2
commercial-skills · When pricing, channel, partnership, or deal-approval decisions require principled frameworks rather than ad-hoc judgment.
untested
Configure and operate multi-framework compliance programspluginLegalOpsL3
compliance-os · When a company must align multi-framework compliance programs without building separate audit prep tracks.
untested
Access 32 production-ready engineering and security skillspluginEngineeringL2
engineering-skills · When shipping production-grade code requires discipline across architecture, testing, deployment, and security domains rather than isolated linting.
untested
Audit and fix WCAG 2.2 accessibility violations in frontendpluginEngineeringProductL2
a11y-audit · When shipping accessible UI requires automated detection of common WCAG violations before manual testing.
page 157 / 161