The library
Everything we index — ranked by what works, never by stars.
forSalesMarketingHRFinanceLegalOpsProductEngineeringDataProductivitySupportsetup≤ plug & play≤ + a key≤ multi-tool
● works · ● untested / no effect · ● hurts — every rank is measured against a no-skill baseline
untested★64→untested★9,423→untested★64→untested★9,423→untested★64→untested★9,423→untested★381→untested★64→untested★9,423→untested★194→untested★64→untested★9,423→untested★64→untested★9,423→untested★381→untested★64→untested★9,423→untested★342→untested★64→untested★9,423→untested★381→untested★64→untested★9,423→untested★381→untested★64→untested★9,423→untested★334→untested★64→untested★9,423→untested★64→untested★9,423→untested★381→untested★64→untested★9,423→untested★64→untested★795→untested★64→untested★9,423→untested★64→untested★9,423→untested★381→untested★9,423→untested★819→untested★64→untested★9,423→untested★64→untested★9,423→untested★381→untested★64→untested★9,423→
Retire completed work items safelyskillOpsL2
done-retirement · Ensuring a clean exit where knowledge is preserved, successor is prepared, and systems are documented.
Optimize ML costs across cloud providersskillEngineeringL3
skypilot-multi-cloud-orchestration · Running large training jobs by automatically selecting cheapest cloud provider and handling spot preemption.
Assign and track single work items per agentskillOpsL2
hook-persistence · Maintaining context across multiple hook invocations in long-running workflows without re-querying.
Compress large language models to 4-bitskillEngineeringL2
awq-quantization · Compressing large LLMs for edge deployment while maintaining near-FP16 accuracy.
Send durable messages between agentsskillEngineeringL2
mail-async · Multi-agent orchestration where messages must survive crashes and maintain order by timestamp
Reduce model memory by 50-75 percentskillEngineeringL2
quantizing-models-bitsandbytes · Fitting 7B+ models on consumer GPUs (8-16GB VRAM) when accuracy tolerance permits <1% degradation
Complete CA Lobby phase documentationskillOpsL1
CA Lobby Completion Report · CA Lobby projects requiring standardized completion documentation with automated master plan progression
Signal agent health checks instantlyskillEngineeringL2
nudge-sync · Gastown health checks and stall detection where only the latest signal matters and fast overwrite is critical
Speed up transformer inference 2-4xskillEngineeringL2
optimizing-attention-flash · Long-context LLM inference and training where attention is the bottleneck and 2-4x speedup with 10-20x memory savings is critical
Replace type assertions in test codeskillEngineeringL1
migrate-to-shoehorn · Test suites with large mock objects where only a few properties matter and traditional `as` is heavy-handed
Enforce token budgets per agent convoyskillEngineeringL2
token-budget · Multi-agent Gastown convoys needing hard token gates to prevent runaway spend and cascade failures
Quantize models for CPU and Apple SiliconskillEngineeringL2
gguf-quantization · Deploying LLMs on consumer hardware (MacBook M1+) or servers without NVIDIA GPU when universal hardware support is required
Apply Bayesian methods to statistical inferenceskillDataL1
bayesian-methods · Incorporating prior domain knowledge into inference and updating beliefs with small sample sizes where frequentist confidence intervals fail
Quantize 70B models for consumer GPUsskillEngineeringL2
gptq · Deploying 70B+ models on A100/H100 when 4× compression and <2% accuracy loss is acceptable
Access WoW addon utility functionsskillEngineeringL1
k-fencore · WoW addon authors building complex addon logic without reimplementing common utilities
Summarize and visualize data distributionsskillDataL1
descriptive-statistics · Data exploration when hypothesis testing is premature and you need to understand raw data distribution first
Quantize without calibration data requiredskillEngineeringL2
hqq-quantization · Fast model quantization when calibration data unavailable and extreme compression (2-bit) is acceptable
Automate browser testing and verificationskillEngineeringL2
webapp-testing · End-to-end regression testing when unit tests miss user-facing behavior
Master statistical hypothesis testingskillDataL1
inferential-statistics · Determining whether a sample observation generalizes to the population when descriptive stats alone insufficient
Optimize PyTorch model trainingskillEngineeringL2
ml-training-recipes · Starting model training quickly with expert-vetted defaults instead of tuning from scratch
Add skill to Claude Code templateskillEngineeringL2
commands-gizix-cc-projects · Extending Claude Code project templates with custom agent behaviors
Understand probability fundamentalsskillDataL1
probability-theory · Building rigorous statistical reasoning from first principles when intuitive probability reasoning fails
Benchmark code generation modelsskillEngineeringDataL3
evaluating-code-models · Comparing code model performance across standard benchmarks when new architecture or training method is evaluated
Initialize BMAD plugin configskillEngineeringL2
init-pablolion-bmad-plugin · Starting a new project using BMAD methodology with pre-built scaffolding
Build predictive regression modelsskillDataL2
regression-modeling · Quantifying how variables influence outcomes when you need interpretable coefficients rather than pure prediction.
Evaluate LLM academic benchmarksskillEngineeringDataL3
evaluating-llms-harness · Standardized model comparison using industry-standard benchmarks when you need reproducible academic metrics.
Design REST and GraphQL APIsskillEngineeringProductL1
api-design · Creating APIs that agents and humans consume reliably when predictability beats cleverness.
Run statistical simulations and analysisskillDataL2
statistical-computing · Deriving confidence intervals and p-values when analytical formulas are unavailable or standard assumptions violated.
Scale LLM evaluation across backendsskillEngineeringL4
nemo-evaluator-sdk · Enterprise benchmarking of multiple models at scale when reproducible containerized evaluation is required.
Learn cybersecurity defensive basicsskillOpsL1
cybersecurity-basics · Teaching security fundamentals when professional penetration testing is not yet required.
Deploy LLMs on consumer hardwareskillEngineeringL3
llama-cpp · Edge deployment and local inference on Apple Silicon and non-NVIDIA hardware without Docker complexity.
Detect command injection vulnerabilitiesskillEngineeringL1
detecting-command-injection · Finding command injection during code review when automated scanners miss indirect and complex paths.
Prototype and iterate product designsskillProductL1
design-thinking · Designing solutions that solve real problems for real people, not just technically elegant systems.
Generate structured JSON outputs fasterskillEngineeringL3
sglang · Agentic workflows with repeated prefixes (system prompts, tools) where 5× speedup via caching outweighs setup.
Understand digital computing fundamentalsskillL1
digital-systems · Building systems where correctness cannot be patched after deployment (aerospace, medical, critical infrastructure).
Audit code quality and healthskillEngineeringL1
code-health-check · Regular codebase audits to catch health degradation before it blocks features.
Navigate emerging technology landscapeskillProductL1
emerging-tech · Evaluating novel tech claims when you want to separate signal from hype and zeitgeist.
Deploy high-throughput LLM APIsskillEngineeringL4
serving-llms-vllm · Production serving of open models on NVIDIA hardware when high throughput and predictable latency required.
Improve interface usability and designskillProductL1
human-computer-interaction · Creating interfaces where users don't get stuck because system behavior matches their expectations.
Track ML experiments and modelsskillEngineeringDataL2
mlflow · Managing dozens of experiments with repeatable comparison and governance when you need audit trails.
Write idiomatic Rust from CskillEngineeringL2
idiomatic-rust · Systems code where memory safety must hold at compile-time without runtime GC overhead.
Track ML experiments locallyskillEngineeringDataL2
experiment-tracking-swanlab · Real-time experiment monitoring during training when live curves beat post-hoc analysis.
Strukturieren juristischer ForderungenskillLegalL2
forderungen-interessen-matrix · Navigating multi-stakeholder projects when you need to identify whose buy-in actually matters.
Analyze religions side by sideskillL1
comparative-religion · Understanding religious frameworks systematically when you need historical and comparative analysis, not apologetics.
Visualize ML training metricsskillEngineeringDataL2
tensorboard · When debugging deep learning models with real-time metric visualization and experiment comparison across runs.
Understand ethical foundations across traditionsskillL1
ethics-and-practice · When comparing how religious traditions ground moral claims in sources, virtue, law, and contemplative practice.
Track and optimize ML experimentsskillEngineeringDataL2
weights-and-biases · When managing production ML experiments that need team collaboration, automatic metric logging, and hyperparameter optimization.
Create version control commitsskillEngineeringL2
commit · When creating git commits that reflect user intent and preserve reasoning for future context.
Study spiritual practice across traditionsskillL1
mysticism-and-contemplation · When analyzing contemplative experiences and mapping apophatic vs kataphatic approaches across traditions.
Automatically improve AI agentsskillEngineeringProductL3
evolving-ai-agents · When optimizing agent performance through iterative evolution cycles that automatically mutate prompts, skills, and memory.