The library
Everything we index — ranked by what works, never by stars.
forSalesMarketingHRFinanceLegalOpsProductEngineeringDataProductivitySupportsetup≤ plug & play≤ + a key≤ multi-tool
● works · ● untested / no effect · ● hurts — every rank is measured against a no-skill baseline
untested★9,423→untested★381→untested★9,423→untested★381→untested★9,423→untested★381→untested★9,423→untested★9,423→untested★64→untested★9,423→untested★9,423→untested★381→untested★9,423→untested★64→untested★9,423→untested★9,423→untested★98→untested★64→untested★9,423→untested★9,423→untested★9,423→untested★64→untested★9,423→untested★64→untested★9,423→untested★194→untested★64→untested★9,423→untested★9,423→untested★381→untested★9,423→untested★342→untested★9,423→untested★381→untested★9,423→untested★381→untested★9,423→untested★334→untested★9,423→untested★9,423→untested★381→untested★9,423→untested★795→untested★9,423→untested★9,423→untested★381→untested★9,423→untested★9,423→untested★9,423→untested★381→
Train safer AI with constitutional methodsskillEngineeringL1
constitutional-ai · When you want safety alignment without human labels and need explainable reasoning in refusals.
Review your changes since last commitskillEngineeringL1
diff-since-my-commit · Code review workflows where you need to see what others changed to your files.
Moderate LLM outputs with LlamaGuardskillEngineeringL2
llamaguard · Production input/output filtering where you need a specialized 7B moderation model instead of general LLM.
Validate PubFi DSL server contractsskillEngineeringL1
pubfi-dsl-server-contract · When you need strict input validation and reject SQL/unknown-field queries server-side.
Add runtime guardrails to LLM appsskillEngineeringL2
nemo-guardrails · When you need stateful conversational guardrails beyond single-turn content moderation.
Protect GitLab branches from changesskillEngineeringL1
gitlab-protected-branches · Multi-team workflows where you must enforce review gates and prevent force-push accidents.
Detect prompt injection attacksskillEngineeringL2
prompt-guard · When you need lightweight client-side jailbreak detection before sending to LLM.
Scale PyTorch training across GPUsskillEngineeringL3
huggingface-accelerate · When you have a standard PyTorch training loop and need easy multi-GPU scaling.
Track objects in augmented realityskillEngineeringL1
augmented-reality-tracking · Diagnosing why virtual AR content does not register with physical world; guides technique selection.
Train giant language models efficientlyskillEngineeringL4
training-llms-megatron · Training LLMs over 1B parameters where single GPU is insufficient; achieves 47% MFU on H100.
Distribute large model training across GPUsskillEngineeringL3
pytorch-fsdp2 · Sharding large models across GPUs with DTensor-based parameter sharding and simpler checkpoint semantics vs FSDP1.
Review code for quality and securityskillEngineeringL1
code-review · Post-implementation review to catch security issues, bugs, and code quality gaps before merge.
Train models cleanly with PyTorch LightningskillEngineeringL2
pytorch-lightning · Scaling training from laptop to multi-node/multi-GPU without rewriting boilerplate; automatic DDP/FSDP/DeepSpeed support.
Master spatial reasoning for 3D designskillEngineeringL1
spatial-reasoning-fundamentals · Grounding 3D spatial thinking in platforms like Minecraft, VR, CAD; makes blueprint→world translation transferable.
Distribute model training across clustersskillEngineeringL3
ray-train · Running hyperparameter searches or long training runs across unstable/cloud clusters with automatic fault recovery.
Scale GPU training on reserved instancesskillEngineeringL2
lambda-labs-gpu-cloud · Quick provisioning of single or batch GPU instances without long-term commitment; lower cost than AWS on-demand.
Compile Rust native librariesskillEngineeringL2
rust-build · Building production Rust binaries with optimizations (LTO, codegen-units) for deployment.
Persist orchestration state atomicallyskillEngineeringL3
beads-state · Building reactive UIs in Beads where state changes automatically propagate to views.
Deploy ML models serverless with GPUsskillEngineeringL2
modal-serverless-gpu · Deploying inference-only ML models without managing containers or servers; pay-per-invocation.
Optimize ML costs across cloud providersskillEngineeringL3
skypilot-multi-cloud-orchestration · Running large training jobs by automatically selecting cheapest cloud provider and handling spot preemption.
Compress large language models to 4-bitskillEngineeringL2
awq-quantization · Compressing large LLMs for edge deployment while maintaining near-FP16 accuracy.
Send durable messages between agentsskillEngineeringL2
mail-async · Multi-agent orchestration where messages must survive crashes and maintain order by timestamp
Reduce model memory by 50-75 percentskillEngineeringL2
quantizing-models-bitsandbytes · Fitting 7B+ models on consumer GPUs (8-16GB VRAM) when accuracy tolerance permits <1% degradation
Signal agent health checks instantlyskillEngineeringL2
nudge-sync · Gastown health checks and stall detection where only the latest signal matters and fast overwrite is critical
Speed up transformer inference 2-4xskillEngineeringL2
optimizing-attention-flash · Long-context LLM inference and training where attention is the bottleneck and 2-4x speedup with 10-20x memory savings is critical
Replace type assertions in test codeskillEngineeringL1
migrate-to-shoehorn · Test suites with large mock objects where only a few properties matter and traditional `as` is heavy-handed
Enforce token budgets per agent convoyskillEngineeringL2
token-budget · Multi-agent Gastown convoys needing hard token gates to prevent runaway spend and cascade failures
Quantize models for CPU and Apple SiliconskillEngineeringL2
gguf-quantization · Deploying LLMs on consumer hardware (MacBook M1+) or servers without NVIDIA GPU when universal hardware support is required
Quantize 70B models for consumer GPUsskillEngineeringL2
gptq · Deploying 70B+ models on A100/H100 when 4× compression and <2% accuracy loss is acceptable
Access WoW addon utility functionsskillEngineeringL1
k-fencore · WoW addon authors building complex addon logic without reimplementing common utilities
Quantize without calibration data requiredskillEngineeringL2
hqq-quantization · Fast model quantization when calibration data unavailable and extreme compression (2-bit) is acceptable
Automate browser testing and verificationskillEngineeringL2
webapp-testing · End-to-end regression testing when unit tests miss user-facing behavior
Optimize PyTorch model trainingskillEngineeringL2
ml-training-recipes · Starting model training quickly with expert-vetted defaults instead of tuning from scratch
Add skill to Claude Code templateskillEngineeringL2
commands-gizix-cc-projects · Extending Claude Code project templates with custom agent behaviors
Benchmark code generation modelsskillEngineeringDataL3
evaluating-code-models · Comparing code model performance across standard benchmarks when new architecture or training method is evaluated
Initialize BMAD plugin configskillEngineeringL2
init-pablolion-bmad-plugin · Starting a new project using BMAD methodology with pre-built scaffolding
Evaluate LLM academic benchmarksskillEngineeringDataL3
evaluating-llms-harness · Standardized model comparison using industry-standard benchmarks when you need reproducible academic metrics.
Design REST and GraphQL APIsskillEngineeringProductL1
api-design · Creating APIs that agents and humans consume reliably when predictability beats cleverness.
Scale LLM evaluation across backendsskillEngineeringL4
nemo-evaluator-sdk · Enterprise benchmarking of multiple models at scale when reproducible containerized evaluation is required.
Deploy LLMs on consumer hardwareskillEngineeringL3
llama-cpp · Edge deployment and local inference on Apple Silicon and non-NVIDIA hardware without Docker complexity.
Detect command injection vulnerabilitiesskillEngineeringL1
detecting-command-injection · Finding command injection during code review when automated scanners miss indirect and complex paths.
Generate structured JSON outputs fasterskillEngineeringL3
sglang · Agentic workflows with repeated prefixes (system prompts, tools) where 5× speedup via caching outweighs setup.
Audit code quality and healthskillEngineeringL1
code-health-check · Regular codebase audits to catch health degradation before it blocks features.
Deploy high-throughput LLM APIsskillEngineeringL4
serving-llms-vllm · Production serving of open models on NVIDIA hardware when high throughput and predictable latency required.
Track ML experiments and modelsskillEngineeringDataL2
mlflow · Managing dozens of experiments with repeatable comparison and governance when you need audit trails.
Write idiomatic Rust from CskillEngineeringL2
idiomatic-rust · Systems code where memory safety must hold at compile-time without runtime GC overhead.
Track ML experiments locallyskillEngineeringDataL2
experiment-tracking-swanlab · Real-time experiment monitoring during training when live curves beat post-hoc analysis.
Visualize ML training metricsskillEngineeringDataL2
tensorboard · When debugging deep learning models with real-time metric visualization and experiment comparison across runs.
Track and optimize ML experimentsskillEngineeringDataL2
weights-and-biases · When managing production ML experiments that need team collaboration, automatic metric logging, and hyperparameter optimization.
Create version control commitsskillEngineeringL2
commit · When creating git commits that reflect user intent and preserve reasoning for future context.