cyberneticlibrary

The library

Everything we index — ranked by what works, never by stars.

Artifacts Capabilities Recipes

kindall skill workflow mcp_server command subagent plugin|✓ proven ✗ harmful measured only4,285 artifacts

forSales Marketing HR Finance Legal Ops Product Engineering Data Productivity Supportsetup≤ plug & play ≤ + a key ≤ multi-tool

● works · ● untested / no effect · ● hurts — every rank is measured against a no-skill baseline

Train safer AI with constitutional methodsskillEngineeringL1

constitutional-ai · When you want safety alignment without human labels and need explainable reasoning in refusals.

★9,423→untested

Review your changes since last commitskillEngineeringL1

diff-since-my-commit · Code review workflows where you need to see what others changed to your files.

★381→untested

Moderate LLM outputs with LlamaGuardskillEngineeringL2

llamaguard · Production input/output filtering where you need a specialized 7B moderation model instead of general LLM.

★9,423→untested

Validate PubFi DSL server contractsskillEngineeringL1

pubfi-dsl-server-contract · When you need strict input validation and reject SQL/unknown-field queries server-side.

★381→untested

Add runtime guardrails to LLM appsskillEngineeringL2

nemo-guardrails · When you need stateful conversational guardrails beyond single-turn content moderation.

★9,423→untested

Protect GitLab branches from changesskillEngineeringL1

gitlab-protected-branches · Multi-team workflows where you must enforce review gates and prevent force-push accidents.

★381→untested

Detect prompt injection attacksskillEngineeringL2

prompt-guard · When you need lightweight client-side jailbreak detection before sending to LLM.

★9,423→untested

Scale PyTorch training across GPUsskillEngineeringL3

huggingface-accelerate · When you have a standard PyTorch training loop and need easy multi-GPU scaling.

★9,423→untested

Track objects in augmented realityskillEngineeringL1

augmented-reality-tracking · Diagnosing why virtual AR content does not register with physical world; guides technique selection.

★64→untested

Train giant language models efficientlyskillEngineeringL4

training-llms-megatron · Training LLMs over 1B parameters where single GPU is insufficient; achieves 47% MFU on H100.

★9,423→untested

Distribute large model training across GPUsskillEngineeringL3

pytorch-fsdp2 · Sharding large models across GPUs with DTensor-based parameter sharding and simpler checkpoint semantics vs FSDP1.

★9,423→untested

Review code for quality and securityskillEngineeringL1

code-review · Post-implementation review to catch security issues, bugs, and code quality gaps before merge.

★381→untested

Train models cleanly with PyTorch LightningskillEngineeringL2

pytorch-lightning · Scaling training from laptop to multi-node/multi-GPU without rewriting boilerplate; automatic DDP/FSDP/DeepSpeed support.

★9,423→untested

Master spatial reasoning for 3D designskillEngineeringL1

spatial-reasoning-fundamentals · Grounding 3D spatial thinking in platforms like Minecraft, VR, CAD; makes blueprint→world translation transferable.

★64→untested

Distribute model training across clustersskillEngineeringL3

ray-train · Running hyperparameter searches or long training runs across unstable/cloud clusters with automatic fault recovery.

★9,423→untested

Scale GPU training on reserved instancesskillEngineeringL2

lambda-labs-gpu-cloud · Quick provisioning of single or batch GPU instances without long-term commitment; lower cost than AWS on-demand.

★9,423→untested

Compile Rust native librariesskillEngineeringL2

rust-build · Building production Rust binaries with optimizations (LTO, codegen-units) for deployment.

★98→untested

Persist orchestration state atomicallyskillEngineeringL3

beads-state · Building reactive UIs in Beads where state changes automatically propagate to views.

★64→untested

Deploy ML models serverless with GPUsskillEngineeringL2

modal-serverless-gpu · Deploying inference-only ML models without managing containers or servers; pay-per-invocation.

★9,423→untested

Optimize ML costs across cloud providersskillEngineeringL3

skypilot-multi-cloud-orchestration · Running large training jobs by automatically selecting cheapest cloud provider and handling spot preemption.

★9,423→untested

Compress large language models to 4-bitskillEngineeringL2

awq-quantization · Compressing large LLMs for edge deployment while maintaining near-FP16 accuracy.

★9,423→untested

Send durable messages between agentsskillEngineeringL2

mail-async · Multi-agent orchestration where messages must survive crashes and maintain order by timestamp

★64→untested

Reduce model memory by 50-75 percentskillEngineeringL2

quantizing-models-bitsandbytes · Fitting 7B+ models on consumer GPUs (8-16GB VRAM) when accuracy tolerance permits <1% degradation

★9,423→untested

Signal agent health checks instantlyskillEngineeringL2

nudge-sync · Gastown health checks and stall detection where only the latest signal matters and fast overwrite is critical

★64→untested

Speed up transformer inference 2-4xskillEngineeringL2

optimizing-attention-flash · Long-context LLM inference and training where attention is the bottleneck and 2-4x speedup with 10-20x memory savings is critical

★9,423→untested

Replace type assertions in test codeskillEngineeringL1

migrate-to-shoehorn · Test suites with large mock objects where only a few properties matter and traditional `as` is heavy-handed

★194→untested

Enforce token budgets per agent convoyskillEngineeringL2

token-budget · Multi-agent Gastown convoys needing hard token gates to prevent runaway spend and cascade failures

★64→untested

Quantize models for CPU and Apple SiliconskillEngineeringL2

gguf-quantization · Deploying LLMs on consumer hardware (MacBook M1+) or servers without NVIDIA GPU when universal hardware support is required

★9,423→untested

Quantize 70B models for consumer GPUsskillEngineeringL2

gptq · Deploying 70B+ models on A100/H100 when 4× compression and <2% accuracy loss is acceptable

★9,423→untested

Access WoW addon utility functionsskillEngineeringL1

k-fencore · WoW addon authors building complex addon logic without reimplementing common utilities

★381→untested

Quantize without calibration data requiredskillEngineeringL2

hqq-quantization · Fast model quantization when calibration data unavailable and extreme compression (2-bit) is acceptable

★9,423→untested

Automate browser testing and verificationskillEngineeringL2

webapp-testing · End-to-end regression testing when unit tests miss user-facing behavior

★342→untested

Optimize PyTorch model trainingskillEngineeringL2

ml-training-recipes · Starting model training quickly with expert-vetted defaults instead of tuning from scratch

★9,423→untested

Add skill to Claude Code templateskillEngineeringL2

commands-gizix-cc-projects · Extending Claude Code project templates with custom agent behaviors

★381→untested

Benchmark code generation modelsskillEngineeringDataL3

evaluating-code-models · Comparing code model performance across standard benchmarks when new architecture or training method is evaluated

★9,423→untested

Initialize BMAD plugin configskillEngineeringL2

init-pablolion-bmad-plugin · Starting a new project using BMAD methodology with pre-built scaffolding

★381→untested

Evaluate LLM academic benchmarksskillEngineeringDataL3

evaluating-llms-harness · Standardized model comparison using industry-standard benchmarks when you need reproducible academic metrics.

★9,423→untested

Design REST and GraphQL APIsskillEngineeringProductL1

api-design · Creating APIs that agents and humans consume reliably when predictability beats cleverness.

★334→untested

Scale LLM evaluation across backendsskillEngineeringL4

nemo-evaluator-sdk · Enterprise benchmarking of multiple models at scale when reproducible containerized evaluation is required.

★9,423→untested

Deploy LLMs on consumer hardwareskillEngineeringL3

llama-cpp · Edge deployment and local inference on Apple Silicon and non-NVIDIA hardware without Docker complexity.

★9,423→untested

Detect command injection vulnerabilitiesskillEngineeringL1

detecting-command-injection · Finding command injection during code review when automated scanners miss indirect and complex paths.

★381→untested

Generate structured JSON outputs fasterskillEngineeringL3

sglang · Agentic workflows with repeated prefixes (system prompts, tools) where 5× speedup via caching outweighs setup.

★9,423→untested

Audit code quality and healthskillEngineeringL1

code-health-check · Regular codebase audits to catch health degradation before it blocks features.

★795→untested

Deploy high-throughput LLM APIsskillEngineeringL4

serving-llms-vllm · Production serving of open models on NVIDIA hardware when high throughput and predictable latency required.

★9,423→untested

Track ML experiments and modelsskillEngineeringDataL2

mlflow · Managing dozens of experiments with repeatable comparison and governance when you need audit trails.

★9,423→untested

Write idiomatic Rust from CskillEngineeringL2

idiomatic-rust · Systems code where memory safety must hold at compile-time without runtime GC overhead.

★381→untested

Track ML experiments locallyskillEngineeringDataL2

experiment-tracking-swanlab · Real-time experiment monitoring during training when live curves beat post-hoc analysis.

★9,423→untested

Visualize ML training metricsskillEngineeringDataL2

tensorboard · When debugging deep learning models with real-time metric visualization and experiment comparison across runs.

★9,423→untested

Track and optimize ML experimentsskillEngineeringDataL2

weights-and-biases · When managing production ML experiments that need team collaboration, automatic metric logging, and hyperparameter optimization.

★9,423→untested

Create version control commitsskillEngineeringL2

commit · When creating git commits that reflect user intent and preserve reasoning for future context.

←page 43 / 86→