cyberneticlibrary

The library

Everything we index — ranked by what works, never by stars.


7,984 artifacts


● works · ● untested / no effect · ● hurts — every rank is measured against a no-skill baseline

Retire completed work items safely · skill · Ops · L2

done-retirement · Ensuring a clean exit where knowledge is preserved, the successor is prepared, and systems are documented.

★64 → untested

Optimize ML costs across cloud providers · skill · Engineering · L3

skypilot-multi-cloud-orchestration · Running large training jobs by automatically selecting the cheapest cloud provider and handling spot preemption.

★9,423 → untested

Assign and track single work items per agent · skill · Ops · L2

hook-persistence · Maintaining context across multiple hook invocations in long-running workflows without re-querying.

★64 → untested

Compress large language models to 4-bit · skill · Engineering · L2

awq-quantization · Compressing large LLMs for edge deployment while maintaining near-FP16 accuracy.

★9,423 → untested

Send durable messages between agents · skill · Engineering · L2

mail-async · Multi-agent orchestration where messages must survive crashes and maintain order by timestamp.

★64 → untested

Reduce model memory by 50-75 percent · skill · Engineering · L2

quantizing-models-bitsandbytes · Fitting 7B+ models on consumer GPUs (8-16GB VRAM) when accuracy tolerance permits <1% degradation.

★9,423 → untested

Complete CA Lobby phase documentation · skill · Ops · L1

CA Lobby Completion Report · CA Lobby projects requiring standardized completion documentation with automated master plan progression.

★381 → untested

Signal agent health checks instantly · skill · Engineering · L2

nudge-sync · Gastown health checks and stall detection where only the latest signal matters and fast overwrite is critical.

★64 → untested

Speed up transformer inference 2-4x · skill · Engineering · L2

optimizing-attention-flash · Long-context LLM inference and training where attention is the bottleneck and 2-4x speedup with 10-20x memory savings is critical.

★9,423 → untested

Replace type assertions in test code · skill · Engineering · L1

migrate-to-shoehorn · Test suites with large mock objects where only a few properties matter and traditional `as` is heavy-handed.

★194 → untested

Enforce token budgets per agent convoy · skill · Engineering · L2

token-budget · Multi-agent Gastown convoys needing hard token gates to prevent runaway spend and cascade failures.

★64 → untested

Quantize models for CPU and Apple Silicon · skill · Engineering · L2

gguf-quantization · Deploying LLMs on consumer hardware (MacBook M1+) or servers without an NVIDIA GPU when universal hardware support is required.

★9,423 → untested

Apply Bayesian methods to statistical inference · skill · Data · L1

bayesian-methods · Incorporating prior domain knowledge into inference and updating beliefs with small sample sizes where frequentist confidence intervals fail.

★64 → untested

Quantize 70B models for consumer GPUs · skill · Engineering · L2

gptq · Deploying 70B+ models on A100/H100 when 4× compression and <2% accuracy loss are acceptable.

★9,423 → untested

Access WoW addon utility functions · skill · Engineering · L1

k-fencore · WoW addon authors building complex addon logic without reimplementing common utilities.

★381 → untested

Summarize and visualize data distributions · skill · Data · L1

descriptive-statistics · Data exploration when hypothesis testing is premature and you need to understand raw data distribution first.

★64 → untested

Quantize without calibration data required · skill · Engineering · L2

hqq-quantization · Fast model quantization when calibration data is unavailable and extreme compression (2-bit) is acceptable.

★9,423 → untested

Automate browser testing and verification · skill · Engineering · L2

webapp-testing · End-to-end regression testing when unit tests miss user-facing behavior.

★342 → untested

Master statistical hypothesis testing · skill · Data · L1

inferential-statistics · Determining whether a sample observation generalizes to the population when descriptive stats alone are insufficient.

★64 → untested

Optimize PyTorch model training · skill · Engineering · L2

ml-training-recipes · Starting model training quickly with expert-vetted defaults instead of tuning from scratch.

★9,423 → untested

Add skill to Claude Code template · skill · Engineering · L2

commands-gizix-cc-projects · Extending Claude Code project templates with custom agent behaviors.

★381 → untested

Understand probability fundamentals · skill · Data · L1

probability-theory · Building rigorous statistical reasoning from first principles when intuitive probability reasoning fails.

★64 → untested

Benchmark code generation models · skill · Engineering · Data · L3

evaluating-code-models · Comparing code model performance across standard benchmarks when a new architecture or training method is evaluated.

★9,423 → untested

Initialize BMAD plugin config · skill · Engineering · L2

init-pablolion-bmad-plugin · Starting a new project using BMAD methodology with pre-built scaffolding.

★381 → untested

Build predictive regression models · skill · Data · L2

regression-modeling · Quantifying how variables influence outcomes when you need interpretable coefficients rather than pure prediction.

★64 → untested

Evaluate LLM academic benchmarks · skill · Engineering · Data · L3

evaluating-llms-harness · Standardized model comparison using industry-standard benchmarks when you need reproducible academic metrics.

★9,423 → untested

Design REST and GraphQL APIs · skill · Engineering · Product · L1

api-design · Creating APIs that agents and humans consume reliably when predictability beats cleverness.

★334 → untested

Run statistical simulations and analysis · skill · Data · L2

statistical-computing · Deriving confidence intervals and p-values when analytical formulas are unavailable or standard assumptions are violated.

★64 → untested

Scale LLM evaluation across backends · skill · Engineering · L4

nemo-evaluator-sdk · Enterprise benchmarking of multiple models at scale when reproducible containerized evaluation is required.

★9,423 → untested

Learn cybersecurity defensive basics · skill · Ops · L1

cybersecurity-basics · Teaching security fundamentals when professional penetration testing is not yet required.

★64 → untested

Deploy LLMs on consumer hardware · skill · Engineering · L3

llama-cpp · Edge deployment and local inference on Apple Silicon and non-NVIDIA hardware without Docker complexity.

★9,423 → untested

Detect command injection vulnerabilities · skill · Engineering · L1

detecting-command-injection · Finding command injection during code review when automated scanners miss indirect and complex paths.

★381 → untested

Prototype and iterate product designs · skill · Product · L1

design-thinking · Designing solutions that solve real problems for real people, not just technically elegant systems.

★64 → untested

Generate structured JSON outputs faster · skill · Engineering · L3

sglang · Agentic workflows with repeated prefixes (system prompts, tools) where 5× speedup via caching outweighs setup.

★9,423 → untested

Understand digital computing fundamentals · skill · L1

digital-systems · Building systems where correctness cannot be patched after deployment (aerospace, medical, critical infrastructure).

★64 → untested

Audit code quality and health · skill · Engineering · L1

code-health-check · Regular codebase audits to catch health degradation before it blocks features.

★795 → untested

Navigate emerging technology landscape · skill · Product · L1

emerging-tech · Evaluating novel tech claims when you want to separate signal from hype and zeitgeist.

★64 → untested

Deploy high-throughput LLM APIs · skill · Engineering · L4

serving-llms-vllm · Production serving of open models on NVIDIA hardware when high throughput and predictable latency are required.

★9,423 → untested

Improve interface usability and design · skill · Product · L1

human-computer-interaction · Creating interfaces where users don't get stuck because system behavior matches their expectations.

★64 → untested

Track ML experiments and models · skill · Engineering · Data · L2

mlflow · Managing dozens of experiments with repeatable comparison and governance when you need audit trails.

★9,423 → untested

Write idiomatic Rust from C · skill · Engineering · L2

idiomatic-rust · Systems code where memory safety must hold at compile-time without runtime GC overhead.

★381 → untested

Track ML experiments locally · skill · Engineering · Data · L2

experiment-tracking-swanlab · Real-time experiment monitoring during training when live curves beat post-hoc analysis.

★9,423 → untested

Structure legal claims · skill · Legal · L2

forderungen-interessen-matrix · Navigating multi-stakeholder projects when you need to identify whose buy-in actually matters.

★819 → untested

Analyze religions side by side · skill · L1

comparative-religion · Understanding religious frameworks systematically when you need historical and comparative analysis, not apologetics.

★64 → untested

Visualize ML training metrics · skill · Engineering · Data · L2

tensorboard · When debugging deep learning models with real-time metric visualization and experiment comparison across runs.

★9,423 → untested

Understand ethical foundations across traditions · skill · L1

ethics-and-practice · When comparing how religious traditions ground moral claims in sources, virtue, law, and contemplative practice.

★64 → untested

Track and optimize ML experiments · skill · Engineering · Data · L2

weights-and-biases · When managing production ML experiments that need team collaboration, automatic metric logging, and hyperparameter optimization.

★9,423 → untested

Create version control commits · skill · Engineering · L2

commit · When creating git commits that reflect user intent and preserve reasoning for future context.

★381 → untested

Study spiritual practice across traditions · skill · L1

mysticism-and-contemplation · When analyzing contemplative experiences and mapping apophatic vs kataphatic approaches across traditions.

★64 → untested

Automatically improve AI agents · skill · Engineering · Product · L3

evolving-ai-agents · When optimizing agent performance through iterative evolution cycles that automatically mutate prompts, skills, and memory.
