The library

Everything we index — ranked by what works, never by stars.

works · untested / no effect · hurts — every rank is measured against a no-skill baseline
untested
Extract specifications from legacy codeskillEngineeringL2
spec-miner · Creating specifications for legacy systems, mapping dependencies, or documenting existing code without docs
untested
Parallelize async operations with DAGskillEngineeringL2
better-all · Running multiple async operations with complex interdependencies using automatic DAG optimization
untested
Build Spring Boot microservicesskillEngineeringL2
spring-boot-engineer · REST APIs, Spring Security 6, reactive WebFlux, Spring Data JPA, or microservices architecture
untested
Optimize database schemas and queriesskillEngineeringL2
sql-pro · Query optimization, indexing strategy, performance tuning, or advanced SQL patterns
untested
Fetch URL content with CLIskillEngineeringL2
simple-fetch · Reliable HTTP requests with automatic retries, error handling, or timeout management
untested
Annotate biological sequencesskillEngineeringL2
bio-annotation · Gene annotation, sequence analysis, or biological data enrichment pipelines
untested
Build iOS and macOS appsskillEngineeringL2
swift-expert · Swift concurrency, SwiftUI, async/await, actor isolation, or Apple platform development
untested
Manage infrastructure as codeskillEngineeringOpsL3
terraform-engineer · Managing multi-environment infrastructure as code where state consistency, dependency ordering, and secrets isolation are critical.
untested
Debug Manim animation scenesskillEngineeringL2
manim-skill · Iterating on complex animations by rendering frames and diagnosing visual issues before committing to final video production.
untested
Design comprehensive test strategiesskillEngineeringL2
test-master · Designing comprehensive test architectures across unit, integration, E2E, performance, and security when coverage gaps risk production failures.
untested
Manage environment variables securelyskillEngineeringL1
env-handler · Securing API keys and secrets in .env.local while distributing safe templates in .env across the team.
untested
Build advanced TypeScript type systemsskillEngineeringL2
typescript-pro · Building type-safe APIs where branded types and discriminated unions catch misuse at compile time rather than runtime.
untested
Build Vue 3 components with vanilla JavaScriptskillEngineeringL2
vue-expert-js · Building Vue 3 Composition API components in JavaScript with JSDoc types when TypeScript setup adds overhead.
untested
Build Vue 3 apps with Composition APIskillEngineeringL2
vue-expert · Building Vue 3 Composition API components with strict type safety across composables, routing, and state management.
untested
Build real-time communication systemsskillEngineeringL3
websocket-engineer · Building real-time bidirectional communication where HTTP polling would waste resources and introduce latency.
untested
Improve quality and test coverageskillEngineeringL2
quality-assurance · Designing QA workflows that balance automated gate checks with manual exploration when shipping must meet quality bars.
untested
Develop WordPress themes and pluginsskillEngineeringL3
wordpress-pro · Extending WordPress with custom post types and REST endpoints when off-the-shelf plugins don't fit the domain model.
untested
Train LLMs with Lightning AIskillEngineeringL3
implementing-llms-litgpt · Production-grade fine-tuning without abstraction layer overhead; ideal for understanding model internals.
untested
Review code against latest standardsskillEngineeringL2
quality-reviewing · Catching ecosystem mismatches before they reach production; complements automatic hook with external verification.
untested
Build state-space LLM architecturesskillEngineeringL3
mamba-architecture · Long sequences (100K+ tokens), streaming inference, or memory-constrained deployments beating Transformer quadratic scaling.
untested
Implement GPT from scratchskillEngineeringL2
nanogpt · Learning transformer internals, quick prototyping, or experimenting with variants without framework overhead.
untested
Build RNN-Transformer hybrid modelsskillEngineeringL3
rwkv-architecture · Combining RNN efficiency with Transformer quality when linear-time inference matters.
untested
Pretrain LLMs at scale with 4D parallelismskillEngineeringL4
distributed-llm-pretraining-torchtitan · Large-scale pretraining where single-node limits are exceeded and distributed coordination is unavoidable.
untested
Tokenize text 1GB in under 20 secondsskillEngineeringL2
huggingface-tokenizers · Fast, parallel tokenization when model-specific vocabularies and special tokens matter.
untested
Build multilingual tokenizers for CJKskillEngineeringL2
sentencepiece · Multilingual models or custom vocabularies where language-agnostic approach beats language-specific tokenizers.
untested
Audit ecosystem version alignmentskillEngineeringL1
ecosystem-alignment · Keeping a custom Claude Code setup in sync with upstream platform updates, preventing drift and discovering new capabilities to adopt.
untested
Fine-tune 70B models with <1% parametersskillEngineeringL3
peft-fine-tuning · Fine-tuning large models (70B+) on consumer hardware by training only 0.17% of parameters in a 6MB adapter, enabling cost-effective task-specific customization without full-model training.
untested
Debug distributed systems with tracesskillEngineeringL3
rca-distributed-systems · Production incidents in microservice meshes (Kubernetes, Istio, Linkerd) where the fault could originate in any of 10-100 services and the causal path runs through network hops, retries, and timeouts.
untested
Interpret 70B models without local GPUskillEngineeringDataL3
nnsight-remote-interpretability · Running the same interpretability code on GPT-2 locally and Llama-405B remotely without code changes, enabling scalable mechanistic interpretability research on massive models.
untested
Patch activations to test causal claimsskillEngineeringL2
pyvene-interventions · guidance for performing causal interventions on PyTorch models using pyvene's declarative intervention framework
untested
Train sparse autoencoders to find featuresskillEngineeringDataL3
sparse-autoencoder-training · guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable f...
untested
Merge docs and fixes to main without releaseskillEngineeringL1
push · Solving push challenges
untested
Reverse-engineer transformer internalsskillEngineeringDataL3
transformer-lens-interpretability · guidance for mechanistic interpretability research using TransformerLens to inspect and manipulate transformer internals via HookPoints a...
untested
Deduplicate and filter training dataskillEngineeringL3
nemo-curator · GPU-accelerated data curation for LLM training
untested
Process ML datasets at scaleskillDataEngineeringL3
ray-data · Batch inference and preprocessing on 100GB+ datasets across multi-node clusters.
untested
Cancel and debounce async requestsskillEngineeringL1
riverpod-cancel · Flutter apps cancelling requests when user navigates or triggers rapid refreshes.
untested
Fine-tune models with GRPOskillEngineeringDataL3
grpo-rl-training · Teaching specific output formats (XML, JSON) and verifiable tasks without preference pairs.
untested
Train large MoE models efficientlyskillEngineeringDataL4
miles-rl-training · Training 1TB+ MoE models with speculative RL for 25%+ rollout speedup.
untested
Scale RLHF training with RayskillEngineeringDataL4
openrlhf-training · Scaling PPO/GRPO/RLOO/DPO training to 70B+ models with multi-node vLLM.
untested
Engineer new InCTRL modulesskillEngineeringL3
model_engineer · Designing end-to-end ML systems from task spec to production deployment.
untested
Align models with SimPOskillEngineeringDataL3
simpo-training · Quick preference optimization without reward model or RL infrastructure.
untested
Build Claude integrationsskillEngineeringProductL2
claude-typescript-sdk · Integrating Claude into TypeScript backends and Node.js scripts.
untested
Train GLM models with SLIMEskillEngineeringDataL4
slime-rl-training · Research-grade RL training with flexible reward functions and algorithm variants.
untested
Avoid Sentry configuration pitfallsskillEngineeringL1
sentry-known-pitfalls · Quick diagnosis of Sentry setup failures and error categorization issues.
untested
Package research into missionsskillOpsEngineeringL2
research-mission-generator · Decomposing business goals into actionable research programs.
untested
Train agents with TorchForgeskillEngineeringDataL4
torchforge-rl-training · PyTorch-native RL training with hardware acceleration and custom loss functions.
untested
Fix PHP type errors automaticallyskillEngineeringL2
phpstan-resolver · Migrating PHP codebases to strict type checking incrementally.
untested
Fine-tune LLMs with TRLskillEngineeringDataL3
fine-tuning-with-trl · Multi-phase RLHF pipelines (SFT→Reward→PPO) where you control each alignment stage.
untested
Scale RL training with VeRLskillEngineeringDataL4
verl-rl-training · Production math/reasoning tasks (GSM8K, MATH) where you need proven RL algorithms at scale.
untested
Manage Node dependencies with pnpmskillEngineeringL1
pnpm · Monorepos needing deterministic builds, strict dependency isolation, and centralized version catalogs.
page 42 / 86