The library
Everything we index — ranked by what works, never by stars.
forSalesMarketingHRFinanceLegalOpsProductEngineeringDataProductivitySupportsetup≤ plug & play≤ + a key≤ multi-tool
● works · ● untested / no effect · ● hurts — every rank is measured against a no-skill baseline
untested★64→untested★9,423→untested★381→untested★64→untested★9,423→untested★381→untested★64→untested★9,423→untested★381→untested★64→untested★9,423→untested★64→untested★9,423→untested★381→untested★64→untested★9,423→untested★381→untested★64→untested★64→untested★381→untested★64→untested★9,423→untested★64→untested★64→untested★9,423→untested★64→untested★9,423→untested★381→untested★64→untested★9,423→untested★144→untested★64→untested★9,423→untested★64→untested★9,423→untested★64→untested★9,423→untested★381→untested★64→untested★9,423→untested★64→untested★9,423→untested★381→untested★64→untested★9,423→untested★381→untested★1→untested★64→untested★9,423→untested★381→
Communicate with project stakeholdersskillOpsL2
stakeholder-communication · Projects where stakeholder misalignment kills adoption; communication is connective tissue.
Build state-space LLM architecturesskillEngineeringL3
mamba-architecture · Long sequences (100K+ tokens), streaming inference, or memory-constrained deployments beating Transformer quadratic scaling.
Track client deliverables in GitHubskillOpsL2
deliverable-tracking · Client work where explicit tracking + dynamic labels beat unstructured task lists.
Learn behavioral neuroscience foundationsskillL1
behavioral-neuroscience · Understanding brain constraints on behavior, design decisions informed by neural limits, or psychopharmacology reasoning.
Implement GPT from scratchskillEngineeringL2
nanogpt · Learning transformer internals, quick prototyping, or experimenting with variants without framework overhead.
Triage Freshdesk tickets by priorityskillSupportL2
freshdesk-triage · High-volume support queues where manual triage is the bottleneck.
Understand clinical psychology disordersskillL1
clinical-foundations · Clinical practice where empirical assessment beats intuition for reliability and ethical practice.
Build RNN-Transformer hybrid modelsskillEngineeringL3
rwkv-architecture · Combining RNN efficiency with Transformer quality when linear-time inference matters.
Preview Denmark statistics tablesskillDataL1
tables · Organizing structured data where tables provide clarity and queryability.
Master core cognitive processesskillL1
cognitive-psychology · Understanding decision-making biases, memory constraints, or attention bottlenecks in system design.
Pretrain LLMs at scale with 4D parallelismskillEngineeringL4
distributed-llm-pretraining-torchtitan · Large-scale pretraining where single-node limits are exceeded and distributed coordination is unavoidable.
Map child and adult development stagesskillHRL1
developmental-psychology · Designing age-appropriate experiences, understanding delays, or intervening during critical periods.
Tokenize text 1GB in under 20 secondsskillEngineeringL2
huggingface-tokenizers · Fast, parallel tokenization when model-specific vocabularies and special tokens matter.
Generate 10-20 fresh ad angles fastskillMarketingL1
angle-generator · Marketing/comms where the same fact told through different angles reaches different segments.
Design and conduct psychology researchskillDataL1
research-methods-psych · Psychology research where internal/external validity and replicability are non-negotiable.
Build multilingual tokenizers for CJKskillEngineeringL2
sentencepiece · Multilingual models or custom vocabularies where language-agnostic approach beats language-specific tokenizers.
Generate images with crypto micropaymentsskillMarketingL2
nano-banana-blockrun · Generating production-ready images without API keys when you control on-chain payments via USDC micropayments, avoiding vendor lock-in.
Understand influence and group behaviorskillSalesHRL1
social-psychology · Diagnosing why teams conform to poor decisions or explaining how organizational pressure shapes individual behavior beyond rational choice.
Write incident postmortems that teachskillOpsL1
blameless-postmortem · Converting incidents into organization-wide learning while protecting the psychological safety required for honest incident reporting, especially in distributed systems and on-call rotations.
Audit ecosystem version alignmentskillEngineeringL1
ecosystem-alignment · Keeping a custom Claude Code setup in sync with upstream platform updates, preventing drift and discovering new capabilities to adopt.
Find root causes with causal inferenceskillOpsDataL2
rca-causal-inference · Incidents with rich quantitative data where you need defensible, reproducible, mathematical causal claims (not just narrative RCA), especially when distinguishing multiple confounding factors.
Fine-tune 70B models with <1% parametersskillEngineeringL3
peft-fine-tuning · Fine-tuning large models (70B+) on consumer hardware by training only 0.17% of parameters in a 6MB adapter, enabling cost-effective task-specific customization without full-model training.
Apply five whys and fishbone diagramsskillOpsL1
rca-classical-methods · Linear, single-cause shop-floor incidents (5 Whys), or multi-pathway quality incidents (Fishbone), or safety-critical system design review (FTA/FMEA) where classical techniques have proven power.
Debug distributed systems with tracesskillEngineeringL3
rca-distributed-systems · Production incidents in microservice meshes (Kubernetes, Istio, Linkerd) where the fault could originate in any of 10-100 services and the causal path runs through network hops, retries, and timeouts.
Interpret 70B models without local GPUskillEngineeringDataL3
nnsight-remote-interpretability · Running the same interpretability code on GPT-2 locally and Llama-405B remotely without code changes, enabling scalable mechanistic interpretability research on massive models.
Analyze human factors in incidentsskillOpsL1
rca-human-factors · Safety-critical industries (aviation, healthcare, emergency response, nuclear) where understanding why the operator's action was reasonable from their perspective reveals system design flaws to fix.
Patch activations to test causal claimsskillEngineeringL2
pyvene-interventions · guidance for performing causal interventions on PyTorch models using pyvene's declarative intervention framework
Screen for psychological distress quicklyskillHRL1
distress-screening · screening for nonspecific psychological distress or tracking symptom burden over time
Analyze complex socio-technical failuresskillOpsL2
rca-systems-theoretic · investigating incidents in healthcare, aviation, nuclear, autonomous systems, distributed microservices, or any system where multiple act...
Train sparse autoencoders to find featuresskillEngineeringDataL3
sparse-autoencoder-training · guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable f...
Merge docs and fixes to main without releaseskillEngineeringL1
push · Solving push challenges
Evaluate arguments and spot author biasskillL1
critical-reading · evaluating arguments, identifying bias, assessing source reliability, analyzing rhetoric, or synthesizing across multiple texts on the sa...
Reverse-engineer transformer internalsskillEngineeringDataL3
transformer-lens-interpretability · guidance for mechanistic interpretability research using TransformerLens to inspect and manipulate transformer internals via HookPoints a...
Find, evaluate, and cite informationskillL1
information-literacy · conducting research, evaluating sources, navigating digital information environments, teaching research skills, or addressing plagiarism ...
Deduplicate and filter training dataskillEngineeringL3
nemo-curator · GPU-accelerated data curation for LLM training
Analyze literary texts and themesskillL1
literary-analysis · interpreting literary texts, analyzing author craft, applying critical lenses, discussing theme and symbolism, or exploring how literary ...
Process ML datasets at scaleskillDataEngineeringL3
ray-data · Batch inference and preprocessing on 100GB+ datasets across multi-node clusters.
Cancel and debounce async requestsskillEngineeringL1
riverpod-cancel · Flutter apps cancelling requests when user navigates or triggers rapid refreshes.
Teach phonics and decoding skillsskillL1
phonics-decoding · Teaching early literacy and diagnosing reading errors via Running Records.
Fine-tune models with GRPOskillEngineeringDataL3
grpo-rl-training · Teaching specific output formats (XML, JSON) and verifiable tasks without preference pairs.
Build reading comprehension skillsskillL1
reading-comprehension · Teaching comprehension when decoding is automatic and meaning-construction needs scaffolding.
Train large MoE models efficientlyskillEngineeringDataL4
miles-rl-training · Training 1TB+ MoE models with speculative RL for 25%+ rollout speedup.
Interpret genetic variantsskillL1
variant-interpretation · Automated clinical variant reporting with ACMG evidence codes.
Develop vocabulary and word skillsskillL1
vocabulary-development · Teaching Tier 2 academic vocabulary when explicit word learning is needed.
Scale RLHF training with RayskillEngineeringDataL4
openrlhf-training · Scaling PPO/GRPO/RLOO/DPO training to 70B+ models with multi-node vLLM.
Check Ralph Specum statusskillL2
ralph-specum-status · Real-time verification of spectrum analyzer readiness before measurement campaigns.
Engineer new InCTRL modulesskillEngineeringL3
model_engineer · Designing end-to-end ML systems from task spec to production deployment.
Verify facts and sourcesskillOpsMarketingL2
data-fidelity · Discovering data quality issues before training to avoid garbage-in-garbage-out.
Align models with SimPOskillEngineeringDataL3
simpo-training · Quick preference optimization without reward model or RL infrastructure.
Build Claude integrationsskillEngineeringProductL2
claude-typescript-sdk · Integrating Claude into TypeScript backends and Node.js scripts.