cyberneticlibrary

Run A/B tests and causal inference

senior-data-scientistskillsetup L317,464
alirezarezvani/claude-skills
What it does

Build ML pipelines with feature engineering, hyperparameter tuning, cross-validation, and experiment tracking

Best for

Reproducible ML research where feature selection, experiment tracking, and cross-validation are non-negotiable.

Inputs
  • · dataset (CSV/Parquet)
  • · model requirements (classification/regression/clustering)
Outputs
  • · trained model
  • · evaluation metrics (F1/AUC/RMSE)
  • · feature importance analysis
Requires
  • · scikit-learn
  • · pandas
  • · PyTorch or TensorFlow
  • · MLflow
Preconditions

Python 3.8+, labeled data, experiment tracking setup

Failure modes
  • · Data leakage between train/test splits
  • · Model overfits on small dataset
Trust signals
  • · Cross-validation k-fold
  • · Feature importance plots
  • · MLflow experiment logs