Run A/B tests and causal inference
senior-data-scientistskillsetup L3★17,464
alirezarezvani/claude-skills ↗What it does
Build ML pipelines with feature engineering, hyperparameter tuning, cross-validation, and experiment tracking
Best for
Reproducible ML research where feature selection, experiment tracking, and cross-validation are non-negotiable.
Inputs
- · dataset (CSV/Parquet)
- · model requirements (classification/regression/clustering)
Outputs
- · trained model
- · evaluation metrics (F1/AUC/RMSE)
- · feature importance analysis
Requires
- · scikit-learn
- · pandas
- · PyTorch or TensorFlow
- · MLflow
Preconditions
Python 3.8+, labeled data, experiment tracking setup
Failure modes
- · Data leakage between train/test splits
- · Model overfits on small dataset
Trust signals
- · Cross-validation k-fold
- · Feature importance plots
- · MLflow experiment logs