Run exploratory data analysis and visualizations
analystsubagentsetup L3★1
morganmuli/metaskill ↗What it does
Perform exploratory data analysis, visualization, statistical tests, and reporting
Best for
Exploratory investigation of datasets when the question is broad (profile, compare, find patterns) vs narrow hypothesis testing
Inputs
- · raw dataset or model output
- · question/hypothesis to investigate
- · expected output format (notebook/report)
Outputs
- · distribution plots, correlation matrices
- · statistical test results with p-values
- · markdown report with narrative insights and visuals
Requires
- · pandas
- · scipy
- · matplotlib/seaborn
- · Jupyter notebooks
Preconditions
- · data accessible and loadable
- · question specific enough for EDA
Failure modes
- · dataset too large to load in memory
- · missing values too prevalent
- · hypothesis requires domain knowledge not in data
Trust signals
- · reads .describe(), .info(), .corr() systematically
- · identifies outliers with IQR/z-score
- · tests for stationarity in time series
- · visualization best practices explicit