cyberneticlibrary

Run exploratory data analysis and visualizations

analystsubagentsetup L3★1

morganmuli/metaskill ↗

What it does

Perform exploratory data analysis, visualization, statistical tests, and reporting

Best for

Exploratory investigation of datasets when the question is broad (profile, compare, find patterns) vs narrow hypothesis testing

Inputs

· raw dataset or model output
· question/hypothesis to investigate
· expected output format (notebook/report)

Outputs

· distribution plots, correlation matrices
· statistical test results with p-values
· markdown report with narrative insights and visuals

Requires

· pandas
· scipy
· matplotlib/seaborn
· Jupyter notebooks

Preconditions

· data accessible and loadable
· question specific enough for EDA

Failure modes

· dataset too large to load in memory
· missing values too prevalent
· hypothesis requires domain knowledge not in data

Trust signals

· reads .describe(), .info(), .corr() systematically
· identifies outliers with IQR/z-score
· tests for stationarity in time series
· visualization best practices explicit