cyberneticlibrary

Run exploratory data analysis and visualizations

analystsubagentsetup L31
morganmuli/metaskill
What it does

Perform exploratory data analysis, visualization, statistical tests, and reporting

Best for

Exploratory investigation of a dataset when the question is broad (profile, compare, find patterns) rather than a narrow hypothesis test

Inputs
  • raw dataset or model output
  • question/hypothesis to investigate
  • expected output format (notebook/report)
Outputs
  • distribution plots, correlation matrices
  • statistical test results with p-values
  • markdown report with narrative insights and visuals
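
A minimal sketch of two of the outputs listed above: a correlation matrix and a statistical test result with a p-value. The data is synthetic and the column names (`group`, `x`, `y`) are hypothetical; Welch's t-test is only one reasonable choice and is an assumption here, not something the skill prescribes.

```python
import numpy as np
import pandas as pd
from scipy import stats

# Synthetic stand-in for a real dataset; column names are hypothetical.
rng = np.random.default_rng(0)
df = pd.DataFrame({
    "group": np.repeat(["a", "b"], 200),
    "x": rng.normal(0.0, 1.0, 400),
})
df["y"] = 2.0 * df["x"] + rng.normal(0.0, 0.5, 400)
df.loc[df["group"] == "b", "y"] += 1.0  # shift group b so the groups differ

# Pearson correlation matrix over the numeric columns only.
corr = df[["x", "y"]].corr()

# Welch's t-test (no equal-variance assumption) comparing y between groups.
a = df.loc[df["group"] == "a", "y"]
b = df.loc[df["group"] == "b", "y"]
result = stats.ttest_ind(a, b, equal_var=False)

print(corr.round(2))
print(f"t = {result.statistic:.2f}, p = {result.pvalue:.3g}")
```

In a report, the p-value would normally be stated alongside the test used and the sample sizes, so the reader can judge it in context.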
Requires
  • pandas
  • scipy
  • matplotlib/seaborn
  • Jupyter notebooks
Preconditions
  • data accessible and loadable
  • question specific enough for EDA
Failure modes
  • dataset too large to load in memory
  • missing values too prevalent for summaries to be reliable
  • hypothesis requires domain knowledge not in the data
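
The first two failure modes can often be checked up front. The sketch below streams a CSV in chunks with `pandas.read_csv(chunksize=...)` and accumulates the fraction of missing values per column, so the file never has to fit in memory at once. The helper name `missing_fraction_by_chunk`, the tiny in-memory CSV, and the 0.5 cutoff are all illustrative assumptions.

```python
import io
import pandas as pd

# Tiny in-memory CSV standing in for a file too large to load at once.
csv_text = "a,b,c\n1,,x\n2,,y\n3,5.0,\n4,,z\n"

def missing_fraction_by_chunk(source, chunksize=2):
    """Stream a CSV in chunks and return the fraction of missing values per column."""
    missing = None
    rows = 0
    for chunk in pd.read_csv(source, chunksize=chunksize):
        counts = chunk.isna().sum()
        missing = counts if missing is None else missing + counts
        rows += len(chunk)
    return missing / rows

frac = missing_fraction_by_chunk(io.StringIO(csv_text))

# Hypothetical cutoff: flag columns with more than half their values missing.
too_sparse = frac[frac > 0.5].index.tolist()

print(frac)
print(too_sparse)
```

What counts as "too prevalent" depends on the question being asked, so the cutoff is best treated as a parameter rather than a fixed rule.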
Trust signals
  • reads .describe(), .info(), .corr() systematically
  • identifies outliers with IQR/z-score
  • tests for stationarity in time series
  • makes visualization best practices explicit
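
As a rough illustration of the outlier checks named above, the sketch below builds boolean masks with the IQR rule and with z-scores. The function names, the sample values, and the defaults (`k=1.5`, `threshold=3.0`) are conventional but assumed here, not taken from the skill itself. Stationarity testing is left out because it would typically need a library outside the Requires list.

```python
import pandas as pd

def iqr_outliers(s: pd.Series, k: float = 1.5) -> pd.Series:
    """Boolean mask: True where a value lies outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, q3 = s.quantile(0.25), s.quantile(0.75)
    iqr = q3 - q1
    return (s < q1 - k * iqr) | (s > q3 + k * iqr)

def zscore_outliers(s: pd.Series, threshold: float = 3.0) -> pd.Series:
    """Boolean mask: True where |z| exceeds the threshold (sample std, ddof=1)."""
    z = (s - s.mean()) / s.std()
    return z.abs() > threshold

# Hypothetical sample: values near 10 plus one obvious outlier.
values = pd.Series(
    [9.8, 10.1, 10.0, 9.9, 10.2, 10.0, 9.7, 10.3, 10.1, 9.9, 10.0, 50.0]
)

print(values.describe())
print(values[iqr_outliers(values)].tolist())
print(values[zscore_outliers(values)].tolist())
```

The two rules can disagree on small samples: with the sample standard deviation, no point among 10 or fewer values can reach a z-score above 3, so the IQR rule is usually the safer first pass there.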