Exploratory data analysis
Ten modules, each with a Jupyter notebook and a PDF export. Each section includes a PDF preview and, when repo metadata is available, a link to open the notebook on GitHub. Jump to a module below.
Palmer Penguins — load, inspect, missing values, plots.
Auto MPG — train/test split, metrics, coefficients.
Iris — scaling, confusion matrix, decision boundary.
Wine dataset — train a tree, plot, feature importance.
Sonar — rock vs mine classification.
Iris — DMatrix API and sklearn classifier.
XGBoost, LightGBM, CatBoost on bank marketing data.
Mall customers — elbow method, segments, silhouette.
Penguins & wine — variance, 2D projection, reconstruction.
Microsoft stock — seasonality, future horizon, MAE.