Explaining models through agnostic approaches

Lecture 17

Dr. Benjamin Soltoff

Cornell University
INFO 4940/5940 - Fall 2024

November 5, 2024

Announcements

Announcements

  • Homework 3
  • Extra credit

Learning objectives

  • Review the importance of explainability for machine learning models
  • Identify techniques for local and global explanations of machine learning predictions
  • Implement Shapley values for local explanations
  • Estimate permutation-based feature importance measures
  • Evaluate marginal effects of features using partial dependence plots

Explainability

Explanation

Answer to the “why” question

Why does my model predict

  • That Trump/Harris will win the election?
  • I will default on my loan?
  • My house is worth $300,000?

What is a good explanation?

  • Contrastive: Why was this prediction made instead of another prediction?
  • Selected: Focuses on just a handful of reasons, even if the problem is more complex
  • Social: Needs to be understandable by your audience
  • Truthful: Explanation should predict the event as truthfully as possible
  • Generalizable: Explanation could apply to many predictions

Types of model explanations

  • Global
  • Local

White-box model

Models that lend themselves naturally to interpretation

  • Linear regression
  • Logistic regression
  • Generalized linear model
  • Decision tree

Black-box model

  • Random forests
  • Boosted trees
  • Neural networks
  • Deep learning

Implementations in R

Predicting student debt

Application exercise

ae-17

  • Go to the course GitHub org and find your ae-17 (repo name will be suffixed with your GitHub name).
  • Clone the repo in RStudio, run renv::restore() to install the required packages, open the Quarto document in the repo, and follow along and complete the exercises.
  • Render, commit, and push your edits by the AE deadline – end of the day

Predicting student debt

Predicting student debt

Rows: 1,719
Columns: 14
$ unitid    <dbl> 100654, 100663, 100706, 100724, 100751, 100830, 100858, 1009…
$ name      <chr> "Alabama A & M University", "University of Alabama at Birmin…
$ state     <chr> "AL", "AL", "AL", "AL", "AL", "AL", "AL", "AL", "AL", "AL", …
$ type      <fct> "Public", "Public", "Public", "Public", "Public", "Public", …
$ admrate   <dbl> 0.7160, 0.8854, 0.7367, 0.9799, 0.7890, 0.9680, 0.7118, 0.65…
$ satavg    <dbl> 954, 1266, 1300, 955, 1244, 1069, NA, 1214, 1042, NA, 1111, …
$ cost      <dbl> 21924, 26248, 24869, 21938, 31050, 20621, 32678, 33920, 3645…
$ netcost   <dbl> 13057, 16585, 17250, 13593, 21534, 13689, 23258, 21098, 2037…
$ avgfacsal <dbl> 79011, 104310, 88380, 69309, 94581, 70965, 99837, 68724, 564…
$ pctpell   <dbl> 0.6853, 0.3253, 0.2377, 0.7205, 0.1712, 0.4821, 0.1301, 0.21…
$ comprate  <dbl> 0.2807, 0.6245, 0.6072, 0.2843, 0.7223, 0.3569, 0.8088, 0.69…
$ firstgen  <dbl> 0.3658281, 0.3412237, 0.3101322, 0.3434343, 0.2257127, 0.381…
$ debt      <dbl> 16600, 15832, 13905, 17500, 17986, 13119, 17750, 16000, 1500…
$ locale    <fct> City, City, City, City, City, City, City, City, City, Suburb…

Fitted models

library(tidyverse)
library(tidymodels)
library(rcis)
library(here)

# get scorecard dataset
data("scorecard")
scorecard <- scorecard |>
  # remove ID columns - causing issues when interpreting/explaining
  select(-unitid, -name) |>
  # convert factor columns to character
  mutate(across(.cols = where(is.factor), .f = as.character))

# split into training and testing
set.seed(123)

scorecard_split <- initial_split(data = scorecard, prop = .75, strata = debt)
scorecard_train <- training(scorecard_split)
scorecard_test <- testing(scorecard_split)

scorecard_folds <- vfold_cv(data = scorecard_train, v = 10)

# basic feature engineering recipe
scorecard_rec <- recipe(debt ~ ., data = scorecard_train) |>
  # catch-all category for state values not seen in the training data
  step_novel(state) |>
  # use median imputation for numeric predictors
  step_impute_median(all_numeric_predictors()) |>
  # use modal imputation for nominal predictors
  step_impute_mode(all_nominal_predictors()) |>
  # remove rows with missing values for the outcome -
  # glmnet won't work if the outcome column contains NAs
  step_naomit(all_outcomes())

# generate random forest model
rf_mod <- rand_forest() |>
  set_engine("ranger") |>
  set_mode("regression")

# combine recipe with model
rf_wf <- workflow() |>
  add_recipe(scorecard_rec) |>
  add_model(rf_mod)

# fit using training set
set.seed(123)
rf_wf <- fit(
  rf_wf,
  data = scorecard_train
)

# fit penalized regression model
## recipe
glmnet_recipe <- scorecard_rec |>
  # use median imputation for numeric predictors
  step_impute_median(all_numeric_predictors()) |>
  step_dummy(all_nominal_predictors()) |>
  step_zv(all_predictors()) |>
  step_normalize(all_numeric_predictors())

## model specification
glmnet_spec <- linear_reg(penalty = tune(), mixture = tune()) |>
  set_mode("regression") |>
  set_engine("glmnet")

## workflow
glmnet_workflow <- workflow() |>
  add_recipe(glmnet_recipe) |>
  add_model(glmnet_spec)

## tuning grid
glmnet_grid <- expand_grid(
  penalty = 10^seq(-6, -1, length.out = 20),
  mixture = c(0.05, 0.2, 0.4, 0.6, 0.8, 1)
)

## hyperparameter tuning
glmnet_tune <- tune_grid(
  glmnet_workflow,
  resamples = scorecard_folds,
  grid = glmnet_grid
)

# select best model
glmnet_best <- select_best(glmnet_tune, metric = "rmse")
glmnet_wf <- finalize_workflow(glmnet_workflow, glmnet_best) |>
  last_fit(scorecard_split) |>
  extract_workflow()

# nearest neighbors model
## use glmnet recipe
kknn_spec <- nearest_neighbor(neighbors = 10) |>
  set_mode("regression") |>
  set_engine("kknn")

kknn_workflow <-
  workflow() |>
  add_recipe(glmnet_recipe) |>
  add_model(kknn_spec)

## fit using training set
set.seed(123)
kknn_wf <- fit(
  kknn_workflow,
  data = scorecard_train
)

# save all required objects to a .Rdata file
save(scorecard_train, scorecard_test, rf_wf, glmnet_wf, kknn_wf,
     file = here("slides/data/scorecard-models.RData"))

Evaluating test set performance
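A minimal sketch of how test-set performance might be compared across the three fitted workflows, using augment() and a {yardstick} metric set (the choice of RMSE, R-squared, and MAE here is an assumption):

# compare test-set performance for the three fitted workflows
reg_metrics <- metric_set(rmse, rsq, mae)

bind_rows(
  augment(rf_wf, new_data = scorecard_test) |> mutate(model = "random forest"),
  augment(glmnet_wf, new_data = scorecard_test) |> mutate(model = "penalized regression"),
  augment(kknn_wf, new_data = scorecard_test) |> mutate(model = "k nearest neighbors")
) |>
  group_by(model) |>
  reg_metrics(truth = debt, estimate = .pred)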

Explainers with {DALEX} and {DALEXtra}

Create an explainer object

# load DALEX/DALEXtra for model-agnostic explainers
library(DALEX)
library(DALEXtra)

explainer_glmnet <- explain_tidymodels(
  model = glmnet_wf,
  data = scorecard_train |> select(-debt),
  y = scorecard_train$debt,
  label = "penalized regression",
  verbose = FALSE
)

explainer_rf <- explain_tidymodels(
  model = rf_wf,
  data = scorecard_train |> select(-debt),
  y = scorecard_train$debt,
  label = "random forest",
  verbose = FALSE
)

explainer_kknn <- explain_tidymodels(
  model = kknn_wf,
  data = scorecard_train |> select(-debt),
  y = scorecard_train$debt,
  label = "k nearest neighbors",
  verbose = FALSE
)

Local explanations

Local explanations

Provide information about a prediction for a single observation.

Cornell University

Rows: 1
Columns: 12
$ state     <chr> "NY"
$ type      <fct> "Private, nonprofit"
$ admrate   <dbl> 0.0869
$ satavg    <dbl> 1510
$ cost      <dbl> 77047
$ netcost   <dbl> 29011
$ avgfacsal <dbl> 141849
$ pctpell   <dbl> 0.1737
$ comprate  <dbl> 0.9414
$ firstgen  <dbl> 0.154164
$ debt      <dbl> 13000
$ locale    <fct> City

Ithaca College

Rows: 1
Columns: 12
$ state     <chr> "NY"
$ type      <fct> "Private, nonprofit"
$ admrate   <dbl> 0.7773
$ satavg    <dbl> NA
$ cost      <dbl> 65274
$ netcost   <dbl> 33748
$ avgfacsal <dbl> 81369
$ pctpell   <dbl> 0.2029
$ comprate  <dbl> 0.7717
$ firstgen  <dbl> 0.1375752
$ debt      <dbl> 19500
$ locale    <fct> Suburb

Breakdown methods

  • Show how contributions attributed to individual features shift the model’s average prediction to the prediction for a particular observation
  • Sequentially fix the value of individual features and examine the change in the prediction
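A minimal sketch of how a breakdown profile might be computed with {DALEX}, assuming the explainers created earlier and a one-row data frame cornell holding the Cornell University observation shown above:

# breakdown of the random forest prediction for Cornell University
# `cornell` is assumed to contain the single row shown earlier
rf_breakdown <- predict_parts(
  explainer = explainer_rf,
  new_observation = cornell |> select(-debt),
  type = "break_down"  # sequentially fix features, record the change in prediction
)

rf_breakdown
plot(rf_breakdown)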

Breakdown of Cornell University using the random forest model

Break it down again

Breakdown of random forest

Breakdown plots

Advantages

  • Easy to understand
  • Compact visualization
  • Intuitive explanation for linear models

Disadvantages

  • Ignores interactive contributions (assumes everything is additive)
  • Ordering of the explanatory variables influences the breakdown and resulting explanation
  • Harder to interpret for models with lots of predictors

Shapley Additive Explanations (SHAP)

Shapley Additive Explanations (SHAP)

  • Average contributions of features are computed under different coalitions of feature orderings
  • Randomly permute feature order using \(B\) combinations
  • Average across individual breakdowns to calculate feature contribution to individual prediction
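A minimal sketch with {DALEX}, assuming the explainer and cornell observation from earlier slides; B sets how many random feature orderings are averaged:

# SHAP values for the random forest prediction for Cornell University
set.seed(123)
rf_shap <- predict_parts(
  explainer = explainer_rf,
  new_observation = cornell |> select(-debt),
  type = "shap",
  B = 25  # number of random orderings of the features
)

plot(rf_shap)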

SHAP for Cornell

Cornell University vs. Ithaca College

Cornell University vs. Ithaca College

Shapley values

Advantages

  • Model-agnostic
  • Strong formal foundation from game theory
  • Considers all (or many) possible feature orderings

Disadvantages

  • Ignores interactive contributions (assumes everything is additive)
  • Larger number of predictors makes it impossible to consider all possible coalitions
  • Computationally expensive

⏱️ Your turn

  • Review the DALEX code to estimate Shapley values for Cornell University
  • Estimate Shapley values for Ithaca College and interpret
  • Estimate Shapley values for universities of your choosing
10:00

Global explanations

Global explanations

Understand which features are most important in driving the predictions of the models overall, aggregated across the training set.

Feature importance

Feature importance

  • Relative importance of each feature in a dataset for predicting the outcome
  • How each feature contributes to the model’s predictions
  • Model-specific techniques
    • Lasso regression
    • Random forests
  • Model-agnostic approach

Permutation-based feature importance

  • Calculate the increase in the model’s prediction error after permuting the feature
    • Randomly shuffle the feature’s values across observations
  • Important feature: permuting its values substantially increases the prediction error
  • Unimportant feature: permuting its values leaves the prediction error largely unchanged
For any given loss function do
1: compute loss function for original model
2: for variable i in {1,...,p} do
     | randomize values
     | apply given ML model
     | estimate loss function
     | compute feature importance (permuted loss / original loss)
   end
3. Sort variables by descending feature importance   
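A minimal sketch of this algorithm with {DALEX}'s model_parts(), assuming the explainers created earlier; the RMSE loss and the number of permutation rounds are assumptions:

# permutation-based feature importance for the random forest explainer
set.seed(123)
vip_rf <- model_parts(
  explainer = explainer_rf,
  loss_function = loss_root_mean_square,  # loss computed before/after permuting
  B = 10,                                 # permutation rounds to average over
  type = "ratio"                          # permuted loss / original loss
)

plot(vip_rf)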

Random forest feature importance

Number of observations permuted

Measuring changes

Compare all models

Permutation-based feature importance

Advantages

  • Clear interpretation
  • Succinct measure
  • Does not require retraining the model
  • Takes into account all interactions

Disadvantages

  • Permutation adds randomness, so results may vary greatly between runs
  • Computationally expensive
  • Linked to the error of the model
  • Need access to the true outcome
  • Takes into account all interactions

⏱️ Your turn

  • Review the DALEX code to estimate permutation-based feature importance for the debt prediction models
  • Estimate permutation-based feature importance for the debt prediction models
  • Interpret the resulting statistics - which features are most/least important? How does the model choice influence the results?
10:00

Building global explanations from local explanations

Individual conditional expectation (ICE)

  • Ceteris paribus - “other things held constant”
  • Marginal effect a feature has on the predicted outcome
  • Counterfactual comparison - what would the prediction be if this observation had a different value for the feature?
  • Plot one observation that shows how the observation’s prediction changes when a feature changes
For a selected predictor (x)
1. Construct a grid of j evenly spaced values across the distribution 
   of x: {x1, x2, ..., xj}
2. For i in {1,...,j} do
     | Copy the training data and replace the original values of x 
       with the constant xi
     | Apply given ML model (i.e., obtain vector of predictions)
   End
3. Plot the predictions against x1, x2, ..., xj with lines connecting 
   observations that correspond to the same row number in the original
   training data
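With {DALEX}, the single-observation version of this idea is a ceteris-paribus profile from predict_profile(); a minimal sketch assuming the explainer and cornell observation from earlier slides, with net cost as the (assumed) feature of interest:

# ceteris-paribus (ICE) profile: vary net cost for Cornell University
# while holding its other feature values constant
cornell_cp <- predict_profile(
  explainer = explainer_rf,
  new_observation = cornell |> select(-debt),
  variables = "netcost"
)

plot(cornell_cp, variables = "netcost")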

Partial dependence plot (PDP)

  • Average multiple ICEs to estimate the marginal effect of a feature on the outcome of interest
For a selected predictor (x)
1. Construct a grid of j evenly spaced values across the distribution
   of x: {x1, x2, ..., xj}
2. For i in {1,...,j} do
     | Copy the training data and replace the original values of x 
       with the constant xi
     | Apply given ML model (i.e., obtain vector of predictions)
     | Average predictions together
   End
3. Plot the averaged predictions against x1, x2, ..., xj
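A minimal sketch with {DALEX}'s model_profile(), which averages ceteris-paribus profiles over a sample of N observations; the variable choice and sample size here are assumptions:

# partial dependence of predicted debt on net cost (random forest)
pdp_netcost <- model_profile(
  explainer = explainer_rf,
  variables = "netcost",
  N = 100,           # observations sampled for the underlying profiles
  type = "partial"
)

plot(pdp_netcost)                     # aggregated PDP curve
plot(pdp_netcost, geom = "profiles")  # overlay the individual ICE curves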

Net cost (PDP)

Net cost (PDP + ICE)

Net cost (PDP + ICE) – all models

Type (PDP)

State (PDP)

State (PDP + ICE)

Partial dependence plot (PDP)

Advantages

  • Intuitive
  • Clear interpretation (assuming variables are uncorrelated)
  • Straightforward implementation

Disadvantages

  • Limited to one or two variables
  • Assumes independence of features
  • Heterogeneous effects might be hidden - add ICE curves to visualize heterogeneity

⏱️ Your turn

  • Review the DALEX code to estimate partial dependence profiles for the debt prediction models
  • Interpret how the average faculty salary influences the predictions of the debt models
10:00

Wrap-up

Recap

  • Explainability is crucial for understanding and trusting machine learning models
  • Breakdown profiles and Shapley values provide local explanations for individual predictions
  • Permutation-based feature importance provides insight into the importance of features in a model
  • Partial dependence plots provide a global view of the relationship between a feature and the model’s predictions

Halloween 2024