Computation of variable importance from a bag of models. Variable importance can
be computed either by dropping variables or by permuting the variables' values
(parameter type), typically by evaluating the effect on the model evaluation metric
in the validation set(s). If type = "drop", the variables are dropped from the model
one at a time and the change in the model evaluation metric is computed. If
type = "permutation", the observations of each variable are permuted and the change
in the model evaluation metric is computed.
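The two approaches described above can be sketched in base R. This is a minimal illustration under stated assumptions, not the implementation used by this function: it uses a single lm() fit instead of a bag of models, a hypothetical metric (squared correlation between observed and predicted values), simulated data, and it treats "dropping" as refitting the model without the variable, which may differ from how the function handles it.

```r
# Minimal sketch in base R; all names below are illustrative assumptions.
set.seed(1)
n <- 200
dat <- data.frame(x1 = rnorm(n), x2 = rnorm(n), x3 = rnorm(n))
dat$y <- 2 * dat$x1 + 0.5 * dat$x2 + rnorm(n, sd = 0.5)  # x3 is pure noise

train <- dat[1:150, ]
valid <- dat[151:200, ]

# Hypothetical evaluation metric: squared correlation of observed vs. predicted
metric <- function(model, newdata) {
  cor(newdata$y, predict(model, newdata = newdata))^2
}

vars <- c("x1", "x2", "x3")
full <- lm(reformulate(vars, response = "y"), data = train)
baseline <- metric(full, valid)

# "drop": refit without each variable, one at a time (assumed interpretation)
imp_drop <- sapply(vars, function(v) {
  reduced <- lm(reformulate(setdiff(vars, v), response = "y"), data = train)
  baseline - metric(reduced, valid)
})

# "permutation": shuffle each variable in the validation set, keep the model fixed
n_permutations <- 100
imp_perm <- sapply(vars, function(v) {
  losses <- replicate(n_permutations, {
    shuffled <- valid
    shuffled[[v]] <- sample(shuffled[[v]])
    baseline - metric(full, shuffled)
  })
  mean(losses)
})

# Larger values mean a larger loss in the metric, i.e. a more important variable
sort(imp_drop, decreasing = TRUE)
sort(imp_perm, decreasing = TRUE)
```

In both cases importance is expressed as the loss in the validation metric relative to the full model, so the strongest predictor in the simulated data (x1) should rank first.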
Arguments
- x
[list]
Bag of models, result of bag_models(). It contains information about the models, such as the formula, weights, coefficients, and the metric used to evaluate them.
- data
[data.frame]
Complete data set to which the models were applied.
- samples
[list]
List of samples used to fit the models in the bag. The list contains at least three elements: train, test, and validate. Each element may itself contain several elements, each holding the rows of data to be sampled for one resample. Typically, this is computed by the function create_resamples().
- type
[character(1)="drop"]{"drop", "permutation"}
Type of computation for variable importance. If type = "drop" (default), the variables are dropped from the model one at a time and the change in the model evaluation metric is computed. If type = "permutation", the observations of each variable are permuted and the change in the model evaluation metric is computed.
- colH0
[string(1)=NULL]
String with the name of the column in data representing the block H0, in case we want the variable importance to be evaluated for each block. Default is NULL, in which case variable importance is assessed for all the data.
- n_permutations
[numeric(1)=100]
Number of permutations, if type = "permutation".
- order
[character,logical(1)="desc"]{"desc", "asc", FALSE}
Whether to order the output variables by descending (order = "desc") or ascending (order = "asc") variable importance. If FALSE, the variables are shown in the same order as they appear in the bag of models, x.
- plot
[logical(1)=FALSE]
Should variable importance be plotted? Default is FALSE.
- remove_threshold
[numeric(1)]
Threshold for excluding variables with little importance from the variable importance plot (i.e. only considered if plot = TRUE). See more in plot_importance().
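To make the expected shape of the samples argument more concrete, the sketch below builds a list with train, test, and validate elements, each holding one vector of row indices per resample. It is only an illustration of the structure described above: the data frame, the 60/20/20 split, and the number of resamples are assumptions, and the actual output of create_resamples() may be organized or named differently.

```r
# Illustrative structure only; not the output of create_resamples().
set.seed(42)
data <- data.frame(id = 1:100, block = rep(c("A", "B"), each = 50))
n_resamples <- 3

# One random split of row indices per resample: 60% train, 20% test, 20% validate
splits <- lapply(seq_len(n_resamples), function(i) {
  idx <- sample(nrow(data))
  list(train = idx[1:60], test = idx[61:80], validate = idx[81:100])
})

# Rearrange so that each top-level element holds one index vector per resample
samples <- list(
  train    = lapply(splits, `[[`, "train"),
  test     = lapply(splits, `[[`, "test"),
  validate = lapply(splits, `[[`, "validate")
)

# Rows of `data` used for validation in the first resample
head(data[samples$validate[[1]], ])
```

With this layout, the validation rows for resample i are data[samples$validate[[i]], ], which is the subset on which the change in the evaluation metric would typically be computed.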
See also
For plotting variable importance, see plot_importance().