Distill knowledge of additive tree models into generalized linear models

Abstract

Generalized additive models (GAMs) are a leading model class for interpretable machine learning. GAMs were originally trained using smoothing splines. Recently, tree-based GAMs where shape functions are gradient-boosted ensembles of bagged trees were proposed (e.g. Explainable Boosting Machine). In this paper, we introduce a competing three-step GAM learning approach where we combine i) the knowledge of the way to split the covariates space brought by an Additive tree model (ATM), ii) an ensemble of predictive linear scores derived from Generalized linear models (GLMs) using a binning strategy based on the ATM, iii) a final GLM to have a prediction model that ensures auto-calibration. Numerical experiments illustrate the very good performances of our approach on several datasets compared to GAM with splines, EBM or GLM with binarsity penalization. A case-study in trade credit insurance is also provided.

Keywords: Additive tree ensembles, Auto-calibration, Generalized additive models, Generalized linear models, Partitioning methods, XAI.

Sector: Insurance
Expertise: Machine learning
Authors: Arthur Maillart,
Christian Y. Robert

Publisher: Detralytics
Date: September 2023
Language: English
Pages: 22
Reference : Detra Note 2023-6

About the authors

Rond_Arthur

Arthur Maillart

Christian Y. Robert

Share This Post

More To Explore

Do You Want To Boost Your Business?

drop us a Message and keep in touch