We are pleased to share with you our FAQctuary 2020-2 "Features with flat partial dependence plots: not important?"
This paper focuses on partial dependence plots which are often used when modeling with machine learning techniques in order to better understand the effects of the features on the conditional expectation of the response variable. However, these plots must be interpreted with caution. Indeed, they can easily lead to wrong interpretations in case the analyst is not enough familiar with these plots. A typical situation is the case where a feature is important because of its interactions with others while its partial dependence plot is flat. In such a case, an analyst who would only base his analysis on this plot could be tempted to conclude that the feature is not important to explain the conditional expectation of the response while he would be wrong. In this FAQctuary, we aim to illustrate such a situation with the help of a simulated example that is very simple.
To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data. Not consenting or withdrawing consent, may adversely affect certain features and functions.
The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
The technical storage or access that is used exclusively for statistical purposes.The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.