Partial Dependence Plots: “Let me show you what the model predicts on average when each data instance has the value v for that feature. I ignore whether the value v makes sense for all data instances.”
M-Plots: “Let me show you what the model predicts on average for data instances that have values close to v for that feature. The effect could be due to that feature, but also due to correlated features.”
ALE plots: “Let me show you how the model predictions change in a small”window" of the feature around v for data instances in that window."
Advantages
可以處理有關聯性的變數
計算速度較PDP快
解讀較易懂,由於ALE plots集中在0,條件機率下因子的變化較為明顯
Disadvantages
ALE plots can become a bit shaky (many small ups and downs) with a high number of intervals.
無法看到每一個因子異質性的影響效果
Second-order ALE 在不同feature 隔窗的穩定性差異大,因為每一的隔窗的資料是不同的
Second-order effect plots can be a bit annoying to interpret
implementation of ALE plots is much more complex也較PDP不直覺