389 KB, PDF document
Final published version
Research output: Working paper › Preprint
Research output: Working paper › Preprint
}
TY - UNPB
T1 - An Empirical Evaluation of the Rashomon Effect in Explainable Machine Learning
AU - Müller, Sebastian
AU - Toborek, Vanessa
AU - Beckh, Katharina
AU - Jakobs, Matthias
AU - Bauckhage, Christian
AU - Welke, Pascal
PY - 2023/6/27
Y1 - 2023/6/27
N2 - The Rashomon Effect describes the following phenomenon: for a given dataset there may exist many models with equally good performance but with different solution strategies. The Rashomon Effect has implications for Explainable Machine Learning, especially for the comparability of explanations. We provide a unified view on three different comparison scenarios and conduct a quantitative evaluation across different datasets, models, attribution methods, and metrics. We find that hyperparameter-tuning plays a role and that metric selection matters. Our results provide empirical support for previously anecdotal evidence and exhibit challenges for both scientists and practitioners.
AB - The Rashomon Effect describes the following phenomenon: for a given dataset there may exist many models with equally good performance but with different solution strategies. The Rashomon Effect has implications for Explainable Machine Learning, especially for the comparability of explanations. We provide a unified view on three different comparison scenarios and conduct a quantitative evaluation across different datasets, models, attribution methods, and metrics. We find that hyperparameter-tuning plays a role and that metric selection matters. Our results provide empirical support for previously anecdotal evidence and exhibit challenges for both scientists and practitioners.
KW - cs.LG
KW - cs.AI
M3 - Preprint
BT - An Empirical Evaluation of the Rashomon Effect in Explainable Machine Learning
PB - Arxiv
ER -