Explaining Deep Learning Models Through Rule-Based Approximation and Visualization

Home > Research > Publications & Outputs > Explaining Deep Learning Models Through Rule-Ba...

Computing and Communications

Associated organisational units

Electronic data

Explaining Deep Learning Models Through Rule-Based Approximation and Visualization
Rights statement: ©2020 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
Accepted author manuscript, 834 KB, PDF document
Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Text available via DOI:

https://doi.org/10.1109/TFUZZ.2020.2999776
Final published version

Keywords

Deep Reinforcement Learning, explainable AI, rule-based models, prototype- and density-based models, density-based input selection, autonomous driving

View graph of relations

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Eduardo Almeida Soares
Plamen Angelov
Bruno Costa
Marcos Castro
Subramanya Nageshrao
Dimitar Filev

More...

<mark>Journal publication date</mark>	31/08/2021
<mark>Journal</mark>	IEEE Transactions on Fuzzy Systems
Issue number	8
Volume	29
Number of pages	9
Pages (from-to)	2399-2407
Publication Status	Published
Early online date	3/06/20
<mark>Original language</mark>	English

Abstract

This paper describes a novel approach to the problem of developing explainable machine learning models. We consider a Deep Reinforcement Learning (DRL) model representing a highway path planning policy for autonomous highway driving. The model constitutes a mapping from the continuous multidimensional state space characterizing vehicle positions and velocities to a discrete set of actions in longitudinal and lateral direction. It is obtained by applying a customized version of the Double Deep Q-Network (DDQN) learning algorithm. The main idea is to approximate the DRL model with a set of IF…THEN rules that provide an alternative interpretable model, which is further enhanced by visualizing the rules. This concept is rationalized by the universal approximation properties of the rule-based models with fuzzy predicates. The proposed approach includes a learning engine composed of 0-order fuzzy rules, which generalize locally around the prototypes by using multivariate function models. The adjacent (in the data space) prototypes, which correspond to the same action are further grouped and merged into so-called "MegaClouds" reducing significantly the number of fuzzy rules. The input selection method is based on ranking the density of the individual inputs. Experimental results show that the specific DRL agent can be interpreted by approximating with families of rules of different granularity. The method is computationally efficient and can be potentially extended to addressing the explainability of the broader set of fully connected deep neural network models

Bibliographic note

©2020 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Research

Associated organisational units

Electronic data

Links

Text available via DOI:

Keywords