The notation differs slightly from the original papers; I have added extra index notation to clarify the meaning of each term.

Overview

Layer-wise Relevance Propagation (LRP) finds relevance scores for individual features in the input data by decomposing the output predictions of the neural network.

[Figure: xai-lrp-example]

The propagation rule strictly obeys the conservation property: what has been received by a neuron must be redistributed to the lower layer in equal amount. In other words,

$$\sum_{j} R_{j \leftarrow k} = R_k$$

for any neuron $k$, where $R_{j \leftarrow k}$ denotes the share of $R_k$ redistributed to neuron $j$ of the lower layer.

LRP rules

[Figure: xai-lrp-general-process]

Relevance score propagation from a given layer (the $(l+1)$-th) onto the lower layer (the $l$-th) follows

$$R_j = \sum_{k \in K} \frac{z_{jk}}{\sum_{j' \in J} z_{j'k}}\, R_k,$$

where $J$ and $K$ are the index sets of all neurons in the $l$-th and $(l+1)$-th layers, respectively.

Consequently,

$$\sum_{i} R_i = \dots = \sum_{j} R_j = \sum_{k} R_k = \dots = f(\mathbf{x}),$$

where $f(\mathbf{x})$ is the prediction of the neural network for an input $\mathbf{x}$.

The quantity $z_{jk}$ indicates how much neuron $j$ has contributed to make neuron $k$ relevant. There are several possible variants of LRP according to the choice of $z_{jk}$.
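
As a quick numerical sanity check (a minimal sketch with made-up numbers, assuming all $z_{jk}$ are positive so that no denominator vanishes), the generic rule and the conservation property can be verified in a few lines of NumPy:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy contributions z[j, k] of 4 lower-layer neurons j to 3 upper-layer neurons k,
# together with the relevance R_k already assigned to the upper layer.
z = rng.uniform(0.1, 1.0, size=(4, 3))
R_upper = np.array([0.5, 1.2, 0.3])

# Generic LRP rule: R_j = sum_k  z_jk / (sum_j' z_j'k) * R_k
R_lower = (z / z.sum(axis=0, keepdims=True)) @ R_upper

# Conservation: what the lower layer receives equals what the upper layer sent.
print(R_lower.sum(), R_upper.sum())   # both 2.0 up to floating-point error
```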

Basic rule (LRP-0)

LRP-0 redistributes the relevance score in proportion to the contribution of each input to the neuron activation.

Let $a_j$ be the activation of neuron $j$, and let $w_{jk}$ be the weight connecting neuron $j$ to neuron $k$ of the subsequent layer. Then, LRP-0 is defined by the choice

$$z_{jk} = a_j w_{jk},$$

or in other words,

$$R_j = \sum_k \frac{a_j w_{jk}}{\sum_{0,j} a_j w_{jk}}\, R_k.$$

Note that the sum in the denominator runs over all neurons of the lower layer, including the bias term (treated as a neuron with constant activation $a_0 = 1$ and weight $w_{0k} = b_k$).

Although this rule looks intuitive, a uniform application of LRP-0 to the entire network has been shown to be equivalent to Gradient × Input, and such gradients are usually noisy.
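
A minimal sketch of the LRP-0 backward pass through a single dense layer, assuming the forward activations are already available; the variable names (`a_lower`, `w`, `b`, `R_upper`) are illustrative, not taken from any particular library:

```python
import numpy as np

def lrp_0(a_lower, w, b, R_upper):
    """LRP-0 for one dense layer: z_jk = a_j * w_jk, with the bias as neuron 0."""
    z = a_lower[:, None] * w                  # z[j, k] = a_j * w_jk
    denom = z.sum(axis=0) + b                 # sum_{0,j} a_j w_jk (bias included)
    s = R_upper / denom                       # assumes no denominator is exactly zero
    return a_lower * (w @ s)                  # R_j = a_j * sum_k w_jk * s_k

rng = np.random.default_rng(0)
a_lower = np.maximum(0, rng.normal(size=5))   # ReLU activations of the lower layer
w, b = rng.normal(size=(5, 3)), rng.normal(size=3)
R_upper = np.maximum(0, a_lower @ w + b)      # upper activations used as initial relevance

R_lower = lrp_0(a_lower, w, b, R_upper)
# Relevance absorbed by the bias neuron is not redistributed further, so
# sum(R_lower) can differ from sum(R_upper) by exactly the bias share.
```

In practice the same step is applied layer by layer, from the output all the way down to the input.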

Epsilon rule (LRP-ε)

LRP-ε adds a small positive term $\epsilon$ to the denominator:

$$R_j = \sum_k \frac{a_j w_{jk}}{\epsilon + \sum_{0,j} a_j w_{jk}}\, R_k.$$

The $\epsilon$ term absorbs some relevance when the contributions to the activation of neuron $k$ are weak or contradictory (i.e., it diminishes the relevance score).

As $\epsilon$ becomes larger, only the most salient explanation factors survive the absorption, resulting in a less noisy explanation.
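
The ε rule is the same computation with $\epsilon$ added to the denominator; a sketch under the same toy setup as the LRP-0 snippet above (sign-matching $\epsilon$ to the denominator is a common implementation detail that keeps the denominator away from zero):

```python
import numpy as np

def lrp_eps(a_lower, w, b, R_upper, eps=0.25):
    """LRP-ε for one dense layer; eps=0 recovers LRP-0."""
    z = a_lower[:, None] * w
    denom = z.sum(axis=0) + b
    # Add eps with the sign of the denominator, so that weak or contradictory
    # contributions are damped (absorbed) rather than amplified.
    denom = denom + eps * np.where(denom >= 0, 1.0, -1.0)
    return a_lower * (w @ (R_upper / denom))
```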

Gamma rule (LRP-γ)

LRP-γ favours the effect of positive contributions over negative contributions:

$$R_j = \sum_k \frac{a_j \left(w_{jk} + \gamma w_{jk}^{+}\right)}{\sum_{0,j} a_j \left(w_{jk} + \gamma w_{jk}^{+}\right)}\, R_k.$$

Here, $w_{jk}^{-} = \min(0, w_{jk})$ and $w_{jk}^{+} = \max(0, w_{jk})$ denote the negative and positive parts of $w_{jk}$, respectively, so that $w_{jk} + \gamma w_{jk}^{+} = w_{jk}^{-} + (1+\gamma)\, w_{jk}^{+}$.

The $\gamma$ term quantifies how much the positive contributions are favoured.

As positive contributions come to dominate, positive and negative relevance scores cannot grow arbitrarily large during propagation, resulting in a more stable explanation.
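
A sketch of the γ rule in the same setting; whether the bias is also transformed varies between implementations, so treating it like the other weights here is an assumption:

```python
import numpy as np

def lrp_gamma(a_lower, w, b, R_upper, gamma=0.25):
    """LRP-γ for one dense layer: weights are replaced by w + gamma * max(0, w)."""
    w_g = w + gamma * np.maximum(0, w)        # favour positive contributions
    b_g = b + gamma * np.maximum(0, b)        # assumption: bias handled the same way
    z = a_lower[:, None] * w_g
    denom = z.sum(axis=0) + b_g               # assumes no denominator is exactly zero
    return a_lower * (w_g @ (R_upper / denom))
```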

Deep Taylor decomposition

The previous rules can be interpreted with Deep Taylor Decomposition (DTD), which views LRP as a succession of Taylor expansions performed locally at each neuron.

Let $\boldsymbol{a}$ denote the vector of lower-level (layer $l$) activations, and let $\widetilde{\boldsymbol{a}}$ be some reference point near $\boldsymbol{a}$. Writing the relevance of neuron $k$ as a function $R_k(\boldsymbol{a})$ of these activations, then

$$R_k(\boldsymbol{a}) = R_k(\widetilde{\boldsymbol{a}}) + \sum_{j} (a_j - \widetilde{a}_j) \left.\frac{\partial R_k}{\partial a_j}\right|_{\widetilde{\boldsymbol{a}}} + \varepsilon,$$

where $\varepsilon$ denotes the error term, which consists of the second- and higher-order terms of the Taylor expansion that are difficult to compute. The first-order terms can be identified with the messages $R_{j \leftarrow k}$ that are redistributed to the lower layer.

Relevance model

By substituting an approximate relevance function $\widehat{R}_k(\boldsymbol{a})$ for $R_k(\boldsymbol{a})$, a closed-form expression can be derived. One popular choice is the modulated ReLU activation:

$$\widehat{R}_k(\boldsymbol{a}) = \max\Big(0, \sum_{0,j} a_j w_{jk}\Big)\, c_k,$$

where $c_k$ is the modulation term set to satisfy $\widehat{R}_k(\boldsymbol{a}) = R_k$ at the actual activations.

Then, for a reference point $\widetilde{\boldsymbol{a}}$ lying in the ReLU's activated domain, the Taylor expansion of $\widehat{R}_k$ can be given as

$$\widehat{R}_k(\boldsymbol{a}) = \widehat{R}_k(\widetilde{\boldsymbol{a}}) + \sum_{0,j} (a_j - \widetilde{a}_j)\, w_{jk}\, c_k.$$

Here, due to the linearity of $\widehat{R}_k$ on the activated domain, $\varepsilon = 0$ (recall that $\varepsilon$ consists of the second- and higher-order terms of the Taylor expansion).

Once the reference point $\widetilde{\boldsymbol{a}}$ is chosen (typically a root point, where $\widehat{R}_k(\widetilde{\boldsymbol{a}}) = 0$), relevance scores can be easily computed because they only consist of first-order terms.
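
A small numerical check of the claim that the error term vanishes (a sketch with made-up numbers; the weights and activations are kept positive so that both $\boldsymbol{a}$ and $\widetilde{\boldsymbol{a}}$ lie in the ReLU's activated domain):

```python
import numpy as np

rng = np.random.default_rng(0)
w = np.abs(rng.normal(size=5))            # weights into a single upper neuron k
b, c_k = 0.1, 0.7                         # bias and modulation term (arbitrary here)

def R_hat(a):
    """Modulated ReLU relevance model for neuron k."""
    return max(0.0, a @ w + b) * c_k

a = np.abs(rng.normal(size=5))            # actual activations (activated domain)
a_ref = 0.5 * a                           # reference point, also in the activated domain

first_order = (a - a_ref) @ w * c_k       # sum_j (a_j - ã_j) * w_jk * c_k
print(np.isclose(R_hat(a), R_hat(a_ref) + first_order))   # True: error term is zero
```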

[Figure: xai-lrp-relevance-model]

References

  • Bach, S., Binder, A., Montavon, G., Klauschen, F., Müller, K. R., & Samek, W. (2015). On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation. PLoS ONE, 10(7), e0130140.
  • Montavon, G., Binder, A., Lapuschkin, S., Samek, W., & Müller, K. R. (2019). Layer-wise relevance propagation: an overview. Explainable AI: Interpreting, Explaining and Visualizing Deep Learning, 193-209.
  • Kim, S. W., Kang, S. H., Kim, S. J., & Lee, S. (2021). Estimating the phase volume fraction of multi-phase steel via unsupervised deep learning. Scientific Reports, 11(1), 5902.
  • Montavon, G., Lapuschkin, S., Binder, A., Samek, W., & Müller, K. R. (2017). Explaining nonlinear classification decisions with deep Taylor decomposition. Pattern Recognition, 65, 211-222.