
Contrastive Explanations #14

Open
TobiasGoerke opened this issue Jul 26, 2019 · 1 comment
Labels: enhancement (New feature or request), help wanted (Extra attention is needed)

Comments

TobiasGoerke (Collaborator) commented Jul 26, 2019

Anchors can explain any model's decision (e.g., the label it predicted). However, the explained label does not have to equal the value the model actually predicted; it can be chosen freely.

So, we can force the model to explain a decision it has not made. This would reveal why it might have classified an instance differently, even though it didn't.
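
For illustration, here is a toy sketch of one way to realize this (none of these names are this library's API; the model and labels are made up): wrap the classifier so it answers "would you predict the contrast label?", and hand that wrapper to the anchors search.

```python
import numpy as np

def model_predict(x):
    """Toy classifier: label 1 if the first feature exceeds 0.5, else 0."""
    return int(x[0] > 0.5)

def as_target_indicator(predict_fn, target_label):
    """Wrap a classifier so it outputs 1 iff it predicts `target_label`.
    Handing this wrapper to an anchors explainer makes the search look for
    conditions under which the model WOULD predict `target_label`, even if
    that is not the label it predicts for the instance at hand."""
    return lambda x: int(predict_fn(x) == target_label)

instance = np.array([0.2, 0.9])
predicted = model_predict(instance)   # -> 0, the factual decision
contrast = 1 - predicted              # -> 1, the decision we want explained

contrast_model = as_target_indicator(model_predict, contrast)
# An anchors explainer given `contrast_model` would now yield a contrastive
# anchor: conditions under which the original model predicts the contrast label.
```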

I'd like to start a discussion about how this information could be used.

Visualization is surely one use case. Showing some sort of matrix for an explanation that displays which features voted for and which voted against the decision would be possible and helpful.
Any more ideas?
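
As a rough illustration of such a matrix (the anchor conditions below are made-up stubs, not output from this library), the factual and contrastive anchors could be tabulated side by side:

```python
# Stubbed anchor rules for the factual and the contrastive label:
factual_rules = {"income > 50k", "age > 30"}      # conditions voting for the prediction
contrastive_rules = {"debt > 10k", "age > 30"}    # conditions voting against it

print(f"{'condition':<14} {'for':>5} {'against':>8}")
for rule in sorted(factual_rules | contrastive_rules):
    print(f"{rule:<14} {str(rule in factual_rules):>5} {str(rule in contrastive_rules):>8}")
```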

TobiasGoerke added the enhancement and help wanted labels on Jul 26, 2019
fkoehne (Contributor) commented Jul 26, 2019

A new aspect for the upcoming master thesis? Basically, I would want both approaches, with an integrated conclusion.
