Learning how to explain neural networks: PatternNet and PatternAttribution

Pieter-jan Kindermans

Kristof T. Schütt

Maximilian Alber

Klaus-Robet Müller

Dumitru Erhan

Been Kim

Sven Dähne

ICLR (2018)

Download Google Scholar

Abstract

DeConvNet, Guided BackProp, LRP, were invented to better understand deep neural networks. We show that these methods do not produce the theoretically correct explanation for a linear model. Yet they are used on multi-layer networks with millions of parameters. This is a cause for concern since linear models are simple neural networks. We argue that explanation methods for neural nets should work reliably in the limit of simplicity, the linear models. Based on our analysis of linear models we propose a generalization that yields two explanation techniques (PatternNet and PatternAttribution) that are theoretically sound for linear models and produce improved explanations for deep networks.

Research Areas

Machine Translation
Responsible AI

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Learning how to explain neural networks: PatternNet and PatternAttribution

Abstract

Research Areas

Learn more about how we conduct our research

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Learning how to explain neural networks: PatternNet and PatternAttribution

Abstract

Research Areas

Learn more about how we conduct our research

AI/ML Foundations  & Capabilities