A Formal Approach to Explainability

Wolf, Lior; Galanti, Tomer; Hazan, Tamir

Computer Science > Machine Learning

arXiv:2001.05207 (cs)

[Submitted on 15 Jan 2020]

Title:A Formal Approach to Explainability

Authors:Lior Wolf, Tomer Galanti, Tamir Hazan

View PDF

Abstract:We regard explanations as a blending of the input sample and the model's output and offer a few definitions that capture various desired properties of the function that generates these explanations. We study the links between these properties and between explanation-generating functions and intermediate representations of learned models and are able to show, for example, that if the activations of a given layer are consistent with an explanation, then so do all other subsequent layers. In addition, we study the intersection and union of explanations as a way to construct new explanations.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2001.05207 [cs.LG]
	(or arXiv:2001.05207v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2001.05207
Journal reference:	Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, January 2019, Pages 255-261

Submission history

From: Tomer Galanti [view email]
[v1] Wed, 15 Jan 2020 10:06:47 UTC (892 KB)

Full-text links:

Access Paper:

view license

Current browse context:

< prev | next >

new | recent | 2020-01

Change to browse by:

cs.LG
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Lior Wolf
Tomer Galanti
Tamir Hazan

export BibTeX citation

Computer Science > Machine Learning

Title:A Formal Approach to Explainability

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Formal Approach to Explainability

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators