Embrace the Gap: VAEs Perform Independent Mechanism Analysis

Reizinger, Patrik; Gresele, Luigi; Brady, Jack; von K�gelgen, Julius; Zietlow, Dominik; Sch�lkopf, Bernhard; Martius, Georg; Brendel, Wieland; Besserve, Michel

Statistics > Machine Learning

arXiv:2206.02416 (stat)

[Submitted on 6 Jun 2022 (v1), last revised 27 Jan 2023 (this version, v3)]

Title:Embrace the Gap: VAEs Perform Independent Mechanism Analysis

Authors:Patrik Reizinger, Luigi Gresele, Jack Brady, Julius von K�gelgen, Dominik Zietlow, Bernhard Sch�lkopf, Georg Martius, Wieland Brendel, Michel Besserve

View PDF

Abstract:Variational autoencoders (VAEs) are a popular framework for modeling complex data distributions; they can be efficiently trained via variational inference by maximizing the evidence lower bound (ELBO), at the expense of a gap to the exact (log-)marginal likelihood. While VAEs are commonly used for representation learning, it is unclear why ELBO maximization would yield useful representations, since unregularized maximum likelihood estimation cannot invert the data-generating process. Yet, VAEs often succeed at this task. We seek to elucidate this apparent paradox by studying nonlinear VAEs in the limit of near-deterministic decoders. We first prove that, in this regime, the optimal encoder approximately inverts the decoder -- a commonly used but unproven conjecture -- which we refer to as {\em self-consistency}. Leveraging self-consistency, we show that the ELBO converges to a regularized log-likelihood. This allows VAEs to perform what has recently been termed independent mechanism analysis (IMA): it adds an inductive bias towards decoders with column-orthogonal Jacobians, which helps recovering the true latent factors. The gap between ELBO and log-likelihood is therefore welcome, since it bears unanticipated benefits for nonlinear representation learning. In experiments on synthetic and image data, we show that VAEs uncover the true latent factors when the data generating process satisfies the IMA assumption.

Comments:	NeurIPS2022 final version
Subjects:	Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2206.02416 [stat.ML]
	(or arXiv:2206.02416v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2206.02416

Submission history

From: Patrik Reizinger [view email]
[v1] Mon, 6 Jun 2022 08:19:19 UTC (790 KB)
[v2] Thu, 27 Oct 2022 17:18:43 UTC (853 KB)
[v3] Fri, 27 Jan 2023 16:56:17 UTC (853 KB)

Statistics > Machine Learning

Title:Embrace the Gap: VAEs Perform Independent Mechanism Analysis

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Embrace the Gap: VAEs Perform Independent Mechanism Analysis

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators