Skip to main navigation Skip to search Skip to main content

A Geometric Understanding of Deep Learning

  • Na Lei
  • , Dongsheng An
  • , Yang Guo
  • , Kehua Su
  • , Shixia Liu
  • , Zhongxuan Luo
  • , Shing Tung Yau
  • , Xianfeng Gu
  • Dalian University of Technology
  • Stony Brook University
  • Wuhan University
  • Tsinghua University
  • Harvard University

Research output: Contribution to journalArticlepeer-review

117 Scopus citations

Abstract

This work introduces an optimal transportation (OT) view of generative adversarial networks (GANs). Natural datasets have intrinsic patterns, which can be summarized as the manifold distribution principle: the distribution of a class of data is close to a low-dimensional manifold. GANs mainly accomplish two tasks: manifold learning and probability distribution transformation. The latter can be carried out using the classical OT method. From the OT perspective, the generator computes the OT map, while the discriminator computes the Wasserstein distance between the generated data distribution and the real data distribution; both can be reduced to a convex geometric optimization process. Furthermore, OT theory discovers the intrinsic collaborative—instead of competitive—relation between the generator and the discriminator, and the fundamental reason for mode collapse. We also propose a novel generative model, which uses an autoencoder (AE) for manifold learning and OT map for probability distribution transformation. This AE–OT model improves the theoretical rigor and transparency, as well as the computational stability and efficiency; in particular, it eliminates the mode collapse. The experimental results validate our hypothesis, and demonstrate the advantages of our proposed model.

Original languageEnglish
Pages (from-to)361-374
Number of pages14
JournalEngineering
Volume6
Issue number3
DOIs
StatePublished - Mar 2020

Keywords

  • Adversarial
  • Deep learning
  • Generative
  • Mode collapse
  • Optimal transportation

Fingerprint

Dive into the research topics of 'A Geometric Understanding of Deep Learning'. Together they form a unique fingerprint.

Cite this