Disentangling geometry and appearance with regularised geometry-aware generative adversarial networks
File(s)Tran2019_Article_DisentanglingGeometryAndAppear.pdf (7.96 MB)
Published version
Author(s)
Tran, L
Kossaifi, J
Panagakis, Y
Pantic, M
Type
Journal Article
Abstract
Deep generative models have significantly advanced image generation, enabling generation of visually pleasing images with realistic texture. Apart from the texture, it is the shape geometry of objects that strongly dictates their appearance. However, currently available generative models do not incorporate geometric information into the image generation process. This often yields visual objects of degenerated quality. In this work, we propose a regularized Geometry-Aware Generative Adversarial Network (GAGAN) which disentangles appearance and shape in the latent space. This regularized GAGAN enables the generation of images with both realistic texture and shape. Specifically, we condition the generator on a statistical shape prior. The prior is enforced through mapping the generated images onto a canonical coordinate frame using a differentiable geometric transformation. In addition to incorporating geometric information, this constrains the search space and increases the model’s robustness. We show that our approach is versatile, able to generalise across domains (faces, sketches, hands and cats) and sample sizes (from as little as ∼200-30,000 to more than 200, 000). We demonstrate superior performance through extensive quantitative and qualitative experiments in a variety of tasks and settings. Finally, we leverage our model to automatically and accurately detect errors or drifting in facial landmarks detection and tracking in-the-wild.
Date Issued
2019-06-01
Date Acceptance
2019-01-29
Citation
International Journal of Computer Vision, 2019, 127 (6-7), pp.824-844
ISSN
0920-5691
Publisher
Springer Verlag
Start Page
824
End Page
844
Journal / Book Title
International Journal of Computer Vision
Volume
127
Issue
6-7
Copyright Statement
© The Author(s) 2019. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Sponsor
Commission of the European Communities
Grant Number
688835
Subjects
Science & Technology
Technology
Computer Science, Artificial Intelligence
Computer Science
Generative adversarial network
Image generation
Active shape model
Disentanglement
Representation learning
Face analysis
Deep learning
Generative models
GAN
MODELS
Artificial Intelligence & Image Processing
0801 Artificial Intelligence and Image Processing
Publication Status
Published
Date Publish Online
2019-03-02