LatentKeypointGAN: Controlling Images via Latent Keypoints -- Extended Abstract

He, Xingzhe; Wandt, Bastian; Rhodin, Helge

Computer Science > Computer Vision and Pattern Recognition

arXiv:2205.03448 (cs)

[Submitted on 6 May 2022 (v1), last revised 17 May 2022 (this version, v2)]

Title:LatentKeypointGAN: Controlling Images via Latent Keypoints -- Extended Abstract

Authors:Xingzhe He, Bastian Wandt, Helge Rhodin

View PDF

Abstract:Generative adversarial networks (GANs) can now generate photo-realistic images. However, how to best control the image content remains an open challenge. We introduce LatentKeypointGAN, a two-stage GAN internally conditioned on a set of keypoints and associated appearance embeddings providing control of the position and style of the generated objects and their respective parts. A major difficulty that we address is disentangling the image into spatial and appearance factors with little domain knowledge and supervision signals. We demonstrate in a user study and quantitative experiments that LatentKeypointGAN provides an interpretable latent space that can be used to re-arrange the generated images by re-positioning and exchanging keypoint embeddings, such as generating portraits by combining the eyes, and mouth from different images. Notably, our method does not require labels as it is self-supervised and thereby applies to diverse application domains, such as editing portraits, indoor rooms, and full-body human poses.

Comments:	arXiv admin note: substantial text overlap with arXiv:2103.15812
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2205.03448 [cs.CV]
	(or arXiv:2205.03448v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2205.03448
Journal reference:	CVPR Workshop 2022

Submission history

From: Xingzhe He [view email]
[v1] Fri, 6 May 2022 19:00:07 UTC (32,535 KB)
[v2] Tue, 17 May 2022 18:53:20 UTC (32,536 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:LatentKeypointGAN: Controlling Images via Latent Keypoints -- Extended Abstract

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:LatentKeypointGAN: Controlling Images via Latent Keypoints -- Extended Abstract

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators