liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
LatentKeypointGAN: Controlling Images via Latent Keypoints
University of British Columbia,Computer Science Department,Vancouver,Canada.
University of British Columbia,Computer Science Department,Vancouver,Canada.
University of British Columbia,Computer Science Department,Vancouver,Canada.
2023 (English)In: 2023 20th Conference on Robots and Vision (CRV), Institute of Electrical and Electronics Engineers (IEEE), 2023Conference paper, Published paper (Refereed)
Abstract [en]

Generative adversarial networks (GANs) can now generate photorealistic images. However, how to best control the image content remains an open challenge. We introduce LatentKeypointGAN, a two-stage GAN internally conditioned on a set of keypoints and associated appearance embeddings providing control of the position and style of the generated objects and their respective parts. A major difficulty that we address is disentangling the image into spatial and appearance factors with little domain knowledge and supervision signals. We demonstrate in a user study and quantitative experiments that LatentKeypointGAN provides an interpretable latent space that can be used to re-arrange the generated images by re-positioning, adding, removing, and exchanging keypoint embeddings, such as generating portraits by combining the eyes, and mouth from different images. Notably, our method does not require labels as it is self-supervised and thereby applies to diverse application domains, such as editing portraits, indoor rooms, and full-body human poses. In addition, the explicit generation of keypoints and matching images enables a new, GAN-based method for unsupervised keypoint detection.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2023.
Series
Canadian Conference on Computer and Robot Vision, CRV
National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:liu:diva-209827DOI: 10.1109/crv60082.2023.00009ISBN: 9798350341393 (electronic)ISBN: 9798350341409 (print)OAI: oai:DiVA.org:liu-209827DiVA, id: diva2:1913282
Conference
2023 20th Conference on Robots and Vision (CRV), Montreal, QC, Canada, 06-08 June 2023
Available from: 2024-11-14 Created: 2024-11-14 Last updated: 2025-02-07

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Authority records

Wandt, Bastian

Search in DiVA

By author/editor
Wandt, Bastian
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 58 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf