Scholar - SciOpen

| Sign up

Follow this author

Lu Yuan

Downloads: 0 Citations: 0 Articles: 1

Publication Fields

Computer Science

Publications

Article type

Year

Co-author

Sort：

Published

Cited

Download

Open Access Research Article Issue

Multi3D: 3D-aware multimodal image synthesis

Wenyang Zhou, Lu Yuan, Taijiang Mu

Computational Visual Media 2024, 10(6): 1205-1217

Published: 03 April 2024

Abstract

PDF (6.4 MB) Collect Collected

Downloads：7

3D-aware image synthesis has attained high quality and robust 3D consistency. Existing 3D controllable generative models are designed to synthesize 3D-aware images through a single modality, such as 2D segmentation or sketches, but lack the ability to finely control generated content, such as texture and age. In pursuit of enhancing user-guided controllability, we propose Multi3D, a 3D-aware controllable image synthesis model that supports multi-modal input. Our model can govern the geometry of the generated image using a 2D label map, such as a segmentation or sketch map, while concurrently regulating the appearance of the generated image through a textual description. To demonstrate the effectiveness of our method, we have conducted experiments on multiple datasets, including CelebAMask-HQ, AFHQ-cat, and shapenet-car. Qualitative and quantitative evaluations show that our method outperforms existing state-of-the-art methods.

Total 1

<1/11>GOpage