Research Article | Open Access

Shape embedding and retrieval in multi-flow deformation

School of Software, Tsinghua University, Beijing, China
Huawei Technologies, Shenzhen, China

Abstract

We propose a unified 3D flow framework for jointly learning shape embedding and deformation across different categories. Our goal is to recover shapes from imperfect point clouds by fitting the best shape template in a shape repository after deformation. Accordingly, we learn a shape embedding for template retrieval and a flow-based network for robust deformation. We note that the deformation flow can differ substantially between shape categories. We therefore introduce a novel multi-hub module that learns multiple modes of deformation to capture such variation, yielding a network that can handle a wide range of objects from different categories. The shape embedding is designed to retrieve the best-fit template as the nearest neighbor in a latent space. We replace the standard fully connected layer in the embedding with a tiny structure that significantly reduces network complexity and further improves deformation quality. Qualitative and quantitative comparisons show that our method outperforms existing state-of-the-art methods. Finally, our method provides efficient and flexible deformation that can further be used for novel shape design.
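The pipeline the abstract describes (retrieve the best-fit template as a nearest neighbor in latent space, then deform it by integrating a flow) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the embeddings, the Euler integrator, and the toy velocity field are all assumptions standing in for the learned networks.

```python
import numpy as np

def retrieve_template(query_emb, template_embs):
    """Nearest-neighbor template retrieval in the latent embedding space."""
    dists = np.linalg.norm(template_embs - query_emb, axis=1)
    return int(np.argmin(dists))

def deform(points, velocity_field, steps=10, dt=0.1):
    """Deform points by Euler integration of a flow field: x <- x + v(x) * dt."""
    for _ in range(steps):
        points = points + velocity_field(points) * dt
    return points

# Toy latent space: four templates embedded in 3-D; the query lies
# closest to template 3, so that template would be retrieved.
templates = np.array([[1.0, 0.0, 0.0],
                      [0.0, 1.0, 0.0],
                      [0.0, 0.0, 1.0],
                      [0.9, 0.1, 0.0]])
best = retrieve_template(np.array([0.92, 0.08, 0.0]), templates)

# Toy flow: uniform translation along x. In the paper's setting a network
# predicts v(x), with the multi-hub module supplying category-specific modes.
v = lambda p: np.tile([1.0, 0.0, 0.0], (p.shape[0], 1))
deformed = deform(np.zeros((4, 3)), v)
```

In the actual method both components are learned jointly, so the latent distance used for retrieval reflects how well a template fits after deformation, not just static shape similarity.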

Cite this article:
Leng B, Huang J, Shen G, et al. Shape embedding and retrieval in multi-flow deformation. Computational Visual Media, 2024, 10(3): 439-451. https://doi.org/10.1007/s41095-022-0315-3


Received: 20 June 2022
Accepted: 26 September 2022
Published: 08 February 2024
© The Author(s) 2023.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Other papers from this open access journal are available free of charge from http://www.springer.com/journal/41095. To submit a manuscript, please go to https://www.editorialmanager.com/cvmj.
