Reconstructing piecewise planar scenes with multi-view regularization

Weijie Xi; Xuejin Chen

doi:10.1007/s41095-019-0159-7

Research Article | Open Access

Weijie Xi¹, Xuejin Chen¹

¹ University of Science and Technology of China, Hefei, 230026, China.

Abstract

Reconstruction of man-made scenes from multi-view images is an important problem in computer vision and computer graphics. Observing that man-made scenes are usually composed of planar surfaces, we encode a planar shape prior when reconstructing them. Recent approaches for single-view reconstruction employ multi-branch neural networks to simultaneously segment planes and recover 3D plane parameters. However, the scale of available annotated data heavily limits the generalizability and accuracy of these supervised methods. In this paper, we propose multi-view regularization to enhance the capability of piecewise planar reconstruction during the training phase, without demanding extra annotated data. Our multi-view regularization enforces consistency among multiple views by making the feature embedding more robust against view changes and lighting variations. Thus, a neural network trained with multi-view regularization performs better on a wide range of views and lighting conditions in the test phase. Based on the more consistent prediction results, we merge the recovered models from multiple views to reconstruct scenes. Our approach achieves state-of-the-art reconstruction performance compared to previous approaches on the public ScanNet dataset.
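
The idea of making the feature embedding consistent across views can be illustrated with a small sketch. This is not the loss defined in the paper, which the abstract does not specify; it is one plausible form, assuming that pixel correspondences between two views are already known (for example from depth and camera poses). The function name `consistency_loss`, the dictionary layout, and the squared-distance penalty are all hypothetical choices made for illustration.

```python
def consistency_loss(emb_a, emb_b, matches):
    """Mean squared distance between embeddings of matched pixels.

    Illustrative only: an assumed form of a multi-view consistency
    penalty, not the formulation used by the authors.

    emb_a, emb_b: dict mapping (row, col) -> list of floats, the
                  per-pixel embedding predicted for each view.
    matches:      list of ((row_a, col_a), (row_b, col_b)) pairs that
                  are assumed to observe the same 3D surface point.
    """
    if not matches:
        return 0.0
    total = 0.0
    for pixel_a, pixel_b in matches:
        vec_a, vec_b = emb_a[pixel_a], emb_b[pixel_b]
        # Squared Euclidean distance between the two embeddings.
        total += sum((x - y) ** 2 for x, y in zip(vec_a, vec_b))
    return total / len(matches)


# Toy example: two matched pixel pairs with 2-D embeddings.
emb_a = {(0, 0): [1.0, 0.0], (0, 1): [0.0, 1.0]}
emb_b = {(0, 1): [1.0, 0.0], (0, 2): [0.0, 0.0]}
matches = [((0, 0), (0, 1)), ((0, 1), (0, 2))]
loss = consistency_loss(emb_a, emb_b, matches)
```

A penalty of this kind needs only the correspondences, not plane annotations, which is in line with the abstract's statement that the regularization requires no extra annotated data.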

Keywords

scene modeling; multi-view; regularization; neural network

Computational Visual Media

Volume 5, Issue 4, December 2019

Pages 337-345

DOI: 10.1007/s41095-019-0159-7

Cite this article:

Xi W, Chen X. Reconstructing piecewise planar scenes with multi-view regularization. Computational Visual Media, 2019, 5(4): 337-345. https://doi.org/10.1007/s41095-019-0159-7

Revised: 17 December 2019

Accepted: 24 December 2019

Published: 17 January 2020

This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Other papers from this open access journal are available free of charge from http://www.springer.com/journal/41095. To submit a manuscript, please go to https://www.editorialmanager.com/cvmj.