Neighborhood co-occurrence modeling in 3D point cloud segmentation

Jingyu Gong; Zhou Ye; Lizhuang Ma

doi:10.1007/s41095-021-0244-6

| Sign up

PDF (4.5 MB)

Cite

EndNote(RIS) BibTeX

Collect

Submit Manuscript

Research Article | Open Access

Neighborhood co-occurrence modeling in 3D point cloud segmentation

Jingyu Gong^¹, Zhou Ye^², Lizhuang Ma^{¹^,³}()

1Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai 200240, China

2Shanghai CLS Fintech Co., LTD, Shanghai 200030, China

3MoE Key Lab of Artificial Intelligence, Shanghai Jiao Tong University, Shanghai 200240, China

Show Author Information

Graphical Abstract

View original image Download original image

Abstract

A significant performance boost has been achieved in point cloud semantic segmentation by utilization of the encoder-decoder architecture and novel convolution operations for point clouds. However, co-occurrence relationships within a local region which can directly influence segmentation results are usually ignored by current works. In this paper, we propose a neighborhood co-occurrence matrix (NCM) to model local co-occurrence relationships in a point cloud. Wegenerate target NCM and prediction NCM fromsemantic labels and a prediction map respectively. Then,Kullback-Leibler (KL) divergence is used to maximize the similarity between the target and prediction NCMs to learn the co-occurrence relationship. Moreover, for large scenes where the NCMs for a sampled point cloud and the whole scene differ greatly, we introduce a reverse form of KL divergence which can better handle the difference to supervise the prediction NCMs. We integrate our method into an existing backbone and conduct comprehensive experiments on three datasets: Semantic3D for outdoor space segmentation, and S3DIS and ScanNet v2 for indoor scene segmentation. Results indicate that our method can significantly improve upon the backbone and outperform many leading competitors.

Keywords

3D vision point cloud co-occurrence relationmodeling semantic segmentation

References

[1]

Verdoja,

; Thomas,

; Sugimoto,

Fast 3D point cloud segmentation using supervoxels with geometry and color for 3D scene understanding. In: Proceedings of the IEEE International Conference on Multimedia and Expo, 1285-1290, 2017.

Crossref

[2]

Xu,

J. C.

; Gong,

J. Y.

; Zhou,

; Tan,

; Xie,

; Ma,

L. Z.

SceneEncoder: Scene-aware semantic segmentation of point clouds with a learnable scene descriptor. In: Proceedings of the 29th International Joint Conference on Artificial Intelligence, 601-607, 2020.

[3]

Charles,

R. Q.

; Hao,

; Mo,

K. C.

; Guibas,

L. J.

PointNet: Deep learning on point sets for 3D classification and segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 77-85, 2017.

Crossref

[4]

Wu,

W. X.

; Qi,

; Fuxin,

PointConv: Deep convolutional networks on 3D point clouds. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 9613-9622, 2019.

[5]

Thomas,

; Qi,

C. R.

; Deschaud,

J. E.

; Marcotegui,

; Goulette,

; Guibas,

KPConv: Flexible and deformable convolution for point clouds. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, 6410-6419, 2019.

Crossref

[6]

Hu,

S.-M.

; Cai,

J.-X.

; Lai,

Y.-K.

Semantic labeling and instance segmentation of 3D point clouds using patch context analysis and multiscale processing. IEEE Transactions on Visualization and Computer Graphics Vol. 26, No. 7, 2485-2498, 2020.

Crossref Google Scholar

[7]

Qi,

C. R.

; Yi,

; Su,

; Guibas,

L. J.

PointNet++: Deep hierarchical feature learning on point sets in a metric space. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, 5105-5114, 2017.

[8]

Wang,

; Huang,

Y. C.

; Hou,

Y. L.

; Zhang,

S. M.

; Shan,

Graph attention convolution for point cloud semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 10288-10297, 2019.

Crossref

[9]

Hu,

Q. Y.

; Yang,

; Xie,

L. H.

; Rosa,

; Guo,

Y. L.

; Wang,

Z. H.

; Trigoni,

; Markham,

RandLA-net: Efficient semantic segmentation of large-scale point clouds. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 11105-11114, 2020.

[10]

Zhao,

; Tao,

W. B.

JSNet: Joint instance and semantic segmentation of 3D point clouds. Proceedings of the AAAI Conference on Artificial Intelligence Vol. 34, No. 7, 12951-12958, 2020.

Crossref Google Scholar

[11]

Pham,

Q. H.

; Nguyen,

; Hua,

B. S.

; Roig,

; Yeung,

S. K.

JSIS3D: Joint semantic-instance segmentation of 3D point clouds with multi-task pointwise networks and multi-value conditional random fields. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 8819-8828, 2019.

Crossref

[12]

Hu,

; Zhen,

; Bai,

; Fu,

; Tai,

JSENet: Joint semantic segmentation and edge detection network for 3D point clouds. In: Computer Vision-ECCV 2020. Lecture Notes in Computer Science, Vol. 12365. Vedaldi,

; Bischof,

; Brox,

; Frahm,

J. M.

Eds. Springer Cham, 222-239, 2020.

[13]

Gong,

; Xu,

; Tan,

; Zhou,

; Qu,

; Xie,

; Ma,

Boundary-aware geometric encoding for semantic segmentation of point clouds. In: Proceedings of the AAAI Conference on Artificial Intelligence, 2021.

[14]

Zhang,

J. Z.

; Zhu,

C. Y.

; Zheng,

L. T.

; Xu,

Fusion-aware point convolution for online semantic 3D scene segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 4533-4542, 2020.

Crossref

[15]

Mattausch,

; Panozzo,

; Mura,

; Sorkine-Hornung,

; Pajarola,

Object detection and classification from large-scale cluttered indoor scans. Computer Graphics Forum Vol. 33, No. 2, 11-21, 2014.

Crossref Google Scholar

[16]

Mottaghi,

; Chen,

X. J.

; Liu,

X. B.

; Cho,

N. G.

; Lee,

S. W.

; Fidler,

; Urtasun,

; Yuille,

The role of context for object detection and semantic segmentation in the wild. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 891-898, 2014.

Crossref

[17]

Zhao,

; Wang,

; Yang,

; Cai,

Region mutual information loss for semantic segmentation. In: Proceedings of the 33rd Conference on Neural Information Processing Systems, 11117-11127, 2019.

[18]

Wu,

B. C.

; Wan,

; Yue,

X. Y.

; Keutzer,

SqueezeSeg: Convolutional neural nets with recurrent CRF for real-time road-object segmentation from 3D LiDAR point cloud. In: Proceedings of the IEEE International Conference on Robotics and Automation, 1887-1893, 2018.

[19]

Ye,

X. Q.

; Li,

J. M.

; Huang,

H. X.

; Du,

; Zhang,

X. L.

3D recurrent neural networks with context fusion for point cloud semantic segmentation. In: Computer Vision-ECCV 2018. Lecture Notes in Computer Science, Vol. 11211. Ferrari,

; Hebert,

; Sminchisescu,

; Weiss,

Eds. Springer Cham, 415-430, 2018.

[20]

Jiang,

; Zhao,

H. S.

; Liu,

; Shen,

X. Y.

; Fu,

C. W.

; Jia,

J. Y.

Hierarchical point-edge interaction network for point cloud semantic segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, 10432-10440, 2019.

Crossref

[21]

Zhang,

; Zhang,

; Wang,

C. G.

; Xie,

J. Y.

Co-occurrent features in semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 548-557, 2019.

Crossref

[22]

Deng,

; Todorovic,

; Latecki,

L. J.

Semantic segmentation of RGBD images with mutex constraints. In: Proceedings of the IEEE International Conference on Computer Vision, 1733-1741, 2015.

Crossref

[23]

Koppula,

H. S.

; Anand,

; Joachims,

; Saxena,

Semantic labeling of 3D point clouds for indoor scenes. In: Proceedings of the 24th International Conference on Neural Information Processing Systems, 244-252, 2011.

[24]

Zhao,

; Liu,

; Li,

B. F.

; Du,

X. Y.

Ngram2vec: Learning improved word representations from ngram co-occurrence statistics. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, 244-253, 2017.

Crossref

[25]

Wang,

Y. S.

; Ma,

X. J.

; Chen,

Z. Y.

; Luo,

; Yi,

J. F.

; Bailey,

Symmetric cross entropy for robust learning with noisy labels. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, 322-330, 2019.

Crossref

[26]

Hackel,

; Savinov,

; Ladicky,

; Wegner,

J. D.

; Schindler,

; Pollefeys,

Semantic3D.net: A new large-scale point cloud classification benchmark. arXiv preprint arXiv:1704.03847, 2017.

Crossref Google Scholar

[27]

Armeni,

; Sener,

; Zamir,

A. R.

; Jiang,

; Brilakis,

; Fischer,

; Savarese,

3D semantic parsing of large-scale indoor spaces. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1534-1543, 2016.

Crossref

[28]

Dai,

; Chang,

A. X.

; Savva,

; Halber,

; Funkhouser,

; Nießner,

ScanNet: Richly-annotated 3D reconstructions of indoor scenes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2432-2443, 2017.

Crossref

[29]

Gong,

; Xu,

; Tan,

; Song,

; Qu,

; Xie,

; Ma,

Omni-supervised point cloud segmentation via gradual receptive field component reasoning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 11673-11682, 2021.

Crossref

[30]

Tchapmi,

; Choy,

; Armeni,

; Gwak,

; Savarese,

SEGCloud: Semantic segmentation of 3D point clouds. In: Proceedings of the International Conference on 3D Vision, 537-547, 2017.

Crossref

[31]

Thomas,

; Goulette,

; Deschaud,

J. E.

; Marcotegui,

; LeGall,

Semantic classification of 3D point clouds with multiscale spherical neighborhoods. In: Proceedings of the International Conference on 3D Vision, 390-398, 2018.

Crossref

[32]

Landrieu,

; Simonovsky,

Large-scale point cloud semantic segmentation with superpoint graphs. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 4558-4567, 2018.

Crossref

[33]

Zhang,

Z. Y.

; Hua,

B. S.

; Yeung,

S. K.

ShellNet: Efficient point cloud convolutional neural networks using concentric shells statistics. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, 1607-1616, 2019.

Crossref

[34]

Khan,

S. A.

; Shi,

Y. L.

; Shahzad,

; Zhu,

X. X.

FGCN: Deep feature-based graph convolutional network for semantic segmentation of urban 3D point clouds. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 778-787, 2020.

Crossref

[35]

Ma,

Y. N.

; Guo,

Y. L.

; Liu,

; Lei,

Y. J.

; Wen,

G. J.

Global context reasoning for semantic segmentation of 3D point clouds. In: Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2920-2929, 2020.

[36]

Lei,

; Akhtar,

; Mian,

SegGCN: Efficient 3D point cloud segmentation with fuzzy spherical kernel. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 11608-11617, 2020.

Crossref

[37]

Huang,

Q. G.

; Wang,

W. Y.

; Neumann,

Recurrent slice networks for 3D segmentation of point clouds. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2626-2635, 2018.

Crossref

[38]

Lin,

Y. Q.

; Yan,

Z. Z.

; Huang,

H. B.

; Du,

; Liu,

L. G.

; Cui,

S. G.

; Han,

FPConv: Learning local flattening for point convolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 4292-4301, 2020.

Crossref

[39]

Han,

W. K.

; Wen,

C. L.

; Wang,

; Li,

Point2Node: Correlation learning of dynamic-node for point cloud feature modeling. Proceedings of the AAAI Conference on Artificial Intelligence Vol. 34, No. 7, 10925-10932, 2020.

Crossref Google Scholar

[40]

Schult,

; Engelmann,

; Kontogianni,

; Leibe,

DualConvMesh-net: Joint geodesic and Euclidean convolutions on 3D meshes. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 8609-8619, 2020.

Crossref

[41]

Zhang,

F. H.

; Fang,

; Wah,

; Torr,

Deep FusionNet for point cloud semantic segmentation. In: Computer Vision-ECCV 2020. Lecture Notes in Computer Science, Vol. 12369. Vedaldi,

; Bischof,

; Brox,

; Frahm,

J. M.

Eds. Springer Cham, 644-663, 2020.

[42]

Li,

Y. Y.

; Bu,

; Sun,

M. C.

; Wu,

; Di,

X. H.

; Chen,

B. Q.

PointCNN: Convolution on

X

-transformed points. In: Proceedings of the 32nd International Conference on Neural Information Processing Systems, 828-838, 2018.

[43]

Huang,

J. W.

; Zhang,

H. T.

; Yi,

; Funkhouser,

; Nießner,

; Guibas,

L. J.

TextureNet: Consistent local parametrizations for learning from high-resolution signals on meshes. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 4435-4444, 2019.

Crossref

[44]

Lei,

; Akhtar,

; Mian,

Spherical kernel for efficient graph convolution on 3D point clouds. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 43, No. 10, 3664-3680, 2021.

Crossref Google Scholar

[45]

Yan,

; Zheng,

C. D.

; Li,

; Wang,

; Cui,

S. G.

PointASNL: Robust point clouds processing using nonlocal neural networks with adaptive sampling. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 5588-5597, 2020.

Crossref

Computational Visual Media

Volume 8 Issue 2,
June 2022

Pages 303-315

DOI: 10.1007/s41095-021-0244-6

Cite this article:

Gong J, Ye Z, Ma L. Neighborhood co-occurrence modeling in 3D point cloud segmentation. Computational Visual Media, 2022, 8(2): 303-315. https://doi.org/10.1007/s41095-021-0244-6