Regular Paper

OAAFormer: Robust and Efficient Point Cloud Registration Through Overlapping-Aware Attention in Transformer

School of Computer Science and Technology, Shandong University, Qingdao 266237, China
School of Information and Technology, Qingdao University of Science and Technology, Qingdao 266061, China
College of Engineering, Texas A&M University, College Station, TX 77843, U.S.A.

Abstract

In the domain of point cloud registration, the coarse-to-fine feature matching paradigm has received significant attention due to its impressive performance. This paradigm involves a two-step process: first, the extraction of multi-level features, and subsequently, the propagation of correspondences from coarse to fine levels. However, this approach faces two notable limitations. First, the Dual Softmax operation may enforce one-to-one correspondences between superpoints, inadvertently excluding other valuable correspondences. Second, it is crucial to closely examine the overlapping areas between point clouds, as only correspondences within these regions decisively determine the actual transformation. Considering these issues, we propose OAAFormer to enhance correspondence quality. On the one hand, we introduce a soft matching mechanism to facilitate the propagation of potentially valuable correspondences from coarse to fine levels. On the other hand, we integrate an overlapping region detection module to minimize mismatches to the greatest extent possible. Furthermore, we introduce a region-wise attention module with linear complexity during the fine-level matching phase, designed to enhance the discriminative capabilities of the extracted features. Tests on the challenging 3DLoMatch benchmark demonstrate that our approach leads to a substantial increase of about 7% in the inlier ratio, as well as an enhancement of 2%–4% in registration recall. Finally, to accelerate the prediction process, we replace the conventional Random Sample Consensus (RANSAC) algorithm with the selection of a limited yet representative set of high-confidence correspondences, resulting in a 100-fold speedup while still maintaining comparable registration performance.
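To make the first limitation concrete: given a superpoint similarity matrix, the Dual Softmax operation multiplies row-wise and column-wise softmax scores, which sharpens the matrix toward mutual-best (one-to-one) assignments, whereas a soft matching scheme keeps several candidates per superpoint so that borderline correspondences survive to the fine level. The following is a minimal NumPy sketch of this contrast under stated assumptions; the function names and top-k formulation are illustrative, not the paper's actual implementation.

```python
import numpy as np

def dual_softmax(scores):
    # Dual Softmax: element-wise product of row-wise and column-wise
    # softmax, which sharpens scores toward one-to-one assignments.
    row = np.exp(scores) / np.exp(scores).sum(axis=1, keepdims=True)
    col = np.exp(scores) / np.exp(scores).sum(axis=0, keepdims=True)
    return row * col

def mutual_argmax_matches(conf):
    # Hard one-to-one correspondences: (i, j) is kept only when i and j
    # are each other's best match. Near-ties are discarded, which is
    # how potentially valuable correspondences get excluded.
    matches = []
    for i in range(conf.shape[0]):
        j = int(conf[i].argmax())
        if int(conf[:, j].argmax()) == i:
            matches.append((i, j))
    return matches

def soft_topk_matches(conf, k=2):
    # Soft matching (illustrative): keep the top-k candidates per
    # superpoint so borderline correspondences can still propagate
    # to the fine-level matching stage.
    matches = []
    for i in range(conf.shape[0]):
        for j in np.argsort(conf[i])[::-1][:k]:
            matches.append((i, int(j)))
    return matches
```

With a score matrix containing a near-tie, the mutual-argmax rule returns at most one match per row, while the top-k rule retains the runner-up as well.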
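On the final point: once a small, representative set of high-confidence correspondences is available, a rigid transformation can be recovered in closed form rather than by RANSAC's hypothesize-and-verify sampling, which is where the speedup comes from. A standard closed-form tool for this is the Kabsch (SVD-based) rigid alignment; the sketch below is a generic illustration of that idea under this assumption, not the paper's actual estimator.

```python
import numpy as np

def kabsch(P, Q):
    # Closed-form rigid transform (R, t) aligning correspondences
    # P[i] -> Q[i], via SVD of the centered cross-covariance matrix
    # (Kabsch algorithm). No hypothesis sampling is needed.
    cp, cq = P.mean(axis=0), Q.mean(axis=0)
    H = (P - cp).T @ (Q - cq)
    U, _, Vt = np.linalg.svd(H)
    # Guard against a reflection (det = -1) in the recovered rotation.
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = cq - R @ cp
    return R, t
```

Given noise-free correspondences this recovers the ground-truth rotation and translation exactly; with a few high-confidence inliers it serves as a fast drop-in for the RANSAC estimation step.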

Electronic Supplementary Material

Download File(s)
JCST-2402-14165-Highlights.pdf (858.2 KB)

Journal of Computer Science and Technology
Pages 755-770
Cite this article:
Gao J-J, Dong Q-J, Wang R-A, et al. OAAFormer: Robust and Efficient Point Cloud Registration Through Overlapping-Aware Attention in Transformer. Journal of Computer Science and Technology, 2024, 39(4): 755-770. https://doi.org/10.1007/s11390-024-4165-6


Received: 01 February 2024
Accepted: 13 June 2024
Published: 20 September 2024
© Institute of Computing Technology, Chinese Academy of Sciences 2024