Research Article | Open Access

6DOF pose estimation of a 3D rigid object based on edge-enhanced point pair features

College of Computing, National University of Defense Technology, Changsha 410073, China
Department of Spine Surgery, the Second Xiangya Hospital, Central South University, Changsha 410011, China
Clinical Nursing Teaching and Research Section, the Second Xiangya Hospital, Changsha 410011, China
College of Meteorology and Oceanography, National University of Defense Technology, Changsha 410073, China
Beijing Institute of Tracking and Communication Technology, Beijing 100094, China
Abstract

The point pair feature (PPF) is widely used for 6D pose estimation. In this paper, we propose an efficient 6D pose estimation method based on the PPF framework. We introduce a well-targeted down-sampling strategy that focuses on edge areas, enabling efficient feature extraction for objects with complex geometry. We also propose a pose hypothesis validation approach that resolves the ambiguity caused by symmetry by computing an edge matching degree. Evaluations on two challenging datasets and one real-world collected dataset demonstrate the superiority of our method in estimating the poses of geometrically complex, occluded, symmetrical objects. We further validate the method by applying it to simulated punctures.
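For context, the PPF framework referred to above builds on the classic four-dimensional point pair feature introduced by Drost et al.: for each pair of oriented points, the feature encodes the point distance and three angles, and quantized features are stored in a hash table for voting. The sketch below is a minimal illustration of that generic feature under stated assumptions; it is not the authors' edge-enhanced variant, and the function names and bin sizes are illustrative choices rather than values from the paper.

```python
import numpy as np

def point_pair_feature(p1, n1, p2, n2):
    """Classic 4D PPF: F = (||d||, angle(n1, d), angle(n2, d), angle(n1, n2)).

    p1, p2 are 3D points; n1, n2 are their unit surface normals.
    """
    d = p2 - p1
    dist = np.linalg.norm(d)
    if dist < 1e-9:
        return None                     # degenerate pair, skip
    d_hat = d / dist

    def angle(a, b):
        # Numerically safe angle between two unit vectors.
        return np.arccos(np.clip(np.dot(a, b), -1.0, 1.0))

    return np.array([dist, angle(n1, d_hat), angle(n2, d_hat), angle(n1, n2)])

def quantize_ppf(f, dist_step=0.01, angle_step=np.deg2rad(12)):
    """Discretize a feature so that similar pairs hash to the same key.

    The step sizes here are assumed for illustration, not taken from the paper.
    """
    return (int(f[0] / dist_step),
            int(f[1] / angle_step),
            int(f[2] / angle_step),
            int(f[3] / angle_step))
```

In a typical PPF pipeline, features from model point pairs are inserted into a hash table offline; at runtime, scene pair features look up matching model pairs and vote for pose hypotheses. The paper's contributions concern which scene points are sampled (edge-focused down-sampling) and how the resulting hypotheses are validated (edge matching degree).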

Computational Visual Media, Pages 61–77
Cite this article:
Liu C, Chen F, Deng L, et al. 6DOF pose estimation of a 3D rigid object based on edge-enhanced point pair features. Computational Visual Media, 2024, 10(1): 61-77. https://doi.org/10.1007/s41095-022-0308-2


Received: 12 May 2022
Accepted: 16 August 2022
Published: 30 November 2023
© The Author(s) 2023.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Other papers from this open access journal are available free of charge from http://www.springer.com/journal/41095. To submit a manuscript, please go to https://www.editorialmanager.com/cvmj.
