AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
PDF (1.6 MB)
Collect
Submit Manuscript AI Chat Paper
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Open Access | Just Accepted

DefDeN: A Deformable Denoising-Based LiDAR and Camera Feature Fusion Model for 3D Object Detection

Peng Zhi1Xiaowei Xu1Hongtao Nie1Binbin Yong1( )Jun Shen2Qingguo Zhou1

1 School of Information & Engineering, Lanzhou University, Lanzhou 730000, China

2 School of Computing and Information Technology, University of Wollongong, NSW 2522, Australia

Show Author Information

Abstract

As a typical application of edge intelligence, 3D object detection in autonomous driving often requires multimodal information fusion to accurately perceive the environment. With images and point clouds serving as critical sensory data sources, 3D object detection integrates multimodal fusion to enhance detection accuracy. Generally, fusion algorithms leveraging attention mechanism can intelligently extract and integrate multimodal sensing information to overcome limitations posed by sensor calibration. However, attention mechanism may cause challenges such as slow model convergence and high false positives. Therefore, in this paper, we propose the Deformable Denoising (DefDeN) model to effectively integrate modules including gated information fusion networks, multi-scale deformable attention mechanisms, noise addition and denoising method, and contrastive learning for multi-sensor feature fusion. Experimental results on the nuScenes dataset demonstrate the superiority of DefDeN in detection accuracy, and the effectiveness of precise and stable perception for complex scenarios in autonomous driving systems.

Tsinghua Science and Technology
Cite this article:
Zhi P, Xu X, Nie H, et al. DefDeN: A Deformable Denoising-Based LiDAR and Camera Feature Fusion Model for 3D Object Detection. Tsinghua Science and Technology, 2024, https://doi.org/10.26599/TST.2024.9010173

252

Views

52

Downloads

0

Crossref

0

Web of Science

0

Scopus

0

CSCD

Altmetrics

Received: 29 June 2024
Accepted: 13 September 2024
Available online: 12 October 2024

© The author(s) 2025.

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).

Return