دورية أكاديمية

基于原始点云网格自注意力机制的三维目标检测方法.

التفاصيل البيبلوغرافية
العنوان: 基于原始点云网格自注意力机制的三维目标检测方法. (Chinese)
Alternate Title: Grid self-attention mechanism 3D object detection method based on raw point cloud. (English)
المؤلفون: 鲁斌, 孙洋, 杨振宇
المصدر: Journal on Communication / Tongxin Xuebao; Oct2023, Vol. 44 Issue 10, p72-84, 13p
Abstract (English): To enhance the feature representation of region of interest (RoI), which incorporated a spatial context encoding module and soft regression loss, a grid self-attention mechanism 3D object detection method based on raw point cloud, named GT3D, was proposed. The spatial context encoding module was designed to effectively weight the local and spatial features of points through the attention mechanism, considering the contribution of different point cloud features for a more accurate feature representation. The soft regression loss was introduced to address label ambiguity arising during the data annotation phase. Experiments conducted on the public KITTI 3D object detection dataset demonstrate that the proposed method achieves significant improvements in detection accuracy compared to other publicly available point cloud-based 3D object detection methods. The detection results of the test set are submitted to the official KITTI server for public evaluation, achieving detection accuracies of 91.45%, 82.76%, and 79.74% for easy, moderate, and hard difficulty levels in car detection, respectively. [ABSTRACT FROM AUTHOR]
Abstract (Chinese): 为了增强感兴趣区域(RoI)的特征表达, 包括空间网格特征编码模块和软回归损失, 提出了一种基于 原始点云网格自注意力机制的三维目标检测方法GT3D。网格特征编码模块用于通过自注意力机制对点的局部特 征和空间特征进行有效加权, 充分考虑点云之间的几何关系, 以提供更准确的特征表达;软回归损失用于改善数 据标注过程中由于标注不准确而产生的回归歧义问题。将所提方法在公开的三维目标检测数据集KITTI 上进行实 验。结果表明, 所提方法相比其他已公开的基于点云的三维目标检测方法检测准确率提升明显, 并提交了KITTI 官方测试集进行公开测试, 对简单、中等和困难3 个难度等级的汽车检测准确率分别达到91.45%、82.76%和 79.74%。 [ABSTRACT FROM AUTHOR]
Copyright of Journal on Communication / Tongxin Xuebao is the property of Journal on Communications Editorial Office and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات: Complementary Index
الوصف
تدمد:1000436X
DOI:10.11959/j.issn.1000-436x.2023189