《计算机应用》唯一官方网站

• •    下一篇

基于加强特征提取的道路病害检测算法

龙伍丹1,彭博2,胡节1,申颖1,丁丹妮3   

  1. 1. 西南交通大学计算机与人工智能学院
    2. 西南交通大学信息科学与技术学院, 成都 610031
    3. 成都信息工程大学计算机学院
  • 收稿日期:2023-07-17 修回日期:2023-09-10 发布日期:2023-10-26 出版日期:2023-10-26
  • 通讯作者: 龙伍丹

Road damage detection algorithm based on enhanced feature extraction

  • Received:2023-07-17 Revised:2023-09-10 Online:2023-10-26 Published:2023-10-26

摘要: 针对道路病害区域小、类别数量不均衡导致检测困难的问题,提出基于YOLOv7-tiny的检测算法RDD-YOLO。首先,采用K-means++算法得到拟合目标尺寸更好的锚框。其次,在小目标检测支路上使用量化感知重参数化模块(QARepVGG),增强浅层特征提取,同时构建加强注意力模块(AM-CBAM)嵌入到颈部的三个输入,抑制复杂背景干扰。然后,设计特征融合模块(Res-RFB),模拟人眼扩大感受野融合多尺度信息,提高表征能力。另外,构造轻量级解耦头(S-DeHead)提高小目标检测准确率。最后,采用归一化Wasserstein距离度量(NWD)优化小目标定位过程,并缓解样本不均衡问题。实验结果表明,所提方法在仅增加0.71×106参数量和1.7GFLOPs计算量的成本下,mAP50提高6.19个百分点,F1-Score提高5.31个百分点,并且帧速率达到135.26frame/s,满足道路养护工作中对检测精度和速度的需求。

Abstract: In response to the challenge posed by the difficulty in detecting small road defect areas and the uneven distribution of defect categories, a detection algorithm termed RDD-YOLO was introduced, based on the YOLOv7-tiny architecture. Firstly, the K-means++ algorithm was employed to determine anchor boxes that better conform to target dimensions. Subsequently, in the realm of small object detection, the Quantization Aware RepVGG (QARepVGG) module was utilized within the auxiliary detection branch, thereby enhancing the extraction of shallow features. Concurrently, the Addition and Multiplication Convolutional Block Attention Module (AM-CBAM) was embedded into the three inputs of the neck, effectively suppressing disturbances arising from intricate backgrounds. Furthermore, the feature fusion module (Resblock with Receptive Field Block, Res-RFB) was devised to emulate the expansion of receptive fields in human visual perception, consequently amalgamating information across multiple scales and thereby amplifying representational aptitude. Additionally, a lightweight Small Decoupled Head (S-DeHead) is introduced to elevate the accuracy of detecting diminutive objects. Ultimately, the process of localizing small objects is optimized through the application of the Normalized Wasserstein Distance (NWD) metric, which in turn mitigates the challenge of imbalanced samples. Empirical findings underscore the efficacy of the proposed methodology, manifesting in a notable 6.19% enhancement in mAP50 and a 5.31% elevation in F1-Score, all achieved while upholding a frame rate of 135.26 frames per second. Impressively, these advancements are realized through the incorporation of a mere 0.71M additional parameters and a computational overhead of 1.7 GFLOPs, which can meet the requirements for both accuracy and expeditiousness in road maintenance.

中图分类号: