基于改进Fast-SCNN的裂缝图像实时分割算法

Real-time segmentation algorithm of crack images based on improved Fast-SCNN

  • 摘要: 裂缝检测是一项关键工程任务,针对现有的主流裂缝语义分割模型参数量大、计算量高、实时性弱等问题,提出一种基于Fast-SCNN(fast segmentation convolution neural network)改进的裂缝图像实时分割算法。首先,该算法在Fast-SCNN基础上优化了空间金字塔池化模块SPP(spatial pyramid pooling) 存在像素位置信息丢失以及计算量大的不足,提出了一种轻量级的特征金字塔注意力模块;其次,改进了上采样的方式,充分考虑像素之间的关系,提出了一种轻量级的位置自注意力模块用于上采样,以此来提升检测精度;最后,双分支的各自输出通过注意力门突显裂缝相关区域和抑制无关背景。所提算法能够为模型提供更为精确的像素级别的注意力,更加有效识别细小裂缝和提升复杂背景裂缝分割的鲁棒性。实验结果表明:与现有的主流模型和其他轻量级模型相比,该算法进一步平衡了分割精度与检测速度,在裂缝数据集上达到 80.31%的平均交并比,F1 score为76.74%,参数量为1.20 M,计算量不足1 G,推理速度达到151 f/s,对裂缝图像实时分割检测任务具有较高的应用价值。

     

    Abstract: Crack detection is a key engineering task. Aiming at the problems of large number of parameters, large amount of calculation and weak real-time performance of the existing mainstream crack semantic segmentation models, an improved real-time segmentation algorithm of crack images based on fast segmentation convolution neural network (Fast-SCNN) was proposed. First, the spatial pyramid pooling (SPP) module with disadvantages of loss of pixel position information and large amount of calculation was optimized on the basis of Fast-SCNN. Then, the up-sampling method was improved to fully consider the relationship between pixels, and a lightweight positional self-attention module was proposed for up-sampling to improve the detection accuracy. Finally, the respective outputs of the dual branches highlight the crack-related regions and suppress the irrelevant backgrounds through the attention gates. The proposed algorithm can provide a more accurate pixel-level attention for the model, and can more effectively identify small cracks as well as improve the robustness of crack segmentation in complex backgrounds. Experiments show that, compared with the existing mainstream models and other lightweight models, the proposed algorithm further balances the segmentation accuracy and detection speed, and achieves an average intersection ratio of 80.31% and an F1 score of 76.74% on the crack dataset. The parameter amount is 1.20 M, the calculation amount is less than 1 G, and the inference speed reaches 151 f/s, which has high application value for the real-time segmentation and detection task of crack images.

     

/

返回文章
返回