面向端侧实时应用的红外小目标轻量级Transformer跟踪算法

    Lightweight Transformer-based tracking algorithm for infrared tiny targets with edge deployment

    • 摘要: 红外小目标在空中监测、野生动物保护与安防巡查等无人机应用中具有重要价值,具备低可见性、抗干扰强、全天候工作的优势。然而,由于红外图像中目标通常具有尺度小、对比度低、边界模糊、背景干扰强等特点,传统的目标跟踪方法在特征感知、位置回归与实时性方面面临严峻挑战。面向红外小目标跟踪任务,提出一种轻量级Transformer端侧实时跟踪算法,以提升红外场景下小目标的感知鲁棒性和部署适应性。该方法设计了基于MobileNetV3的高效特征提取主干网络,在保证特征表达能力的同时显著降低模型计算复杂度,满足国产嵌入式平台的资源约束需求;构建了双层热感知特征交互结构,通过融合局部显著性和全局热响应信息,有效增强小目标区域的判别能力;同时引入归一化Wasserstein距离的回归损失,以提升小目标在尺寸变化和视角变换条件下的定位精度与几何稳定性。在红外小目标公开数据集BIRDSAI上开展了实验验证,结果表明所提方法能有效应对红外小目标跟踪挑战,并可在国产化嵌入式平台上实现稳定实时运行,跟踪准确率达80.8%,帧率达43 frame/s,在实际应用场景中展现出良好性能。

       

      Abstract: Infrared tiny object tracking plays a critical role in various unmanned aerial vehicle (UAV) applications, including aerial surveillance, wildlife protection, and security patrol, owing to its advantages such as low visibility, strong anti-interference capability, and all-weather operability. However, infrared images typically present challenges including small target scale, low contrast, blurred boundaries, and complex background interference, which severely hinder the performance of conventional tracking algorithms in terms of feature perception, localization accuracy, and real-time deployment. To address these issues, a lightweight Transformer-based tracking algorithm tailored for edge-side real-time infrared small object tracking was proposed. The proposed method designed an efficient backbone network based on MobileNetV3, which could significantly reduce the computational complexity while preserving feature representation capacity, thereby meeting the resource constraints of domestic embedded platforms. Furthermore, a dual-level thermal-aware feature interaction structure was constructed to enhance the discriminability of small target regions by integrating both local saliency and global thermal responses.In addition, the regression loss was formulated based on the normalized Wasserstein distance (NWD) to improve the spatial localization accuracy and geometric stability of small targets under scale variation and perspective change. Extensive experimental results conducted on the public BIRDSAI dataset demonstrate that the proposed method effectively addresses the challenges of infrared small object tracking and achieves stable real-time performance on domestic embedded platforms with an precision of up to 80.8% and a frame rate of 43 frame/s, indicating strong potential for practical deployment.

       

    /

    返回文章
    返回