赵高强,潘志松,李云波.基于高性能特征提取和任务解耦的无人机航拍图像小目标检测[J]. 微电子学与计算机,2024,41(4):55-63. doi: 10.19304/J.ISSN1000-7180.2023.0258
引用本文: 赵高强,潘志松,李云波.基于高性能特征提取和任务解耦的无人机航拍图像小目标检测[J]. 微电子学与计算机,2024,41(4):55-63. doi: 10.19304/J.ISSN1000-7180.2023.0258
ZHAO G Q,PAN Z S,LI Y B. Small target detection in UAV aerial image based on high-performance feature extraction and task decoupling[J]. Microelectronics & Computer,2024,41(4):55-63. doi: 10.19304/J.ISSN1000-7180.2023.0258
Citation: ZHAO G Q,PAN Z S,LI Y B. Small target detection in UAV aerial image based on high-performance feature extraction and task decoupling[J]. Microelectronics & Computer,2024,41(4):55-63. doi: 10.19304/J.ISSN1000-7180.2023.0258

基于高性能特征提取和任务解耦的无人机航拍图像小目标检测

Small target detection in UAV aerial image based on high-performance feature extraction and task decoupling

  • 摘要: 无人机广泛应用于环境监测、资源规划和电力巡检等多种领域,其航拍图像中存在大量小尺寸目标,给目标检测任务带来难度。为此,在YOLOv5s的基础上提出一种基于高性能特征提取和任务解耦的目标检测网络(HFTT-Net)算法。首先,针对航拍图像中小尺寸目标特征提取困难的问题,在原始骨干网络的基础上引入多头自注意力机制,使网络充分关注小目标信息,在多尺度特征融合过程中使用SPD(Space-to-depth)组件,增强待检测目标的特征;接着,对于目标检测中普遍存在的任务冲突的问题,将分类头与回归头进行解耦操作,进一步提升目标检测精度;最后,结合基于EIoU的回归损失对网络进行监督,提升网络收敛速度,实现无人机航拍图像中目标的精确检测。在VisDrone2019数据集上的实验结果表明,HFTT-Net中的高性能特征提取和任务解耦操作能够充分提升网络的小目标检测能力,在航拍图像的多密集小目标场景任务中表现突出,该算法在能够满足实时检测的情况下与经典的YOLOv5s算法相比精度提升了2.5%。

     

    Abstract: Unmanned Aerial Vehicles (UAVs) are widely used in many fields, such as environmental monitoring, resource planning and power patrol inspection. There are a large number of small targets in their aerial images, which makes the target detection task difficult. Therefore, based on YOLOv5s, a target detection network (HFTT-Net) algorithm based on high-performance feature extraction and task decoupling is proposed. First, aiming at the difficulty of feature extraction of small and medium-sized targets in aerial images, the multi-head self-attention mechanism is introduced on the basis of the original backbone network to make the network pay full attention to the small target information, and the SPD (Space-to-Depth) component is used in the multi-scale feature fusion process to enhance the features of the target to be detected. Secondly, for the common problem of task conflict in target detection, the classification head and regression head are decoupled to further improve the target detection accuracy. Finally, combined with the regression loss based on EIoU, the network is supervised to improve the convergence speed of the network and realize the accurate detection of targets in the aerial image of UAV. The experimental results on the VisDrone2019 dataset show that the high-performance feature extraction and task decoupling operation in HFTT-Net can fully improve the network's small target detection ability, which is outstanding in the multi-dense small target scene tasks of aerial images. The accuracy of this algorithm is improved by 2.5% compared with the classic YOLOv5s algorithm under the condition that it can meet real-time detection.

     

/

返回文章
返回