Transformer object detection in street view images based on improved YOLOv8
LIAO Fangzhou1, YANG Xiaoxia1, YANG Ronghao2, SHI Qiqi2
1. College of Geography and Planning, Chengdu University of Technology, Chengdu 610059; 2. College of Earth and Planetary Science, Chengdu University of Technology, Chengdu 610059
Abstract:Street view images are a form of geospatial big data at the urban street level. Utilizing street view images not only enables large-scale and efficient transformer inspection but also reduces inspection costs. However, transformers in street view images often have few pixels, low resolution and complex backgrounds, leading to unsatisfactory precision of existing object detection methods. To address these issues, this paper proposes an improved YOLOv8 algorithm named YOLOv8-WSX. Firstly, wise intersection over union (WIoU) is used as the loss function to strengthen the detection performance of the algorithm for difficult samples. Secondly, the spatial group-wise enhance (SGE) attention mechanism module is introduced to improve the feature extraction ability of the algorithm. Finally, an extra-small object detection head is added to solve the problem of missing detection of extra-small transformer objects. The experimental results show that compared to YOLOv8, YOLOv8-WSX increases the F1 score by 5.9 percentage points, increases the mean average precision by 6.3 percentage points for IoU at 50%, and increases the mean average precision by 3.2 percentage points for IoU from 50% to 95%. Additionally, the model has fewer parameters.
廖方舟, 杨晓霞, 杨容浩, 施琪琦. 基于改进YOLOv8的街景图像变压器目标检测[J]. 电气技术, 2024, 25(12): 12-20.
LIAO Fangzhou, YANG Xiaoxia, YANG Ronghao, SHI Qiqi. Transformer object detection in street view images based on improved YOLOv8. Electrical Engineering, 2024, 25(12): 12-20.