USD-YOLO: An enhanced YOLO algorithm for small object detection in unmanned systems perception
File(s)applsci-15-03795.pdf (13.06 MB)
Published version
Author(s)
Deng, Hongqiang
Zhang, Shuzhe
Wang, Xiaodong
Han, Tianxin
Ye, Yun
Type
Journal Article
Abstract
In the perception of unmanned systems, small object detection faces numerous challenges, including small size, low resolution, dense distribution, and occlusion, leading to suboptimal perception performance. To address these issues, we propose a specialized algorithm named Unmanned-system Small-object Detection-You Only Look Once (USD-YOLO). First, we designed an innovative module called the Anchor-Free Precision Enhancer to achieve more accurate bounding box overlap measurements and provide a smarter processing mechanism, thereby improving the localization accuracy of candidate boxes for small and densely distributed objects. Second, we introduced the Spatial and Channel Reconstruction Convolution module to reduce redundancy in spatial and channel features while extracting key features of small objects. Additionally, we designed a novel C2f-Global Attention Mechanism module to expand the receptive field and capture more contextual information, optimizing the detection head’s ability to handle small and low-resolution objects. We conducted extensive experimental comparisons with state-of-the-art models on three mainstream unmanned system datasets and a real unmanned ground vehicle. The experimental results demonstrate that USD-YOLO achieves higher detection precision and faster speed. On the Citypersons dataset, compared with the baseline, USD-YOLO improves mAP50-95, mAP50, and Recall by 8.5%, 5.9%, and 2.3%, respectively. Additionally, on the Flow-Img and DOTA-v1.0 datasets, USD-YOLO improves mAP50-95 by 2.5% and 2.5%, respectively.
Date Issued
2025-03-30
Date Acceptance
2025-03-28
Citation
Applied Sciences, 2025, 15 (7)
ISSN
2076-3417
Publisher
MDPI AG
Start Page
3795
End Page
3795
Journal / Book Title
Applied Sciences
Volume
15
Issue
7
Copyright Statement
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
License URL
Publication Status
Published
Article Number
ARTN 3795
Date Publish Online
2025-03-30