» Articles » PMID: 39275498

USSC-YOLO: Enhanced Multi-Scale Road Crack Object Detection Algorithm for UAV Image

Overview
Journal Sensors (Basel)
Publisher MDPI
Specialty Biotechnology
Date 2024 Sep 14
PMID 39275498
Authors
Affiliations
Soon will be listed here.
Abstract

Road crack detection is of paramount importance for ensuring vehicular traffic safety, and implementing traditional detection methods for cracks inevitably impedes the optimal functioning of traffic. In light of the above, we propose a USSC-YOLO-based target detection algorithm for unmanned aerial vehicle (UAV) road cracks based on machine vision. The algorithm aims to achieve the high-precision detection of road cracks at all scale levels. Compared with the original YOLOv5s, the main improvements to USSC-YOLO are the ShuffleNet V2 block, the coordinate attention (CA) mechanism, and the Swin Transformer. First, to address the problem of large network computational spending, we replace the backbone network of YOLOv5s with ShuffleNet V2 blocks, reducing computational overhead significantly. Next, to reduce the problems caused by the complex background interference, we introduce the CA attention mechanism into the backbone network, which reduces the missed and false detection rate. Finally, we integrate the Swin Transformer block at the end of the neck to enhance the detection accuracy for small target cracks. Experimental results on our self-constructed UAV near-far scene road crack i(UNFSRCI) dataset demonstrate that our model reduces the giga floating-point operations per second (GFLOPs) compared to YOLOv5s while achieving a 6.3% increase in mAP@50 and a 12% improvement in mAP@ [50:95]. This indicates that the model remains lightweight meanwhile providing excellent detection performance. In future work, we will assess road safety conditions based on these detection results to prioritize maintenance sequences for crack targets and facilitate further intelligent management.

Citing Articles

Vision-Based Localization Method for Picking Points in Tea-Harvesting Robots.

Yang J, Li X, Wang X, Fu L, Li S Sensors (Basel). 2024; 24(21).

PMID: 39517674 PMC: 11548263. DOI: 10.3390/s24216777.

References
1.
Ren S, He K, Girshick R, Sun J . Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans Pattern Anal Mach Intell. 2016; 39(6):1137-1149. DOI: 10.1109/TPAMI.2016.2577031. View

2.
Qu Z, Cao C, Liu L, Zhou D . A Deeply Supervised Convolutional Neural Network for Pavement Crack Detection With Multiscale Feature Fusion. IEEE Trans Neural Netw Learn Syst. 2021; 33(9):4890-4899. DOI: 10.1109/TNNLS.2021.3062070. View

3.
Zou Q, Zhang Z, Li Q, Qi X, Wang Q, Wang S . DeepCrack: Learning Hierarchical Convolutional Features for Crack Detection. IEEE Trans Image Process. 2018; . DOI: 10.1109/TIP.2018.2878966. View

4.
Lin T, Goyal P, Girshick R, He K, Dollar P . Focal Loss for Dense Object Detection. IEEE Trans Pattern Anal Mach Intell. 2018; 42(2):318-327. DOI: 10.1109/TPAMI.2018.2858826. View

5.
Lv Z, Cheng C, Lv H . Automatic identification of pavement cracks in public roads using an optimized deep convolutional neural network model. Philos Trans A Math Phys Eng Sci. 2023; 381(2254):20220169. PMC: 10350337. DOI: 10.1098/rsta.2022.0169. View