» Articles » PMID: 36365919

AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection

Overview
Journal Sensors (Basel)
Publisher MDPI
Specialty Biotechnology
Date 2022 Nov 11
PMID 36365919
Authors
Affiliations
Soon will be listed here.
Abstract

Small object detection is one of the key challenges in the current computer vision field due to the low amount of information carried and the information loss caused by feature extraction. You Only Look Once v5 (YOLOv5) adopts the Path Aggregation Network to alleviate the problem of information loss, but it cannot restore the information that has been lost. To this end, an auxiliary information-enhanced YOLO is proposed to improve the sensitivity and detection performance of YOLOv5 to small objects. Firstly, a context enhancement module containing a receptive field size of 21×21 is proposed, which captures the global and local information of the image by fusing multi-scale receptive fields, and introduces an attention branch to enhance the expressive ability of key features and suppress background noise. To further enhance the feature expression ability of small objects, we introduce the high- and low-frequency information decomposed by wavelet transform into PANet to participate in multi-scale feature fusion, so as to solve the problem that the features of small objects gradually disappear after multiple downsampling and pooling operations. Experiments on the challenging dataset Tsinghua-Tencent 100 K show that the mean average precision of the proposed model is 9.5% higher than that of the original YOLOv5 while maintaining the real-time speed, which is better than the mainstream object detection models.

Citing Articles

Machine vision-based autonomous road hazard avoidance system for self-driving vehicles.

Qiu C, Tang H, Yang Y, Wan X, Xu X, Lin S Sci Rep. 2024; 14(1):12178.

PMID: 38806585 PMC: 11133374. DOI: 10.1038/s41598-024-62629-4.


MBAN: multi-branch attention network for small object detection.

Li L, Gao S, Wu F, An X PeerJ Comput Sci. 2024; 10:e1965.

PMID: 38660186 PMC: 11041927. DOI: 10.7717/peerj-cs.1965.


ETSR-YOLO: An improved multi-scale traffic sign detection algorithm based on YOLOv5.

Liu H, Zhou K, Zhang Y, Zhang Y PLoS One. 2023; 18(12):e0295807.

PMID: 38096147 PMC: 10721062. DOI: 10.1371/journal.pone.0295807.


STC-YOLO: Small Object Detection Network for Traffic Signs in Complex Environments.

Lai H, Chen L, Liu W, Yan Z, Ye S Sensors (Basel). 2023; 23(11).

PMID: 37300034 PMC: 10255978. DOI: 10.3390/s23115307.


TRD-YOLO: A Real-Time, High-Performance Small Traffic Sign Detection Algorithm.

Chu J, Zhang C, Yan M, Zhang H, Ge T Sensors (Basel). 2023; 23(8).

PMID: 37112213 PMC: 10145582. DOI: 10.3390/s23083871.

References
1.
Ren S, He K, Girshick R, Sun J . Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans Pattern Anal Mach Intell. 2016; 39(6):1137-1149. DOI: 10.1109/TPAMI.2016.2577031. View

2.
Lin T, Goyal P, Girshick R, He K, Dollar P . Focal Loss for Dense Object Detection. IEEE Trans Pattern Anal Mach Intell. 2018; 42(2):318-327. DOI: 10.1109/TPAMI.2018.2858826. View

3.
Santos C, Aguiar M, Welfer D, Belloni B . A New Approach for Detecting Fundus Lesions Using Image Processing and Deep Neural Network Architecture Based on YOLO Model. Sensors (Basel). 2022; 22(17). PMC: 9460625. DOI: 10.3390/s22176441. View

4.
Guo Z, Wang C, Yang G, Huang Z, Li G . MSFT-YOLO: Improved YOLOv5 Based on Transformer for Detecting Defects of Steel Surface. Sensors (Basel). 2022; 22(9). PMC: 9100565. DOI: 10.3390/s22093467. View

5.
Hu J, Shen L, Albanie S, Sun G, Wu E . Squeeze-and-Excitation Networks. IEEE Trans Pattern Anal Mach Intell. 2019; 42(8):2011-2023. DOI: 10.1109/TPAMI.2019.2913372. View