» Articles » PMID: 34883845

Inter-Level Feature Balanced Fusion Network for Street Scene Segmentation

Overview
Journal Sensors (Basel)
Publisher MDPI
Specialty Biotechnology
Date 2021 Dec 10
PMID 34883845
Authors
Affiliations
Soon will be listed here.
Abstract

Semantic segmentation, as a pixel-level recognition task, has been widely used in a variety of practical scenes. Most of the existing methods try to improve the performance of the network by fusing the information of high and low layers. This kind of simple concatenation or element-wise addition will lead to the problem of unbalanced fusion and low utilization of inter-level features. To solve this problem, we propose the Inter-Level Feature Balanced Fusion Network (IFBFNet) to guide the inter-level feature fusion towards a more balanced and effective direction. Our overall network architecture is based on the encoder-decoder architecture. In the encoder, we use a relatively deep convolution network to extract rich semantic information. In the decoder, skip-connections are added to connect and fuse low-level spatial features to restore a clearer boundary expression gradually. We add an inter-level feature balanced fusion module to each skip connection. Additionally, to better capture the boundary information, we added a shallower spatial information stream to supplement more spatial information details. Experiments have proved the effectiveness of our module. Our IFBFNet achieved a competitive performance on the Cityscapes dataset with only finely annotated data used for training and has been greatly improved on the baseline network.

References
1.
R Palafox P, Betz J, Nobis F, Riedl K, Lienkamp M . SemanticDepth: Fusing Semantic Segmentation and Monocular Depth Estimation for Enabling Autonomous Driving in Roads without Lane Lines. Sensors (Basel). 2019; 19(14). PMC: 6679503. DOI: 10.3390/s19143224. View

2.
Wang K, Yan F, Zou B, Tang L, Yuan Q, Lv C . Occlusion-Free Road Segmentation Leveraging Semantics for Autonomous Vehicles. Sensors (Basel). 2019; 19(21). PMC: 6864472. DOI: 10.3390/s19214711. View

3.
Zhang M, Jing W, Lin J, Fang N, Wei W, Wozniak M . NAS-HRIS: Automatic Design and Architecture Search of Neural Network for Semantic Segmentation in Remote Sensing Images. Sensors (Basel). 2020; 20(18). PMC: 7570751. DOI: 10.3390/s20185292. View

4.
Badrinarayanan V, Kendall A, Cipolla R . SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation. IEEE Trans Pattern Anal Mach Intell. 2017; 39(12):2481-2495. DOI: 10.1109/TPAMI.2016.2644615. View

5.
Chen L, Papandreou G, Kokkinos I, Murphy K, Yuille A . DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs. IEEE Trans Pattern Anal Mach Intell. 2017; 40(4):834-848. DOI: 10.1109/TPAMI.2017.2699184. View