» Articles » PMID: 33668156

Image Segmentation Using Encoder-Decoder with Deformable Convolutions

Overview
Journal Sensors (Basel)
Publisher MDPI
Specialty Biotechnology
Date 2021 Mar 6
PMID 33668156
Citations 3
Authors
Affiliations
Soon will be listed here.
Abstract

Image segmentation is an essential step in image analysis that brings meaning to the pixels in the image. Nevertheless, it is also a difficult task due to the lack of a general suited approach to this problem and the use of real-life pictures that can suffer from noise or object obstruction. This paper proposes an architecture for semantic segmentation using a convolutional neural network based on the Xception model, which was previously used for classification. Different experiments were made in order to find the best performances of the model (eg. different resolution and depth of the network and data augmentation techniques were applied). Additionally, the network was improved by adding a deformable convolution module. The proposed architecture obtained a 76.8 mean IoU on the Pascal VOC 2012 dataset and 58.1 on the Cityscapes dataset. It outperforms SegNet and U-Net networks, both networks having considerably more parameters and also a higher inference time.

Citing Articles

Brain CT image classification based on mask RCNN and attention mechanism.

Yin S, Li H, Teng L, Laghari A, Almadhor A, Gregus M Sci Rep. 2025; 14(1):29300.

PMID: 39905102 PMC: 11794603. DOI: 10.1038/s41598-024-78566-1.


A Medical Image Segmentation Method Based on Improved UNet 3+ Network.

Xu Y, Hou S, Wang X, Li D, Lu L Diagnostics (Basel). 2023; 13(3).

PMID: 36766681 PMC: 9914627. DOI: 10.3390/diagnostics13030576.


FN-OCT: Disease Detection Algorithm for Retinal Optical Coherence Tomography Based on a Fusion Network.

Ai Z, Huang X, Feng J, Wang H, Tao Y, Zeng F Front Neuroinform. 2022; 16:876927.

PMID: 35784186 PMC: 9243322. DOI: 10.3389/fninf.2022.876927.


An Autonomous Robot-Aided Auditing Scheme for Floor Cleaning.

Pathmakumar T, Kalimuthu M, Elara M, Ramalingam B Sensors (Basel). 2021; 21(13).

PMID: 34202746 PMC: 8271831. DOI: 10.3390/s21134332.

References
1.
Moeskops P, Viergever M, Mendrik A, de Vries L, Benders M, Isgum I . Automatic Segmentation of MR Brain Images With a Convolutional Neural Network. IEEE Trans Med Imaging. 2016; 35(5):1252-1261. DOI: 10.1109/TMI.2016.2548501. View

2.
Qian N . On the momentum term in gradient descent learning algorithms. Neural Netw. 2003; 12(1):145-151. DOI: 10.1016/s0893-6080(98)00116-6. View

3.
Artacho B, Savakis A . Waterfall Atrous Spatial Pooling Architecture for Efficient Semantic Segmentation. Sensors (Basel). 2019; 19(24). PMC: 6960670. DOI: 10.3390/s19245361. View

4.
Badrinarayanan V, Kendall A, Cipolla R . SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation. IEEE Trans Pattern Anal Mach Intell. 2017; 39(12):2481-2495. DOI: 10.1109/TPAMI.2016.2644615. View

5.
Chen L, Papandreou G, Kokkinos I, Murphy K, Yuille A . DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs. IEEE Trans Pattern Anal Mach Intell. 2017; 40(4):834-848. DOI: 10.1109/TPAMI.2017.2699184. View