Research Article

LEARNING DENSE CONTEXTUAL FEATURES FOR SEMANTIC SEGMENTATION

Volume: 62 Number: 1 June 30, 2020
EN

LEARNING DENSE CONTEXTUAL FEATURES FOR SEMANTIC SEGMENTATION

Abstract

Semantic segmentation, which is one of the key problems in computer vision, has been applied in various application domains such as autonomous driving, robot navigation, or medical imagery, to name a few. Recently, deep learning, especially deep neural networks, have shown significant performance improvement over conventional semantic segmentation methods. In this paper, we present a novel encoder-decoder type deep neural network-based method, namely XSeNet, that can be trained end-to-end in a supervised manner. We adapt ResNet-50 layers as the encoder and design a cascaded decoder that composes of the stack of the X-Modules, which enables the network to learning dense contextual information and having wider field-of-view. We evaluate our method using CamVid dataset, and experimental results reveal that our method can segment most part of the scene accurately and even outperforms previous state-of-the art methods.

Keywords

References

  1. Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, “Gradient-based learning applied to document recognition," Pro-ceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998.
  2. M. D. Zeiler and R. Fergus, “Visualizing and understanding convolutional networks," in European conference oncomputer vision, pp. 818-833, Springer, 2014.
  3. A. Krizhevsky, I. Sutskever, and G. E. Hinton, “Imagenet classifcation with deep convolutional neural networks," inAdvances in neural information processing systems, pp. 1097-1105, 2012.
  4. K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition," arXiv preprintarXiv:1409.1556, 2014.
  5. J. Long, E. Shelhamer, and T. Darrell, “Fully convolutional networks for semantic segmentation," in Proceedings ofthe IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431-3440, 2015.
  6. A. Karpathy and L. Fei-Fei, “Deep visual-semantic alignments for generating image descriptions," in Proceedings ofthe IEEE Conference on Computer Vision and Pattern Recognition, pp. 3128-3137, 2015.
  7. V. Badrinarayanan, A. Kendall, and R. Cipolla, “Segnet: A deep convolutional encoder-decoder architecture forimage segmentation," arXiv preprint arXiv:1511.00561, 2015.
  8. O. Ronneberger, P. Fischer, and T. Brox, “U-net: Convolutional networks for biomedical image segmentation," inInternational Conference on Medical image computing and computer-assisted intervention, pp. 234-241, Springer,2015.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Authors

Long Ang Lim This is me
Türkiye

Publication Date

June 30, 2020

Submission Date

August 27, 2019

Acceptance Date

February 4, 2020

Published in Issue

Year 2020 Volume: 62 Number: 1

APA
Yalim Keles, H., & Lim, L. A. (2020). LEARNING DENSE CONTEXTUAL FEATURES FOR SEMANTIC SEGMENTATION. Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering, 62(1), 26-34. https://doi.org/10.33769/aupse.611958
AMA
1.Yalim Keles H, Lim LA. LEARNING DENSE CONTEXTUAL FEATURES FOR SEMANTIC SEGMENTATION. Commun.Fac.Sci.Univ.Ank.Series A2-A3: Phys.Sci. and Eng. 2020;62(1):26-34. doi:10.33769/aupse.611958
Chicago
Yalim Keles, Hacer, and Long Ang Lim. 2020. “LEARNING DENSE CONTEXTUAL FEATURES FOR SEMANTIC SEGMENTATION”. Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering 62 (1): 26-34. https://doi.org/10.33769/aupse.611958.
EndNote
Yalim Keles H, Lim LA (June 1, 2020) LEARNING DENSE CONTEXTUAL FEATURES FOR SEMANTIC SEGMENTATION. Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering 62 1 26–34.
IEEE
[1]H. Yalim Keles and L. A. Lim, “LEARNING DENSE CONTEXTUAL FEATURES FOR SEMANTIC SEGMENTATION”, Commun.Fac.Sci.Univ.Ank.Series A2-A3: Phys.Sci. and Eng., vol. 62, no. 1, pp. 26–34, June 2020, doi: 10.33769/aupse.611958.
ISNAD
Yalim Keles, Hacer - Lim, Long Ang. “LEARNING DENSE CONTEXTUAL FEATURES FOR SEMANTIC SEGMENTATION”. Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering 62/1 (June 1, 2020): 26-34. https://doi.org/10.33769/aupse.611958.
JAMA
1.Yalim Keles H, Lim LA. LEARNING DENSE CONTEXTUAL FEATURES FOR SEMANTIC SEGMENTATION. Commun.Fac.Sci.Univ.Ank.Series A2-A3: Phys.Sci. and Eng. 2020;62:26–34.
MLA
Yalim Keles, Hacer, and Long Ang Lim. “LEARNING DENSE CONTEXTUAL FEATURES FOR SEMANTIC SEGMENTATION”. Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering, vol. 62, no. 1, June 2020, pp. 26-34, doi:10.33769/aupse.611958.
Vancouver
1.Hacer Yalim Keles, Long Ang Lim. LEARNING DENSE CONTEXTUAL FEATURES FOR SEMANTIC SEGMENTATION. Commun.Fac.Sci.Univ.Ank.Series A2-A3: Phys.Sci. and Eng. 2020 Jun. 1;62(1):26-34. doi:10.33769/aupse.611958

Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering licensed under a Creative Commons Attribution 4.0 International License.

Creative Commons License