International Journal of Fuzzy Logic and Intelligent Systems 2021; 21(4): 401-408
Published online December 25, 2021
https://doi.org/10.5391/IJFIS.2021.21.4.401
© The Korean Institute of Intelligent Systems
Wang-Su Jeon1 and Sang-Yong Rhee2
1Department of IT Convergence Engineering, Kyungnam University, Changwon, Korea
2Department of Computer Engineering, Kyungnam University, Changwon, Korea
Correspondence to :
Sang-Yong Rhee (syrhee@kyungnam.ac.kr)
*These authors contributed equally to this work.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Image segmentation is the process of simplifying the analysis of the meaning or the front to say the process of dividing the image into a set of multiple pixels. The multiple path feature aggregation (MPFA) method proposed in this paper aims to extract various information of an object, and uses conventional pyramid pooling or the extraction of various sized features. This information can be combined with different regional features to obtain the overall feature information. We split four paths to extract numerous local features, and the results showed that the mean intersection over union (mIOU) is 81.6% for the validation data from the PASCAL VOC 2012 dataset, and a better performance than the existing DeepLab model was demonstrated.
Keywords: MPFA, Semantic segmentation, Feature aggregation, CNN, Inverted residual block, Local context
No potential conflicts of interest relevant to this article are reported.
E-mail : jws2218@naver.com
E-mail : syrhee@kyungnam.ac.kr
International Journal of Fuzzy Logic and Intelligent Systems 2021; 21(4): 401-408
Published online December 25, 2021 https://doi.org/10.5391/IJFIS.2021.21.4.401
Copyright © The Korean Institute of Intelligent Systems.
Wang-Su Jeon1 and Sang-Yong Rhee2
1Department of IT Convergence Engineering, Kyungnam University, Changwon, Korea
2Department of Computer Engineering, Kyungnam University, Changwon, Korea
Correspondence to:Sang-Yong Rhee (syrhee@kyungnam.ac.kr)
*These authors contributed equally to this work.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Image segmentation is the process of simplifying the analysis of the meaning or the front to say the process of dividing the image into a set of multiple pixels. The multiple path feature aggregation (MPFA) method proposed in this paper aims to extract various information of an object, and uses conventional pyramid pooling or the extraction of various sized features. This information can be combined with different regional features to obtain the overall feature information. We split four paths to extract numerous local features, and the results showed that the mean intersection over union (mIOU) is 81.6% for the validation data from the PASCAL VOC 2012 dataset, and a better performance than the existing DeepLab model was demonstrated.
Keywords: MPFA, Semantic segmentation, Feature aggregation, CNN, Inverted residual block, Local context
Type of segmentation: (a) input, (b) instance segmentation, (c) semantic segmentation, and (d) panoptic segmentation. The images are from [
Multiple path feature aggregation network architecture.
Atrous spatial pyramid pooling structure.
Multi-scale residual feature structure.
Linear bottleneck structure: (a) residual block and (b) inverted residual block.
Pyramid pooling module structure.
Visual measurement of PASCAL VOC 2012 data. Results of the proposed method are compared with the baseline.
Visual measurement of PASCAL VOC 2012 data. The results of the proposed method are compared with the baseline.
Table 1 . Mean IOU and fps measurement according to MPFA size.
Method | Mean IOU(%) | fps |
---|---|---|
ResNet50-MPFA512 | 12 | |
ResNet50-MPFA256 | 78.3 | 24 |
ResNet50-MPFA128 | 75.5 | 32 |
ResNet50-MPFA64 | 72.3 |
Table 2 . Mean IOU measurement according to batch size and iterations.
Method | Batch size | Iteration | Mean IOU(%) |
---|---|---|---|
ResNet50-MPFA512 | 4 | 10k | |
ResNet50-MPFA512 | 4 | 20k | 77.7 |
ResNet50-MPFA512 | 10 | 20k | 78.3 |
ResNet50-MPFA512 | 10 | 20k |
Jeongmin Kim and Hyukdoo Choi
International Journal of Fuzzy Logic and Intelligent Systems 2024; 24(2): 105-113 https://doi.org/10.5391/IJFIS.2024.24.2.105Xinzhi Hu, Wang-Su Jeon, Grezgorz Cielniak, and Sang-Yong Rhee
International Journal of Fuzzy Logic and Intelligent Systems 2024; 24(1): 1-9 https://doi.org/10.5391/IJFIS.2024.24.1.1Herlawati Herlawati, Edi Abdurachman, Yaya Heryadi, and Haryono Soeparno
International Journal of Fuzzy Logic and Intelligent Systems 2023; 23(4): 389-398 https://doi.org/10.5391/IJFIS.2023.23.4.389Type of segmentation: (a) input, (b) instance segmentation, (c) semantic segmentation, and (d) panoptic segmentation. The images are from [
Multiple path feature aggregation network architecture.
|@|~(^,^)~|@|Atrous spatial pyramid pooling structure.
|@|~(^,^)~|@|Multi-scale residual feature structure.
|@|~(^,^)~|@|Linear bottleneck structure: (a) residual block and (b) inverted residual block.
|@|~(^,^)~|@|Pyramid pooling module structure.
|@|~(^,^)~|@|Visual measurement of PASCAL VOC 2012 data. Results of the proposed method are compared with the baseline.
|@|~(^,^)~|@|Visual measurement of PASCAL VOC 2012 data. The results of the proposed method are compared with the baseline.