Weakly-Supervised Image Semantic Segmentation Based on Superpixel Region Merging

Quanchun Jiang

Olamide Timothy Tawose

Songwen Pei

Xiaodong Chen

Linhua Jiang

Jiayao Wang and Dongfang Zhao

Abstract

In this paper, we propose a semantic segmentation method based on superpixel region merging and a convolutional neural network (CNN), referred to as the regional merging neural network (RMNN). Image annotation has always played an important role in weakly-supervised semantic segmentation, and most methods rely on manual labeling. In our method, after superpixel segmentation, superpixels with similar features are merged according to the relationships between their pixels, forming a set of superpixel blocks. A fully convolutional network (FCN) generates rough predictions, which label some of these superpixel blocks. Starting from the labeled areas, we then iteratively identify other positive areas. Working with superpixels rather than individual pixels shortens the feature vector and reduces the data dimensionality. The algorithm not only uses superpixel merging to narrow down the target's range but also compensates for the lack of pixel-level annotation in weakly-supervised semantic segmentation. During network training, we use region merging to improve the accuracy of contour recognition. Extensive experiments on the PASCAL VOC 2012 dataset demonstrate the effectiveness of the proposed method. In particular, the evaluation results show that the mean intersection over union (mIoU) score of our method reaches 44.6%. Because the dilated convolution is applied at the pooled downsampling operation, it does not degrade the network's receptive field, thereby preserving the accuracy of image semantic segmentation. The findings of this work thus open the door to leveraging dilated convolution to improve the recognition accuracy of small objects.
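The merging step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the similarity criterion, so the sketch assumes that two adjacent superpixels are merged when the Euclidean distance between their mean colors falls below a threshold. The function name `merge_superpixels` and the `threshold` parameter are hypothetical, and the input label map is assumed to come from a superpixel algorithm such as SLIC.

```python
import numpy as np

def merge_superpixels(image, labels, threshold=0.1):
    """Merge 4-adjacent superpixels whose mean colors are close.

    Assumption (not from the paper): similarity is the Euclidean distance
    between per-superpixel mean colors, compared against `threshold`.

    image:  (H, W, C) float array
    labels: (H, W) int array of superpixel ids 0..K-1 (e.g. from SLIC)
    Returns an (H, W) int array of merged region ids, renumbered from 0.
    """
    k = int(labels.max()) + 1
    flat = labels.ravel()
    counts = np.bincount(flat, minlength=k).astype(float)

    # Mean color of each superpixel, accumulated one channel at a time.
    sums = np.stack(
        [np.bincount(flat, weights=image[..., c].ravel(), minlength=k)
         for c in range(image.shape[-1])],
        axis=1,
    )
    means = sums / np.maximum(counts, 1)[:, None]

    # Pairs of distinct superpixels that touch horizontally or vertically.
    horiz = np.stack([labels[:, :-1].ravel(), labels[:, 1:].ravel()], axis=1)
    vert = np.stack([labels[:-1, :].ravel(), labels[1:, :].ravel()], axis=1)
    pairs = np.concatenate([horiz, vert])
    pairs = np.unique(pairs[pairs[:, 0] != pairs[:, 1]], axis=0)

    # Union-find over superpixel ids.
    parent = list(range(k))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    # Means are NOT recomputed after a merge, so similar neighbors can
    # chain together (single-linkage behavior).
    for a, b in pairs:
        if np.linalg.norm(means[a] - means[b]) < threshold:
            ra, rb = find(int(a)), find(int(b))
            if ra != rb:
                parent[rb] = ra

    roots = np.array([find(i) for i in range(k)])
    _, merged_ids = np.unique(roots, return_inverse=True)
    return merged_ids[labels]
```

Each merged region could then receive a label from the rough FCN prediction, for example by majority vote over its pixels. That step is likewise an assumption and is left out of the sketch.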

Keywords

superpixel - CNN - region merging - SLIC - weakly-supervised

Issue

Volume: 3, Issue: 2 (2019)

Subjects

Infrastructure

Similar journals

Big Data and Cognitive Computing
ISPRS International Journal of Geo-Information
Future Internet

DOI

https://doi.org/10.3390/bdcc3020031
