A Comparative Analysis of Image Segmentation Using Classical and Deep Learning Approach

Arsen Plaksyvyi; Maria Skublewska-Paszkowska; Pawel Powroznik

doi:10.12913/22998624/172771

Volume 17, Issue 6, 2023

Stats

CC-BY 4.0

Get citation

A Comparative Analysis of Image Segmentation Using Classical and Deep Learning Approach

Arsen Plaksyvyi ¹

Maria Skublewska-Paszkowska ¹

Pawel Powroznik ¹

More details

Hide details

Faculty of Electrical Engineering and Computer Science, Department of Computer Science, Lublin University of Technology, ul. Nadbystrzycka 38D, 20-618 Lublin, Poland

Corresponding author

Maria Skublewska-Paszkowska

Faculty of Electrical Engineering and Computer Science, Department of Computer Science, Lublin University of Technology, ul. Nadbystrzycka 38D, 20-618 Lublin, Poland

Adv. Sci. Technol. Res. J. 2023; 17(6):127-139

DOI: https://doi.org/10.12913/22998624/172771

Article (PDF, 3.92 MB)

KEYWORDS

pattern recognition

artificial neural networks

image segmentation

TOPICS

Computer Engineering

ABSTRACT

Segmentation is one of the image processing techniques, widely used in computer vision, to extract various types of information represented as objects or areas of interest. The development of neural networks has influenced image processing techniques, including creation of new ways of image segmentation. The aim of this study is to compare classical algorithms and deep learning methods in RGB image segmentation tasks. Two hypotheses were put forward: 1) “The quality of segmentation applying deep learning methods is higher than using classical methods for RGB images”, and 2) “The increase of the RGB image resolution has positive impact on the segmentation quality”. Two traditional segmentation algorithms (Thresholding and K-means) were compared with deep learning approach (U-Net, SegNet and FCN 8) to verify RGB segmentation quality. Two resolutions of images were taken into consideration: 160x240 and 320x480 pixels. Segmentation quality for each algorithm was estimated based on four parameters: Accuracy, Precision, Recall and Sorensen-Dice ratio (Dice score). In the study the Carvana dataset, containing 5,088 high-resolution images of cars, was applied. The initial set was divided into training, validation and test subsets as 60%, 20%, 20%, respectively. As a result, the best Accuracy, Dice score and Recall for images with resolution 160x240 were obtained for U-Net, achieving 99.37%, 98.56%, and 98.93%, respectively. For the same resolution the highest Precision 98.19% was obtained for FCN-8 architecture. For higher resolution, 320x480, the best mean Accuracy, Dice score, and Precision were obtained for FCN-8 network, reaching 99.55%, 99.95% and 98.85%, respectively. The highest results for classical methods were obtained for Threshold algorithm reaching 80.41% Accuracy, 58.49% Dice score, 67.32% Recall and 52.62% Precision. The results confirm both hypotheses.

Submit your paper

FAQ

Instructions for Authors

All issues

Articles in press

Send by email

Integrating hydration heat kinetics with artificial neural networks: A maturity model for prediction of ordinary Portland cement mortar strength development

Bayesian and neural network models for risk assessment of traffic-induced vibrations in residential buildings

Web-post buckling resistance prediction models of stainless-steel cellular beams using machine learning algorithms

Multithreaded evolutionary detection of hyperspheres in high-dimensional point cloud data

Comparison of unsupervised machine learning segmentation algorithms in the analysis of unmanned aerial vehicle – based multispectral crop images

Indexes

Keywords index

Authors index