Modified Base Autoencoder and Variational Autoencoder for Denoising Images in CIFAR-10 and MNIST Datasets

Jenan A. Alhijaj; Rana J. AL-Sukeinee; Asaad A. Alhijaj; Noor M. Al-Moosawi; Raidah S. Khudeyer

doi:10.31449/inf.v49i27.9620

Abstract

As the volume of digital images grows, so does the need for image quality sufficient for accurate analysis and display, which requires methods that reduce image noise while preserving important features such as edges, corners, and sharp details. In recent years, deep learning algorithms have become increasingly important for image denoising because they can model complex image patterns. This paper compares the performance of modified base Autoencoder (AE) and Variational Autoencoder (VAE) models for image denoising on the CIFAR-10 dataset of color images and the MNIST dataset of grayscale images. Our proposed modification to the base AE and VAE architectures changes the encoder and decoder layers to strengthen feature extraction and reconstruction, resulting in improved denoising performance. To simulate real-world image degradation, data preparation involved normalization and the injection of Gaussian noise (0.5 for both MNIST and CIFAR-10). The encoder-decoder architecture uses batch normalization and UpSampling2D layers with sigmoid outputs to support accurate spatial reconstruction. The AE optimizes an MSE reconstruction loss, while the VAE combines MSE with KL divergence for latent regularization. The two models were evaluated using two metrics: the Structural Similarity Index (SSIM) and the Peak Signal-to-Noise Ratio (PSNR). On both datasets, the results indicate that the VAE model outperforms the AE model in terms of image quality. The modified VAE model achieved an SSIM of 0.954 and a PSNR of 32.86 dB on the CIFAR-10 color dataset, and an SSIM of 0.951 and a PSNR of 24.44 dB on the MNIST grayscale dataset. The AE model achieved an SSIM of 0.891 and a PSNR of 27.72 dB on CIFAR-10, and an SSIM of 0.883 and a PSNR of 29.06 dB on MNIST.
This study addresses how AE and VAE architectures differ in denoising performance across dataset complexities and the principles for optimal model selection.
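
The noise injection, the PSNR metric, and the VAE objective named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes that the value 0.5 is a scaling factor applied to standard normal noise, that images are normalized to [0, 1], and that the KL term is taken against a standard normal prior with unit weight. The function names and the `kl_weight` parameter are hypothetical and chosen only for illustration.

```python
import numpy as np


def add_gaussian_noise(images, noise_factor=0.5, seed=0):
    """Add zero-mean Gaussian noise scaled by noise_factor, then clip to [0, 1].

    Assumption: images are already normalized to the [0, 1] range.
    """
    rng = np.random.default_rng(seed)
    noisy = images + noise_factor * rng.standard_normal(images.shape)
    return np.clip(noisy, 0.0, 1.0)


def psnr(reference, estimate, max_value=1.0):
    """Peak Signal-to-Noise Ratio in dB for images scaled to [0, max_value]."""
    mse = np.mean((reference - estimate) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(max_value ** 2 / mse)


def vae_loss(reference, reconstruction, z_mean, z_log_var, kl_weight=1.0):
    """MSE reconstruction term plus KL divergence to a standard normal prior.

    z_mean and z_log_var have shape (batch, latent_dim).
    kl_weight is a hypothetical knob; the abstract does not state a weighting.
    """
    mse = np.mean((reference - reconstruction) ** 2)
    kl_per_sample = -0.5 * np.sum(
        1.0 + z_log_var - z_mean ** 2 - np.exp(z_log_var), axis=1
    )
    return mse + kl_weight * np.mean(kl_per_sample)


if __name__ == "__main__":
    # Synthetic stand-in for a batch of normalized 28x28 grayscale images.
    clean = np.random.default_rng(1).random((8, 28, 28, 1))
    noisy = add_gaussian_noise(clean, noise_factor=0.5)
    print("PSNR of noisy input (dB):", psnr(clean, noisy))
```

A plain AE would be trained on the MSE term alone, whereas the VAE objective adds the KL term. SSIM is omitted here to keep the sketch dependency-free.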

Author Biography

Jenan A. Alhijaj, University of Basrah

Computer science, master's degree

