A Multi-Scale Deformable Convolutional Neural Network with Adaptive Adjustment for Robust Packaging Image Recognition
Abstract
This paper presents an image recognition model for package inspection, designed to meet the demands of real-time, accurate classification in challenging industrial environments. Traditional techniques that rely on handcrafted feature extraction often underperform under lighting variability, background interference, and deformation of packaged items. To address these limitations, the proposed model uses a multi-scale convolutional architecture that captures both local and global features through parallel convolutional filters of different sizes. An adaptive adjustment mechanism is built into the network to dynamically shift the sampling positions of the convolution operations according to image content, improving flexibility and feature representation. A data augmentation strategy combining geometric transformation, brightness adjustment, and semantic-level blending is applied to improve the model's robustness and generalization. Experiments on a custom industrial packaging dataset of 10,000 labeled images show that the proposed model achieves a classification accuracy of 96.8 percent, a recall of 95.3 percent, and an F1-score of 93.8 percent, with an inference time of 11.2 milliseconds and a parameter count of 21.3 million. Compared with existing deep learning architectures such as Residual Networks, the model shows considerable improvements in accuracy and speed. These results support its suitability for practical packaging inspection systems.
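The idea of parallel convolutional filters of different sizes can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the kernel sizes (1, 3, 5), the averaging weights standing in for learned filters, and the names `conv2d_same`, `box_kernel`, and `multi_scale_block` are all assumptions made for illustration.

```python
# Minimal sketch of a multi-scale convolution block (illustrative only).
# Kernel sizes, averaging weights, and function names are assumptions,
# not the configuration used in the paper.

def conv2d_same(image, kernel):
    """Cross-correlate a 2D list with a square, odd-sized kernel.

    Zero padding keeps the output the same height and width as the input.
    """
    h, w = len(image), len(image[0])
    k = len(kernel)
    pad = k // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for di in range(k):
                for dj in range(k):
                    y, x = i + di - pad, j + dj - pad
                    # Positions outside the image contribute zero.
                    if 0 <= y < h and 0 <= x < w:
                        acc += image[y][x] * kernel[di][dj]
            out[i][j] = acc
    return out


def box_kernel(size):
    """Averaging kernel, used here as a stand-in for learned weights."""
    return [[1.0 / (size * size)] * size for _ in range(size)]


def multi_scale_block(image, kernel_sizes=(1, 3, 5)):
    """Run parallel branches with different receptive fields.

    Returns one feature map per kernel size, stacked as a list of channels.
    """
    return [conv2d_same(image, box_kernel(s)) for s in kernel_sizes]


# Toy 4x4 input whose pixel values are 0..15 in row-major order.
image = [[float(r * 4 + c) for c in range(4)] for r in range(4)]
features = multi_scale_block(image)
```

Each branch sees a different neighborhood of the same pixel: the 1x1 branch passes the input through unchanged, while the larger kernels mix in progressively wider context. In a trained network the branch outputs would typically be concatenated or summed before the next layer.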
DOI: https://doi.org/10.31449/inf.v49i9.9547
License
I assign to Informatica, An International Journal of Computing and Informatics ("Journal") the copyright in the manuscript identified above and any additional material (figures, tables, illustrations, software or other information intended for publication) submitted as part of or as a supplement to the manuscript ("Paper") in all forms and media throughout the world, in all languages, for the full term of copyright, effective when and if the article is accepted for publication. This transfer includes the right to reproduce and/or to distribute the Paper to other journals or digital libraries in electronic and online forms and systems.
I understand that I retain the rights to use the pre-prints, off-prints, accepted manuscript and published journal Paper for personal use, scholarly purposes and internal institutional use.
In certain cases, I can ask for retaining the publishing rights of the Paper. The Journal can permit or deny the request for publishing rights, to which I fully agree.
I declare that the submitted Paper is original, has been written by the stated authors and has not been published elsewhere nor is currently being considered for publication by any other journal and will not be submitted for such review while under review by this Journal. The Paper contains no material that violates proprietary rights of any other person or entity. I have obtained written permission from copyright owners for any excerpts from copyrighted works that are included and have credited the sources in my article. I have informed the co-author(s) of the terms of this publishing agreement.
Copyright © Slovenian Society Informatika







