Optimizing Deep Learning Model Ensembles for Plant Disease Detection through Ablation and Correlation Analysis

Nawel Ghrieb; Tahar Guerram; Othaila Chergui

doi:10.31449/inf.v50i8.10277

Abstract

Early detection of plant diseases is crucial for global food security. While Deep Learning ensemble techniques are widely adopted to improve performance, the assumption that simply aggregating models is always beneficial should be nuanced. This paper addresses this issue by conducting a rigorous analysis of an ensemble of four state-of-the-art architectures (Swin Transformer, Vision Transformer, EfficientNetV2, ConvNeXt) on the PlantDoc dataset, a benchmark known for its complexity.Our approach is twofold. First, we conduct a systematic ablation study to assess how each model contributes to the ensemble's performance. This analysis leads to the counter-intuitive finding that an optimized three-model subset (Swin, ViT, and EfficientNetV2) outperforms the full four-model ensemble. Quantitatively, the pruned ensemble achieves a Macro F1-score of 0.7503 and an accuracy of 0.7619, compared to 0.7409 and 0.7500 for the full set, respectively. Second, to explain this phenomenon, we perform a prediction correlation analysis. This reveals significant predictive redundancy, stemming from the architectural similarities between ConvNeXt and the Transformer-based models, as the cause for this sub-optimality. These findings suggest a key principle for ensemble design: the predictive complementarity of the models is a more critical factor than their individual performance or the complexity of the aggregation method-a finding reinforced by our benchmark showing that even advanced strategies like calibrated voting and stacking failed to outperform the pruned ensemble. Our work thus positions the methodological pairing of ablation study and correlation analysis as an essential and pragmatic approach to optimize the performance of ensembles in computer vision.

Author Biographies

Nawel Ghrieb, LAMIS Laboratory, Echahid Cheikh Larbi Tebessi University, Tebessa, Algeria.

Nawel Ghrieb is Associate Professor at the Department of Computer Science, Echahid Cheikh Larbi Tebessi University , Algeria. She is a researcher at the LAMIS Laboratory.

Tahar Guerram, RelaCS2Laboratory, Larbi Ben M'hidi University, Oum El Bouaghi, Algeria

Tahar Guerram is an Associate Professor at the Department of Computer Science, University of Oum El Bouaghi, Algeria. He is a researcher at the ReLaCS2 Laboratory.

Othaila Chergui, Laboratory of Signals and Smart Systems, Echahid Cheikh Larbi Tebessi University, Tebessa, Algeria.

Othaila Chergui is an Associate Professor at the Department of Computer Science, Echahid Cheikh Larbi Tebessi University, Algeria. She is a researcher at the Laboratory of Signals and Smart Systems.

References

Padol, P. B. and Yadav, A. A. (2016). SVM classifier based grape leaf disease detection. Conference on Advances in Signal Processing (CASP), Pune, India, 2016, pp. 175-179.

https://doi.org/10.1109/CASP.2016.7746160

Vaishnnave, M. P., Devi, K. S., Srinivasan, P. and Jothi, G. A. P. (2019). Detection and Classification of Groundnut Leaf Diseases using KNN classifier, IEEE International Conference on System, Computation, Automation and Networking (ICSCAN), Pondicherry, India, 2019, pp. 1-5.

https://doi.org/10.1109/ICSCAN.2019.8878733

Mohanty, S. P., Hughes, D. P. and Salathe, M. (2016). Using deep learning for image-based plant disease detection. Front. Plant Sci. https://doi.org/10.3389/fpls.2016.01419

Too, E. C., Yujian, L., Njuki, S., and Yingchun, L. (2019). A comparative study of fine-tuning deep learning models for plant disease identification. Comput. Electron. Agric. 161, pp. 272–279.

https://doi.org/10.1016/j.compag.2018.03.032

Hasan, R.I.; Yusuf, S.M.; Alzubaidi, L. (2020). Review of the state of the art of deep learning for plant diseases: A broad analysis and discussion. Plants, 9, 1302. https://doi.org/10.3390/plants9101302.

Shoaib, M., Shah, B., Ei-Sappagh, S., Ali, A., Ullah, A., Alenezi, F., Gechev, T., Hussain, T., Ali, F. (2023). An advanced deep learning models-based plant disease detection. A review of recent research. Front. Plant Sci., 14, 1158933.

https://doi.org/10.3389/fpls.2023.1158933

Akbar, M., Ullah, M., Shah, B., Khan, R.U., Hussain, T., Ali, F., Alenezi, F., Syed, I., Kwak, K.S. (2022). An effective deep learning approach for the classification of Bacteriosis in peach leave. Front. Plant Sci., 13, 1064854. https://doi.org/10.3389/fpls.2022.1064854

Zhao, Y., Sun, C., Xu, X., and Chen, J. (2022). RIC-net: A plant disease classification model based on the fusion of inception and residual structure and embedded attention mechanism, Computers and Electronics in Agriculture 193: 106644. https://doi.org/10.1016/j.compag.2021.106644

Chandra, M., Redkar, S., Roy, S., Patil, P. (2020). Classification of Various Plant Diseases Using Deep Siamese Network.

https://www.researchgate.net/publication/341322315

Iftikhar, M., Kandhro, I. A., Kausar, N., Kehar A., Uddin, M. and Dandoush A. (2024). Plant disease management: A fine-tuned enhanced CNN approach with mobile app integration for early detection and classification, Artif. Intell. Rev., vol. 57, no. 7, pp. 1-29.

https://doi.org/10.1007/s10462-024-10809-z

Ganaie, M., Hu, M., Malik, A., Tanveer, M., and Suganthan, P. (2022). Ensemble deep learning: A review. Eng. Appl. Artif. Intelligence, 115, 105151.

https://doi.org/10.1016/j.engappai.2022.105151

Li, H., Jin, Y., Zhong, J., and Zhao, R. (2021). A fruit tree disease diagnosis model based on stacking ensemble learning. Complexity, 2021, 6868592.

https://doi.org/10.1155/2021/6868592

Mathew, A., Antony, A., Mahadeshwar, Y., Khan, T., and Kulkarni, A. (2022). Plant disease detection using GLCM feature extractor and voting classification approach. Mat. Today, Proc. 58, Part 1, pp.407–415. https://doi.org/10.1016/j.matpr.2022.02.350

Palanisamy, S., and Sanjana, N. (2023). Corn leaf disease detection using genetic algorithm and weighted voting, Proceedings of the 2nd International Conference on Advancements in Electrical, Electronics, Communication, Computing and Automation (ICAECA), Coimbatore, India. (ICAECA). pp. 1-6,

https://doi.org/10.1109/ICAECA56562.2023.10200196

Chaudhary, A., Thakur, R., Kolhe, S., and Kamal, R. (2020). A particle swarm optimization-based ensemble for vegetable crop disease recognition. Comput. Electron. Agricult., 178, 105747.

https://doi.org/10.1016/j.compag.2020.105747

Sagi, O., & Rokach, L. (2018). Ensemble learning: A survey. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 8(4), e1249.

https://doi.org/10.1002/widm.1249.

Giacinto, G., and Roli, F. (2001). Dynamic classifier selection based on multiple classifier behaviour. Pattern Recognition, 34(9), pp.1879-1881. https://doi.org/10.1016/S0031-3203(00)00150-3

Tsoumakas, G., Katakis, I. (2007). Multi-label classification: An overview. International Journal of Data Warehousing and Mining (IJDWM), 3(3), pp. 1-13. https://doi.org/10.4018/jdwm.2007070101

Caruana, R., Niculescu-Mizil, A., Crew, G., Ksikes, A. (2004). Ensemble selection from libraries of models. Proceedings of the 21st International Conference on Machine Learning (ICML '04).

https://doi.org/10.1145/1015330.1015432

Astani, M., Hasheminejad, M. and Vaghefi, M. (2022). A diverse ensemble classifier for tomato disease recognition, Comput. Electron. Agricult., vol. 198, Art. no. 107054. https://doi.org/10.1016/j.compag.2022.107054

Shafik, W., Tufail, A. De Silva Liyanage, C. and R. Apong, A. A. H. M. (2024). Using transfer learning-based plant disease classification and detection for sustainable agriculture, BMC Plant Biol., vol. 24, no. 1, p. 136.

https://doi.org/10.1186/s12870-024-04825-y.

Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., Houlsby, N. (2021). An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. Proceedings of the International Conference on Learning Representations (ICLR).

https://doi.org/10.48550/arXiv.2010.11929

Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., Guo, B. (2021). Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). https://doi.org/10.1109/ICCV48922.2021.00986

Liu, Z., Mao, H., Wu, C. Y., Feichtenhofer, C., Darrell, T., Xie, S. (2022). A ConvNet for the 2020s. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

https://doi.org/10.48550/arXiv.2201.03545

Chen, J., Zhang, D. Zeb, A. and Nanehkaran, Y. A. (2021). Identification of Rice plant diseases using lightweight attention networks, Expert Syst. Appl., vol. 169, Art. no. 114514. https://doi.org/10.1016/j.eswa.2020.114514

Ouamane, A., Chouchane, A., Himeur, Y., Miniaoui, S., Atalla, S., Mansoor, W. (2025). Optimized Vision Transformers for Superior Plant Disease Detection, IEEE Access, vol. 13, pp. 48552-48570.

https://doi.org/10.1109/ACCESS.2025.3547416

Singh Thakur, P., Khanna, P., Sheorey, T., and Ojha, A. (2022). Explainable vision transformer enabled convolutional neural network for plant disease identification: Plantxvit, arXiv:2207.07919. https://doi.org/10.48550/arXiv.2207.07919.

Tabbakh, A. and Barpanda, S. S. (2023). A deep features extraction model based on the transfer learning model and vision transformer ‘TLMViT’ for plant, IEEE Access, vol. 11, pp. 45377-45392.

https://doi.org/10.1109/ACCESS.2023.3273317

Singh, D., Padgett, E., & T. S. G., A. K. (2020). PlantDoc: A Dataset for Visual Plant Disease Detection. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). https://doi.org/10.48550/arXiv.1911.10317

Lin, T. Y., Goyal, P., Girshick, R., He, K., & Dollár, P. (2017). Focal loss for dense object detection. In Proceedings of the IEEE International Conference on Computer Vision, pp. 2980-2988.

https://doi.org/10.48550/arXiv.1708.02002

Kasneci, G. and Kasneci, E. (2024). Enriching Tabular Data with Contextual LLM Embeddings: A Comprehensive Ablation Study for Ensemble Classifiers, arXiv preprint arXiv:2411.01645. https://doi.org/10.48550/arXiv.2411.01645

Grandini M., Bagli E., Visani G. (2020). Metrics for multi-class classification: an overview. arXiv preprint arXiv:2008.05756.

https://doi.org/10.48550/arXiv.2008.05756

Ramaprasad, R., Raman, S. (2022). SEMFD-Net: A Stacked Ensemble for Multiple Foliar Disease Classification. Proceedings of the 5th Joint International Conference on Data Science & Management of Data (9th ACM IKDD CODS and 27th COMAD) pp. 241-245. https://doi.org/10.1145/3493700.3493719

Menon, V.; Ashwin, V.; Deepa, R.K. (2021). Plant disease detection using CNN and transfer learning. In Proceedings of the International Conference on Communication, Control and Information Sciences (ICCISc), Idukki, India, pp. 16–18; IEEE: Piscataway, NJ, USA, 2021; Volume 1, pp. 1–6.

https://doi.org/10.1109/ICCISc52257.2021.9484957

Puangsuwan T.; Surinta, O. (2021). Enhancement of plant leaf disease classification based on snapshot ensemble convolutional neural network. ICIC Exp Lett., 15(6), pp. 669–680.

https://doi.org/10.24507/icicel.15.06.669

Moupojou, E., Tagne, A., Retraint, F., Tadonkemwa, A., Dongmo, W., Tapamo, H., Nkenlifack, M. (2023). FieldPlant: A dataset of field plant images for plant disease detection and classification with deep learning. IEEE Access, 11, pp. 35398–35410.

https://doi.org/10.1109/ACCESS.2023.3263042

Optimizing Deep Learning Model Ensembles for Plant Disease Detection through Ablation and Correlation Analysis

Abstract

Author Biographies

Nawel Ghrieb, LAMIS Laboratory, Echahid Cheikh Larbi Tebessi University, Tebessa, Algeria.

Tahar Guerram, RelaCS2Laboratory, Larbi Ben M'hidi University, Oum El Bouaghi, Algeria

Othaila Chergui, Laboratory of Signals and Smart Systems, Echahid Cheikh Larbi Tebessi University, Tebessa, Algeria.

References

Authors

DOI:

Downloads

Published

How to Cite

Issue

Section

License

Developed By

Information