Deep learning in tropical leaf disease detection: advantages and applications

Zhiye Yao; Mengxing Huang; Zhiye Yao; Mengxing Huang

doi:10.48130/tp-0024-0018

2024 Volume 3

Article Contents

Next Previous

REVIEW Open Access

Deep learning in tropical leaf disease detection: advantages and applications

Zhiye Yao,
Mengxing Huang^,

1.
School of Information and Communication Engineering, Hainan University, Haikou 570228, China

More Information

Corresponding author: huangmx09@163.com

Received: 06 January 2024
Revised: 07 April 2024
Accepted: 15 April 2024
Published online: 26 June 2024
Tropical Plants 3, Article number: e020 (2024) | Cite this article

Abstract

This paper delves into the realm of artificial intelligence, where an array of deep learning techniques has proven effective in automating crop leaf disease identification and classification. The current paper shows mature detection methodologies for apple, tomato, rice, mango, coconut, and durian leaf diseases with examples while demonstrating research on leaf disease detection in tropical plants. Through this exploration, valuable insights into the benefits and applications of detection techniques based on deep learning methods are provided for leaf disease detection. Highlighting the advantages of deep learning methods are provided for automated feature extraction and disease detection, the paper describes the salient features and challenges of the application of leaf disease detection in the tropics. In this paper, an introductory overview of a leaf disease detection model is offered and delve into the factors influencing detection accuracy and speed while proposing ways to mitigate the inherent trade-offs between these indicators. Furthermore, the challenges, such as multi-scale detection and leaf overlapping, that may occur in plants in the tropics, have been examined, enriching our understanding of deep learning-driven leaf disease detection in tropical agriculture.
- Artificial intelligence,
- Convolutional neural networks,
- Object detection,
- Application in agriculture
Rights and permissions
Copyright: © 2024 by the author(s). Published by Maximum Academic Press on behalf of Hainan University. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.

References

[1]	LeCun Y, Bengio Y, Hinton G. 2015. Deep learning. Nature 521:436−44 doi: 10.1038/nature14539 CrossRef Google Scholar
[2]	Kamilaris A, Prenafeta-Boldú FX. 2018. Deep learning in agriculture: A survey. Computers and Electronics in Agriculture 147:70−90 doi: 10.1016/j.compag.2018.02.016 CrossRef Google Scholar
[3]	Autor DH, Dorn D. 2013. The growth of low-skill service jobs and the polarization of the US labor market. American Economic Review 103:1553−97 doi: 10.1257/aer.103.5.1553 CrossRef Google Scholar
[4]	Sze V, Chen YH, Yang TJ, Emer JS. 2017. Efficient Processing of deep neural networks: a tutorial and survey. Proceedings of the IEEE 105:2295−329 doi: 10.1109/JPROC.2017.2761740 CrossRef Google Scholar
[5]	Zhang C, Bengio S, Hardt M, Recht B, Vinyals O. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64:107−15 doi: 10.1145/3446776 CrossRef Google Scholar
[6]	Sharma R, Kumar N, Sharma BB. 2022. Applications of Artificial Intelligence in Smart Agriculture: A Review. In Proc. Recent Innovations in Computing, eds. Singh PK, Singh Y, Kolekar MH, Kar AK, Gonçalves PJS. vol 832. Singapore: Springer. pp. 135-42. https://doi.org/10.1007/978-981-16-8248-3_11
[7]	Shafik W, Tufail A, Namoun A, De Silva LC, Rosyzie Anna Awg Haji Mohd Apong. 2023. A systematic literature review on plant disease detection: Motivations, classification techniques, datasets, challenges, and future trends. IEEE Access 11:59174−203 doi: 10.1109/ACCESS.2023.3284760 CrossRef Google Scholar
[8]	Hossain S, Tanzim Reza M, Chakrabarty A, Jung YJ. 2023. Aggregating different scales of attention on feature variants for tomato leaf disease diagnosis from image data: a transformer driven study. Sensors 23:3751 doi: 10.3390/s23073751 CrossRef Google Scholar
[9]	Attri I, Awasthi LK, Sharma TP, Rathee P. 2023. A review of deep learning techniques used in agriculture. Ecological Informatics 77:102217 doi: 10.1016/j.ecoinf.2023.102217 CrossRef Google Scholar
[10]	Hannun AY, Rajpurkar P, Haghpanahi M, Tison GH, Bourn C, et al. 2019. Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network. Nature Medicine 25:65−69 doi: 10.1038/s41591-018-0268-3 CrossRef Google Scholar
[11]	Sladojevic S, Arsenovic M, Anderla A, Culibrk D, Stefanovic D. 2016. Deep neural networks based recognition of plant diseases by leaf image classification. Computational Intelligence and Neuroscience 2016:3289801 doi: 10.1155/2016/3289801 CrossRef Google Scholar
[12]	Ghosal S, Blystone D, Singh AK, Ganapathysubramanian B, Singh A, et al. 2018. An explainable deep machine vision framework for plant stress phenotyping. Proceedings of the National Academy of Sciences of the United States of America 115:4613−18 doi: 10.1073/pnas.1716999115 CrossRef Google Scholar
[13]	Sharma R. 2021. Artificial Intelligence in Agriculture: A Review. Proc. 2021 5th International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, India, 6−8 May 2021. USA: IEEE. pp. 937−42. https://doi.org/10.1109/ICICCS51141.2021.9432187
[14]	Toniutti L, Breitler JC, Etienne H, Campa C, Doulbeau S, et al. 2017. Influence of environmental conditions and genetic background of Arabica coffee (C. arabica L.) on leaf rust (Hemileia vastatrix) Pathogenesis. Frontiers in Plant Science 8:2025 doi: 10.3389/fpls.2017.02025 CrossRef Google Scholar
[15]	Andersen KF, Madden LV, Paul PA. 2015. Fusarium head blight development and deoxynivalenol accumulation in wheat as influenced by post-anthesis moisture patterns. Phytopathology 105:210−19 doi: 10.1094/PHYTO-04-14-0104-R CrossRef Google Scholar
[16]	Singh BK, Delgado-Baquerizo M, Egidi E, Guirado E, Leach JE, et al. 2023. Climate change impacts on plant pathogens, food security and paths forward. Nature Reviews Microbiology 21:640−56 doi: 10.1038/s41579-023-00900-7 CrossRef Google Scholar
[17]	Liu L, Ouyang W, Wang X, Fieguth P, Chen J, et al. 2020. Deep Learning for Generic Object Detection: A Survey. International Journal of Computer Vision 128:261−318 doi: 10.1007/s11263-019-01247-4 CrossRef Google Scholar
[18]	Karim S, Zhang Y, Yin S, Bibi I, Brohi AA. 2020. A brief review and challenges of object detection in optical remote sensing imagery. Multiagent and Grid Systems 16:227−43 doi: 10.3233/MGS-200330 CrossRef Google Scholar
[19]	Arulprakash E, Aruldoss M. 2022. A study on generic object detection with emphasis on future research directions. Journal of King Saud University - Computer and Information Sciences 34:7347−65 doi: 10.1016/j.jksuci.2021.08.001 CrossRef Google Scholar
[20]	Huang J, Rathod V, Sun C, Zhu M, Korattikara A, et al. 2017. Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21−26 July 2017. USA: IEEE. pp. 3296−97. https://doi.org/10.1109/CVPR.2017.351
[21]	Tan M, Le QV. 2019. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. In Proceedings of the 36^th International Conference on Machine Learning, Long Beach, California, USA, 2019. Vol. 97. Proceedings of Machine Learning Research (PMLR). pp. 6105−14. http://proceedings.mlr.press/v97/tan19a.html
[22]	He T, Yu S, Wang Z, Li J, Chen Z. 2019. From data quality to model quality: an exploratory study on deep learning. Proceedings of the 11^th Asia-Pacific Symposium on Internetware, Fukuoka Japan, October 28−29, 2019. New York, United States: Association for Computing Machinery. https://doi.org/10.1145/3361242.3361260
[23]	Bailly A, Blanc C, Francis É, Guillotin T, Jamal F, et al. 2022. Effects of dataset size and interactions on the prediction performance of logistic regression and deep learning models. Computer Methods and Programs in Biomedicine 213:106504 doi: 10.1016/j.cmpb.2021.106504 CrossRef Google Scholar
[24]	Recht B, Roelofs R, Schmidt L, Shankar V. 2019. Do ImageNet Classifiers Generalize to ImageNet? In Proceedings of the 36^th International Conference on Machine Learning, Long Beach, California, USA, 2019. Vol. 97. Proceedings of Machine Learning Research (PMLR). pp. 5389−400. http://proceedings.mlr.press/v97/recht19a.html
[25]	Priestley M, O’donnell F, Simperl E. 2023. A survey of data quality requirements that matter in ML development pipelines. Journal of Data and Information Quality 15:11 doi: 10.1145/3592616 CrossRef Google Scholar
[26]	Garcia Arnal Barbedo J, Vieira Koenigkan L, Almeida Halfeld-Vieira B, Veras Costa R, Lima Nechet K, et al. 2018. Annotated Plant Pathology Databases for Image-Based Detection and Recognition of Diseases. IEEE Latin America Transactions 16:1749−57 doi: 10.1109/TLA.2018.8444395 CrossRef Google Scholar
[27]	Kaustubh B. 2019. Tomato leaf disease detection. www.kaggle.com/datasets/kaustubhb999/tomatoleaf
[28]	Chen L, Yuan Y. 2019. Agricultural Disease Image Dataset for Disease Identification Based on Machine Learning. Proc. Big Scientific Data Management. BigSDM 2018. Lecture Notes in Computer Science, eds. Li J, Meng X, Zhang Y, Cui W, Du Z. Cham: Springer. pp. 263−74. https://doi.org/10.1007/978-3-030-28061-1_26
[29]	Francisco AKG. 2019. Rice-Disease-DataSet. https://github.com/aldrin233/RiceDiseases-DataSet
[30]	WoAiFeiJiang. 2023. Pathological images of apple leaves. https://aistudio.baidu.com/datasetdetail/11591/0
[31]	Thapa R, Zhang K, Snavely N, Belongie S, Khan A. 2021. Plant Pathology 2021 - FGVC8. https://kaggle.com/competitions/plant-pathology-2021-fgvc8
[32]	Arun Pandian J, Geetharamani G, Huang ML, Chang YH. 2022. Tomato Disease Multiple Sources. www.kaggle.com/datasets/cookiefinder/tomato-disease-multiple-sources/
[33]	Arun Pandian J, Geetharamani G. 2019. Data for: Identification of Plant Leaf Diseases Using a 9-layer Deep Convolutional Neural Network. https://data.mendeley.com/datasets/tywbtsjrjv/1
[34]	Zhu R, Zou H, Li Z, Ni R. 2023. Apple-Net: a model based on improved YOLOv5 to detect the apple leaf diseases. Plants 12:169 doi: 10.3390/plants12010169 CrossRef Google Scholar
[35]	Wang Y, Wang Y, Zhao J. 2022. MGA-YOLO: A lightweight one-stage network for apple leaf disease detection. Frontiers in Plant Science 13:927424 doi: 10.3389/fpls.2022.927424 CrossRef Google Scholar
[36]	Li H, Shi L, Fang S, Yin F. 2023. Real-time detection of apple leaf diseases in natural scenes based on YOLOv5. Agriculture 13:878 doi: 10.3390/agriculture13040878 CrossRef Google Scholar
[37]	Xu W, Wang R. 2023. ALAD-YOLO: an lightweight and accurate detector for apple leaf diseases. Frontiers in Plant Science 14:4569 doi: 10.3389/fpls.2023.1204569 CrossRef Google Scholar
[38]	Liu S, Qiao Y, Li J, Zhang H, Zhang M, et al. 2022. An Improved Lightweight Network for Real-Time Detection of Apple Leaf Diseases in Natural Scenes. Agronomy 12:2636 doi: 10.3390/agronomy12102363 CrossRef Google Scholar
[39]	Tian L, Zhang H, Liu B, Zhang J, Duan N, et al. 2023. VMF-SSD: A novel V-space based multi-scale feature fusion SSD for apple leaf disease detection. IEEE-ACM Transactions on Computational Biology and Bioinformatics 20:2016−28 doi: 10.1109/TCBB.2022.3229114 CrossRef Google Scholar
[40]	Zhu X, Li J, Jia R, Liu B, Yao Z, et al. 2023. LAD-Net: A Novel Light Weight Model for Early Apple Leaf Pests and Diseases Classification. Ieee-Acm Transactions on Computational Biology and Bioinformatics 20:1156−69 doi: 10.1109/TCBB.2022.3191854 CrossRef Google Scholar
[41]	Shafik W, Tufail A, Liyanage CDS, Apong RAAHM. 2023. Using a novel convolutional neural network for plant pests detection and disease classification. Journal of the Science of Food and Agriculture 103:5849−61 doi: 10.1002/jsfa.12700 CrossRef Google Scholar
[42]	Gao A, Ren H, Song Y, Ren L, Zhang Y, et al. 2023. Construction and verification of machine vision algorithm model based on apple leaf disease images. Frontiers in Plant Science 14:1246065 doi: 10.3389/fpls.2023.1246065 CrossRef Google Scholar
[43]	Khan AI, Quadri SMK, Banday S, Shah JL. 2022. Deep diagnosis: A real-time apple leaf disease detection system based on deep learning. Computers and Electronics in Agriculture 198:107093 doi: 10.1016/j.compag.2022.107093 CrossRef Google Scholar
[44]	Gong X, Zhang S. 2023. A high-precision detection method of apple leaf diseases using improved faster R-CNN. Agriculture 13:240 doi: 10.3390/agriculture13020240 CrossRef Google Scholar
[45]	Jing J, Li S, Qiao C, Li K, Zhu X, et al. 2023. A tomato disease identification method based on leaf image automatic labeling algorithm and improved YOLOv5 model. Journal of the Science of Food and Agriculture 103:7070−82 doi: 10.1002/jsfa.12793 CrossRef Google Scholar
[46]	Tang Z, He X, Zhou G, Chen A, Wang Y, et al. 2023. A Precise Image-Based Tomato Leaf Disease Detection Approach Using PLPNet. Plant Phenomics 5:0042 doi: 10.34133/plantphenomics.0042 CrossRef Google Scholar
[47]	Badiger M, Mathew JA. 2023. Tomato plant leaf disease segmentation and multiclass disease detection using hybrid optimization enabled deep learning. Journal of Biotechnology 374:101−13 doi: 10.1016/j.jbiotec.2023.07.011 CrossRef Google Scholar
[48]	Zhong Y, Teng Z, Tong M. 2023. LightMixer: A novel lightweight convolutional neural network for tomato disease detection. Frontiers in Plant Science 14:1166296 doi: 10.3389/fpls.2023.1166296 CrossRef Google Scholar
[49]	Liu Y, Song Y, Ye R, Zhu S, Huang Y, et al. 2023. High-Precision Tomato Disease Detection Using NanoSegmenter Based on Transformer and Lightweighting. Plants 12:2559 doi: 10.3390/plants12132559 CrossRef Google Scholar
[50]	Elfatimi E, Eryiğit R, Elfatimi L. 2024. Deep multi-scale convolutional neural networks for automated classification of multi-class leaf diseases in tomatoes. Neural Computing and Applications 36:803−22 doi: 10.1007/s00521-023-09062-2 CrossRef Google Scholar
[51]	Mondal D, Roy K, Pal D, Kole DK. 2022. Deep learning-based approach to detect and classify signs of crop leaf diseases and pest damage. SN Computer Science 3:433 doi: 10.1007/s42979-022-01332-5 CrossRef Google Scholar
[52]	Saeed A, Abdel-Aziz AA, Mossad A, Abdelhamid MA, Alkhaled AY, Mayhoub M. 2023. Smart Detection of Tomato Leaf Diseases Using Transfer Learning-Based Convolutional Neural Networks. Agriculture-Basel 13:14 Google Scholar
[53]	Roy K, Chaudhuri SS, Frnda J, Bandopadhyay S, Ray IJ, et al. 2023. Detection of Tomato Leaf Diseases for Agro-Based Industries Using Novel PCA DeepNet. Ieee Access 11:14983−5001 doi: 10.1109/ACCESS.2023.3244499 CrossRef Google Scholar
[54]	Zhang D, Huang Y, Wu C, Ma M. 2023. Detecting tomato disease types and degrees using multi-branch and destruction learning. Computers and Electronics in Agriculture 213:108244 doi: 10.1016/j.compag.2023.108244 CrossRef Google Scholar
[55]	Pan J, Wang T, Wu Q. 2023. RiceNet: A two stage machine learning method for rice disease identification. Biosystems Engineering 225:25−40 doi: 10.1016/j.biosystemseng.2022.11.007 CrossRef Google Scholar
[56]	Daniya T, Vigneshwari S. 2023. Rider Water Wave-enabled deep learning for disease detection in rice plant. Advances in Engineering Software 182:103472 doi: 10.1016/j.advengsoft.2023.103472 CrossRef Google Scholar
[57]	Chen L, Zou J, Yuan Y, He H. 2023. Improved domain adaptive rice disease image recognition based on a novel attention mechanism. Computers and Electronics in Agriculture 208:107806 doi: 10.1016/j.compag.2023.107806 CrossRef Google Scholar
[58]	Peng J, Wang Y, Jiang P, Zhang RF, Chen HL. 2023. RiceDRA-Net: Precise Identification of Rice Leaf Diseases with Complex Backgrounds Using a Res-Attention Mechanism. Applied Sciences-Basel 13:4928 doi: 10.3390/app13084928 CrossRef Google Scholar
[59]	Yang L, Yu X, Zhang S, Long H, Zhang H, et al. 2023. GoogLeNet based on residual network and attention mechanism identification of rice leaf diseases. Computers and Electronics in Agriculture 204:107543 doi: 10.1016/j.compag.2022.107543 CrossRef Google Scholar
[60]	Wang Y, Wang H, Peng Z. 2021. Rice diseases detection and classification using attention based neural network and bayesian optimization. Expert Systems with Applications 178:114770 doi: 10.1016/j.eswa.2021.114770 CrossRef Google Scholar
[61]	Yang Y, Jiao G, Liu J, Zhao W, Zheng J. 2023. A lightweight rice disease identification network based on attention mechanism and dynamic convolution. Ecological Informatics 78:102320 doi: 10.1016/j.ecoinf.2023.102320 CrossRef Google Scholar
[62]	Patil RR, Kumar S, Chiwhane S, Rani R, Pippal SK. 2023. An Artificial-Intelligence-Based Novel Rice Grade Model for Severity Estimation of Rice Diseases. Agriculture 13:47 doi: 10.3390/agriculture13010047 CrossRef Google Scholar
[63]	Stephen A, Punitha A, Chandrasekar A. 2023. Designing self attention-based ResNet architecture for rice leaf disease classification. Neural Computing & Applications 35:6737−51 doi: 10.1007/s00521-022-07793-2 CrossRef Google Scholar
[64]	Simhadri CG, Kondaveeti HK. 2023. Automatic Recognition of Rice Leaf Diseases Using Transfer Learning. Agronomy 13:961 doi: 10.3390/agronomy13040961 CrossRef Google Scholar
[65]	Thite S, Suryawanshi Y, Patil K, Chumchu P. 2023. Coconut (Cocos nucifera) tree disease dataset: A dataset for disease detection and classification for machine learning applications. Data in Brief 51:109690 doi: 10.1016/j.dib.2023.109690 CrossRef Google Scholar
[66]	Maray M, Albraikan AA, Alotaibi SS, Alabdan R, Al Duhayyim M, et al. 2022. Artificial intelligence-enabled coconut tree disease detection and classification model for smart agriculture. Computers and Electrical Engineering 104:108399 doi: 10.1016/j.compeleceng.2022.108399 CrossRef Google Scholar
[67]	Mazzia V, Salvetti F, Chiaberge M. 2021. Efficient-CapsNet: capsule network with self-attention routing. Scientific Reports 11:14634 doi: 10.1038/s41598-021-93977-0 CrossRef Google Scholar
[68]	Subbaian S, Balasubramanian A, Marimuthu M, Chandrasekaran S, Muthusaravanan G. 2024. Detection of coconut leaf diseases using enhanced deep learning techniques. Journal of Intelligent & Fuzzy Systems 46:5033−45 doi: 10.3233/JIFS-233831 CrossRef Google Scholar
[69]	Gallenero JA, Villaverde J. 2023. Identification of Durian Leaf Disease Using Convolutional Neural Network. Proc. 2023 15^th International Conference on Computer and Automation Engineering (ICCAE), Sydney, Australia, 3-05 March, 2023. pp. 172−77. https://doi.org/10.1109/ICCAE56788.2023.10111159
[70]	Sanath Rao U, Swathi R, Sanjana V, Arpitha L, Chandrasekhar K, et al. 2021. Deep Learning Precision Farming: Grapes and Mango Leaf Disease Detection by Transfer Learning. Global Transitions Proceedings 2:535−44 doi: 10.1016/j.gltp.2021.08.002 CrossRef Google Scholar
[71]	Piriyasupakij J, Prasitphan R. 2023. Development of autonomous drones to detect diseases on plant leaves of durian trees. Proc. 2023 27^th International Computer Science and Engineering Conference (ICSEC), Samui Island, Thailand, 14−15 September 2023. USA: IEEE. pp. 258−65. https://doi.org/10.1109/ICSEC59635.2023.10329671
[72]	Li C, Adhikari R, Yao Y, Miller AG, Kalbaugh K, et al. 2020. Measuring plant growth characteristics using smartphone based image analysis technique in controlled environment agriculture. Computers and Electronics in Agriculture 168:8 doi: 10.1016/j.compag.2019.105123 CrossRef Google Scholar
[73]	Du L, Yang H, Song X, Wei N, Yu C, et al. 2022. Estimating leaf area index of maize using UAV-based digital imagery and machine learning methods. Scientific Reports 12:15937 doi: 10.1038/s41598-022-20299-0 CrossRef Google Scholar
[74]	Martinez-Guanter J, Ribeiro Á, Peteinatos GG, Pérez-Ruiz M, Gerhards R, et al. 2019. Low-Cost Three-Dimensional Modeling of Crop Plants. Sensors 19:2883 doi: 10.3390/s19132883 CrossRef Google Scholar
[75]	Maken P, Gupta A. 2023. 2D-to-3D: A Review for Computational 3D Image Reconstruction from X-ray Images. Archives of Computational Methods in Engineering 30:85−114 doi: 10.1007/s11831-022-09790-z CrossRef Google Scholar
[76]	Marchand É, Bouthemy P, Chaumette F. 2001. A 2D–3D model-based approach to real-time visual tracking. Image and Vision Computing 19:941−55 doi: 10.1016/S0262-8856(01)00054-3 CrossRef Google Scholar
[77]	Konrad J, Wang M, Ishwar P. 2012. 2D-to-3D image conversion by learning depth from examples. Proc. 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, Providence, RI, USA, 16−21 June 2012. USA: IEEE. pp. 16−22. https://doi.org/10.1109/CVPRW.2012.6238903
[78]	Gao Y, Wang M, Tao D, Ji R, Dai Q. 2012. 3-D Object Retrieval and Recognition With Hypergraph Analysis. IEEE Transactions on Image Processing 21:4290−303 doi: 10.1109/TIP.2012.2199502 CrossRef Google Scholar
[79]	Lin TY, Dollár P, Girshick R, He K, Hariharan B, Belongie S. 2017. Feature Pyramid Networks for Object Detection. Proc. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21−26 July 2017. USA: IEEE. pp. 936−44. https://doi.org/10.1109/CVPR.2017.106
[80]	Li Y, Sun S, Zhang C, Yang G, Ye Q. 2022. One-Stage Disease Detection Method for Maize Leaf Based on Multi-Scale Feature Fusion. Applied Sciences 12:7960 doi: 10.3390/app12167960 CrossRef Google Scholar
[81]	Chen J, Deng X, Wen Y, Chen W, Zeb A, et al. 2023. Weakly-supervised learning method for the recognition of potato leaf diseases. Artificial Intelligence Review 56:7985−8002 doi: 10.1007/s10462-022-10374-3 CrossRef Google Scholar
[82]	Woo S, Park J, Lee JY, Kweon IS. 2018. CBAM: Convolutional Block Attention Module. In Computer Vision – ECCV 2018. ECCV 2018. Lecture Notes in Computer Science, eds. Ferrari V, Hebert M, Sminchisescu C, Weiss Y. Cham: Springer International Publishing. pp. 3−19. https://doi.org/10.1007/978-3-030-01234-2_1
[83]	Park J, Woo S, Lee JY, Kweon IS. 2020. A simple and light-weight attention module for convolutional neural networks. International Journal of Computer Vision 128:783−98 doi: 10.1007/s11263-019-01283-0 CrossRef Google Scholar
[84]	Law H, Deng J. 2020. CornerNet: detecting objects as paired keypoints. International Journal of Computer Vision 128:642−56 doi: 10.1007/s11263-019-01204-1 CrossRef Google Scholar
[85]	Justus D, Brennan J, Bonner S, McGough AS. 2018. Predicting the Computational Cost of Deep Learning Models. Proc. 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, USA, 10−13 December 2018. pp. 3873−82. https://doi.org/10.1109/BigData.2018.8622396
[86]	Sharma V, Tripathi AK, Mittal H. 2023. DLMC-Net: Deeper lightweight multi-class classification model for plant leaf disease detection. Ecological Informatics 75:2025 doi: 10.1016/j.ecoinf.2023.102025 CrossRef Google Scholar
[87]	Li B, Jiang W, Gu J, Liu K. 2020. A Summary of convolution Neural Network Compression and Acceleration Technology. Proc. 2020 International Conference on Intelligent Computing and Human-Computer Interaction (ICHCI), Sanya, China, 4−6 December 2020. USA: IEEE. pp. 269−75. https://doi.org/10.1109/ICHCI51889.2020.00065
[88]	He Y, Zhang X, Sun J. 2017. Channel pruning for accelerating very deep neural networks. Proc. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22−29 October 2017. USA: IEEE. pp. 1398−406. https://doi.org/10.1109/ICCV.2017.155
[89]	Hinton G, Vinyals O, Dean J. 2015. Distilling the knowledge in a neural network. ArXiv In Press doi: 10.48550/arXiv.1503.02531 CrossRef Google Scholar
[90]	Keller B, Venkatesan R, Dai S, Tell SG, Zimmer B, et al. 2022. A 17–95.6 TOPS/W deep learning inference accelerator with per-vector scaled 4-bit quantization for transformers in 5nm. Proc. 2022 IEEE Symposium on VLSI Technology and Circuits (VLSI Technology and Circuits), Honolulu, HI, USA, 12-17 June 2022. USA: IEEE. pp. 16−17. https://doi.org/10.1109/VLSITechnologyandCir46769.2022.9830277
[91]	Wilson RC, Shenhav A, Straccia M, Cohen JD. 2019. The Eighty Five Percent Rule for optimal learning. Nature Communications 10:4646 doi: 10.1038/s41467-019-12552-4 CrossRef Google Scholar
[92]	Hashim N, Ali MM, mahadi MR, Abdullah AF, Wayayok A, et al. 2023. Smart farming for sustainable rice production: an insight into applications, challenges and future prospects. Rice Science 31:47−61 doi: 10.1016/j.rsci.2023.08.004 CrossRef Google Scholar
[93]	Mahlein AK. 2015. Plant disease detection by imaging sensors – parallels and specific demands for precision agriculture and plant phenotyping. Plant Disease 100:241−51 doi: 10.1094/PDIS-03-15-0340-FE CrossRef Google Scholar
[94]	Zahid A, Abbas HT, Imran MA, Qaraqe KA, Alomainy A, et al. 2019. Characterization and Water Content Estimation Method of Living Plant Leaves Using Terahertz Waves. Applied Sciences 9:2781 doi: 10.3390/app9142781 CrossRef Google Scholar
[95]	Zahid A, Abbas HT, Ren A, Zoha A, Heidari H, et al. 2019. Machine learning driven non-invasive approach of water content estimation in living plant leaves using terahertz waves. Plant Methods 15:138 doi: 10.1186/s13007-019-0522-9 CrossRef Google Scholar
[96]	Zahid A, Dashtipour K, Abbas HT, Mabrouk IB, Al-Hasan M, et al. 2022. Machine learning enabled identification and real-time prediction of living plants’ stress using terahertz waves. Defence Technology 18:1330−39 doi: 10.1016/j.dt.2022.01.003 CrossRef Google Scholar
[97]	Zhao WX, Zhou K, Li J, Tang T, Wang X, et al. 2023. A survey of large language model. ArXiv In Press doi: 10.48550/arXiv.2303.18223 CrossRef Google Scholar

About this article

Cite this article

Yao Z, Huang M. 2024. Deep learning in tropical leaf disease detection: advantages and applications. Tropical Plants 3: e020 doi: 10.48130/tp-0024-0018

Yao Z, Huang M. 2024. Deep learning in tropical leaf disease detection: advantages and applications. Tropical Plants 3: e020 doi: 10.48130/tp-0024-0018

Figures(1) / Tables(3)

Download PDF

Article Metrics

Article views(2730) PDF downloads(686)

Other Articles By Authors

on this site
- Zhiye Yao
- Mengxing Huang
on Google Scholar
- Zhiye Yao
- Mengxing Huang

HTML

Introduction

Deep learning, a subfield of machine learning, is distinguished by its computational model's capacity to acquire knowledge from abstract data using structures consisting of multiple processing units^[1]. Such models use automatic optimisation of model parameters, e.g. stochastic gradient descent, batch gradient descent, Adam optimiser, to systematically optimise the basic parameters of the data computation within the model architecture to achieve the goal of optimising or accelerating the optimization of the model parameters. The automatic optimisation of model parameters eliminates the need for manual design of parameters within the model and reduces the amount of manual work involved in model development. In addition, the trained models can be used to achieve objectives such as object detection, localization, image classification, and predictive analytics based on complex abstract data. One noteworthy advantage of contemporary machine learning methodologies is their inherent capability for automated feature extraction and learning from abstract data, thereby obviating the requirement for manual intervention in guiding the model through the processes of feature extraction and learning, as demonstrated in conventional feature engineering. In contrast, traditional machine-learning approaches necessitate the construction of relevant features by humans, contingent upon the dataset, a process commonly referred to as feature engineering. Performing the task of feature engineering is inherently complex and time-consuming, necessitating iterative human adjustments based on changes in the dataset or design requirements^[2]. Traditional feature engineering may be more advantageous when dealing with small datasets is required. But in today's environment of increasing labour costs^[3] and decreasing computer arithmetic costs^[4], as well as in practice where large neural networks based on deep learning can be better generalized this is more important^[5]. Deep learning methods may be more advantageous when dealing with large datasets or when a high degree of automation is required. As a result, the developmental costs associated with models of this kind, which is mainly labour costs, significantly increase, while their generalizability is concurrently limited.

In contrast, the incorporation and enhancement of network models which involve proficient feature extraction techniques combined with deep learning, are exemplified by Convolutional Neural Networks (CNN), You Only Look Once series (YOLO), Single Shot Multibox Detector (SSD), Residual Network (RstNet), Densely Connected Convolutional Network (DenseNet), GoogleNet, MobileNet, and Xception, have significantly enhanced the automatic feature-learning capabilities of models. These advancements enable deep-learning models to autonomously extract features of varying levels of complexity from raw data, displaying significant potential for improving the reliability of models based on leaf disease image analysis. As a result, models developed within the deep learning framework align with the processing of intricate abstract data, such as images, and it also demonstrates advantages in terms of labour costs, expenditure on productivity costs, and increased model versatility^[6].

Early detection of diseases minimizes the overuse of pesticides in disease prevention^[7]. The utilization of deep learning techniques for monitoring crop leaf diseases enable the analysis of various disease types by inputting images of diseased leaves into a model previously trained for these leaf diseases, eliminating the necessity for specialized personnel and allowing for an automated identification process, that enables rapid, timely control^[8]. When these trained models are deployed on small mobile terminals, it helps non-professional agriculturists to detect problems in time, even in completely unmanaged farmland, so that preventive measures can be implemented^[9]. Due to the good generalisation ability of the model, identification aided by the use of a trained model can provide a more reliable identification reference for experts^[10]. For example, in changing environments where some leaf diseases have no obvious symptoms or are difficult to detect, which requires plant pathologists to be well observed^[11], deep learning models can extract features that are difficult to observe with the human eye^[12]. Thereby, the application of such technologies contributes to the timely detection and prevention of plant leaf diseases, optimization of crop yields, and advancement of precision agriculture. Consequently, this enhances productivity, reduces labour costs, and strengthens sustainability^[13].

Tropical areas hold a significant position as major producers of staple food crops such as rice, corn, and millet, as well as various fruit crops including banana, mango, coconut, and durian. The region's consistently high temperatures lead to shorter pathogen incubation periods, exemplified by the reduced diurnal temperature which accelerates the latency period of pathogens like Hemileia vastatrix, causing rust epidemics in Central America^[14] Furthermore, the typically high humidity in tropical regions foster both crop growth and the survival of harmful bacteria^[15]. Consequently, tropical regions experience more frequent occurrences of plant diseases, posing a threat to food security^[16]. Thus, compared to non-tropical regions, crops in tropical areas have shorter growth cycles and are more susceptible to rapid disease outbreaks, underscoring the necessity for timely and effective plant disease prevention and control measures^[16]. Because deep learning-based leaf disease detection technology can be mounted on mobile devices and achieve real-time monitoring, this technology is more suitable for tropical areas.

The current study focuses on the application of detection technologies for plant leaf diseases and pests, exploring contemporary technological methodologies and attributes underpinning current crop leaf disease and pest detection techniques. This paper describes the technical characteristics and advantages of parallel detection using deep learning models. The development direction and challenges of deploying deep learning-based leaf disease detection technology in tropical regions have also been discussed.

The detection models and datasets in leaf disease detection

Characteristics and advantages of deep learning models in plant leaf disease detection

A review was conducted on a corpus of scholarly papers abput leaf disease detection published in the year 2023. The focus of the review includes the characteristics and advantages of techniques for detecting leaf disease using deep-learning models. The selected papers collectively encapsulated a broad spectrum of contemporary deep-learning models employed for the detection of leaf diseases. This article provided detailed insights into the characteristics of these models. Additionally, the review meticulously cataloged the salient features utilized of the model, thereby affording a understanding of the state-of-the-art methodologies employed in the domain of leaf disease detection. The models used in this study include models based on mature algorithm technology models such as YOLO, SSD, and CNN. The investigator has undertaken pertinent modifications to these infrastructures, tailoring them to optimize their efficacy for the distinct application conditions posed by the identification of pathological manifestations in foliage, including methods of identifying leaf disease using first and second-stage models.

These models exemplify superior accuracy, lightweight architecture, and adept deployment on mobile devices, rendering them well-suited for the detection of leaf diseases across diverse scenarios. Notably, enhancements in recognition accuracy are achieved through the replacement of backbone networks or the introduction of innovative modules. These improvements are usually to enhance the ability to extract features from images or the ability to fuse features after extraction, and to reduce interference as elucidated in Table 2 (No. 1, 2, 3, 5, 6, 9, 11, 12, 13, 16, 17, 18, 20, 26, 27, 30). Additionally, attention mechanisms play a pivotal role in refining recognition accuracy by focusing on key information on different channels or convolution kernels at different scales, as evidenced by the methodologies delineated in Table 2 (No. 1, 2, 3, 5, 6, 7, 9, 12, 13, 16, 17, 18, 22, 25, 26, 27, 28, 29, 31). Furthermore, after comparing the four state-of-the-art(SOTA) Vision Transformer models by Hossain et al.^[8], it is concluded that MaxViT has better recognition accuracy, which proved that using global attention is more suitable to improve the recognition accuracy of the leaf disease identification.

Techniques for refining input image quality, exemplified by the implementation of Generative Adversarial Networks (GAN)^[34,53], further contribute to the augmentation of accuracy. Finally, introducing other methods to improve the classifier can also increase the accuracy of recognition^[41,56] and optimize the training method of the model^[47].

Moreover, as part of the overarching goal to enhance recognition accuracy, the adoption of pre-trained models through transfer learning on newly curated datasets emerges as a commendable paradigm, as advocated by Saeed et al.^[52], Simhadri & Kondaveeti^[64] and Sudhesh et al.^[63]. The imperative for reduction in model size is achieved through the introduction of novel modules or the replacement of the feature extraction network, as evidenced by instances in Table 2 (No. 2, 4, 7, 9, 15, 16).

Furthermore, certain authors have made notable contributions to practical field applications, exemplified by endeavors in real-time processing, the amelioration of environmental challenges and assessment of disease severity. The two-stage methodology includes classification before detection and detection before classification, demarcating the detection process into initial leaf segmentation and subsequent leaf disease classification, as undertaken by Khan et al.^[43] and Badiger & Mathew ^[47], exhibits a proclivity towards effectively discerning multiple diseases on leaves. Pen et al.^[55] and Daniya & Vigneshwari^[56] also used a two-stage approach to solve the problem of disease identification from photos with complex background obtained in actual fields, which can overcome the interference of complex background environments to recognition to some extent. At the same time, because a new segmented disease data set is generated in the process, the problem of small samples with fewer original images is also solved.

Leaf diseases of tropical plants usually have the following characteristics: a variety of leaf diseases, rapid outbreak of leaf diseases, high frequency, and difficult to prevent. In addition, unlike in other regions, tropical plants tend to be relatively tall, with thicker foliage and a faster growth cycle. This makes disease surveillance and management of tropical plants more difficult. For some fruit crops, such as coconut, mango, lychee, and durian, it is difficult to achieve early detection and early treatment of leaf disease. Even though some crops can be reduced in height through dwarfing management, they still have wider leaves for relatively similar crops such as apples and tomatoes. This makes it difficult to use a camera to photograph the diseased leaf in its entirety up close. Therefore, new requirements are put forward for leaf disease detection technology based on deep learning models especially for real-time monitoring.

Given the difficulties in the detection of leaf diseases of tropical plants like coconut, such as the small number of leaf disease data sets, the mutual occlusion of large leaves, the influence of leaf shadows, and the interference of complex leaf backgrounds, some researchers have conducted studies. Thite et al. have published a dataset named 'Coconut (Cocos nucifera) Tree Disease Dataset', which contains five diseases: Bud Root Dropping, Bud Rot, Gray Leaf Spot, Leaf Rot, and Stem Bleeding^[65]. The images in this dataset are centered on disease locations and also include disease photos presented on tree trunks. In addition, researchers have developed a detection model for coconut tree disease. The model uses the newly developed AIE-CTDDC technology^[66]. To solve the problem of identifying coconut tree disease in the complex coconut leaf background environment, the model uses CapsNet^[67] as the feature extractor, and the data is pre-processed using MF-based enrollment removal technology before this. Similarly, to solve the problem of mutual occlusion of large coconut leaves and the impact of leaf shadows on the recognition effect, Subbaian et al. proposed a coconut leaf disease detection method based on YOLOv4^[68]. The method improved the prediction accuracy of the model through multi-scale detection, PANet, and adaptive border improvement.

In terms of the portability of detection and solving the problem that the plants are too high to observe, some researchers have proposed some methods and applications for the detection of durian leaf disease. Gallenero & Villaverde designed a portable device embedded with the Duri Premium application to identify durian leaf disease. The device was equipped with the Mobilenet-based convolutional network model, which achieved good identification accuracy^[69]. Also for portable detection, a mobile application was developed to detect the leaf diseases of mango and grape by Rao et al.^[70]. To solve the problem caused by the rapid detection and prevention of durian leaf disease, Piriyasupakij & Prastiphan designed an unmanned aircraft equipped with YOLOv5 for the detection of durian leaves on the tree and realized the effect of automatic cruise shooting and identification of durian leaf disease^[71]

From the above, it can be concluded that when identifying leaf diseases in tropical crops, researchers need to consider two aspects. On the one hand, it is to solve the impact of large blade occlusion and complex leaf surface environment around leaf disease. Another aspect is that to achieve rapid leaf disease detection, it is necessary to carry out portable design of detection equipment, such as mounted on mobile terminals, to cope with complex detection environment or to detect excessively high plant leaf disease.

The balance of speed and precision in leaf disease detection

There is a trade-off between the speed and accuracy of model inference. In general, increasing the inference speed of a model may result in decreasing the recognition accuracy of the model, and vice versa. This is because when designing a model, to improve the reasoning speed, it is often necessary to reduce the complexity and the number of parameters of the model, which may lose certain recognition accuracy. On the contrary, to improve the recognition accuracy of the model, it may be necessary to increase the complexity and the number of parameters of the model, resulting in slower inference speed.

To assess and compare the performance of the models, the model prediction results are commonly used as True Positive (TP), which refers to the number of positive samples correctly identified; False Positive (FP), which refers to the number of negative samples incorrectly identified; True Negative (TN), which refers to the number of negative samples correctly identified; and False Negative (False Negative, FP) refers to the number of negative samples that are incorrectly identified. The correspondence is shown in Table 3.

Table 3. Classification of predicted and actual results.

Actual Predicted
Positive Negative

True True positive True negative
False False positive False negative

Based on the four results of sample classification, it also sets Accuracy to indicate the samples predicted to be classified correctly among all samples; sets Precision to indicate the proportion of true-positive samples among those predicted to be positive; sets Recall to indicate the proportion of true-positive samples among those classified correctly; and sets the average precision. The corresponding formulas for Accuracy, Precision and Recall are as follows, respectively.

$ Accuracy=\dfrac{T P}{T P+F P+T N+F P} $ (1)

$ Precision=\dfrac{T P}{T P+F P}$ (2)

$ Recall=\dfrac{T P}{T P+F N}$ (3)

In the assessment of the model classification quality of each category, since the use of quasi-departure rate, checking rate, and recall rate alone cannot be considered together to assess the score, the researcher, therefore, proposes the use of Average Precision (AP) as a measure of the quality of the model classification for a certain category, i.e., integrating a function plotted on a certain category of objects with the Recall (r) as the horizontal axis and the corresponding Precision (p(r)) as a function plotted on the vertical axis for integration; use of mean Average Precision (mAP) for evaluating a model for multiple classes (n) of object classification performance evaluation metrics; F1 Score is used as an assessment of the combined consideration of check accuracy and recall, i.e., the reconciled average of check accuracy and recall. The Average Precision and F1 Score correspond to the following calculation formula:

$ AP=\underset{0}{\overset{1}{\int }}p\left(r\right)dr $ (4)

$mAP=\dfrac{1}{n}{\sum }_{i=1}^{n}A{P}_{i} $ (5)

$ {F}_{1}=\dfrac{2Precision\;\times\; Recall}{Precision\;+\; Recall}$ (6)

In improving the accuracy of recognition, there are limitations in the identification of leaf disease using two-dimensional image processing. The approaches mentioned in Table 2, which employ convolutional and deep learning networks for image feature extraction, are inherently designed for the analysis of two-dimensional images. In practical applications, foliage afflicted with diseases often exhibits characteristics such as leaf curl, damage, and instances of mutual occlusion during the acquisition of field imagery. Consequently, a nuanced examination of disease severity, based on a model learned on two-dimensional image data alone, predicated on estimating the proportion of the diseased area relative to the entire leaf surface^[62], may lead to the inadvertent^[72]. Researchers have suggested a method for creating three-dimensional reconstructions using two-dimensional images, aiming to overcome spatial limitations present in these types of pictures. The research shows that it is feasible to deduce crop height and leaf area through 3D modeling^[73]. Compared with the 2D RGB image processing method, the 3D method can accurately estimate the number of leaves, avoid the influence of mutual occlusion of leaves to a certain extent, and greatly improves the accuracy of detection^[74]. At the same time, it may also solve the problem proposed by Tang et al.^[46], that the occurrence of diseases at the edge of leaves in a complex background will interfere with the recognition. Utilizing the approach of reconstructing a three-dimensional model based on two-dimensional images still poses challenges^[75−77]. These challenges encompass the loss of depth information, compromised accuracy due to low resolution or distorted images, and difficulties in precisely capturing intricate geometric textures, particularly in complex scenes. In terms of computing cost, it cannot be ignored that three-dimensional method consumes more computing cost than two-dimensional method^[78].

In addition, to improve the quality of recognition, the solution of the multi-scale detection problems and the application of the attention mechanism have played a great help. Objects of all sizes (objects proportional to the size of the image) need to be detected, requiring the network to have the ability to recognize objects of different sizes, faced with the challenge of significantly decreasing detection accuracy for very large or very small scale targets^[36]. However, the deeper the network, the smaller the size of the feature map, which makes it difficult to detect small objects, which is a problem that cannot be avoided after the model extracts the feature map^[79]. This problem can be alleviated in the process of extracting features^[39] and in the process of feature fusion^[80], to improve the average precision of the model. The attention mechanism is a self-supervised learning method used in the natural language processing, and applied to enable the network to focus on the target region with important information by learning how much the input data contributes to the output data, while suppressing other irrelevant information and reducing the interference caused by irrelevant background on detection results^[36,81]. This method can be applied to the model to extract features of different channels, for example, CBAM^[82], BAM^[83].

The recognition speed of the evaluation target recognition model usually has the following evaluation indexes, such as inference time, inference throughput, inference frame rate, and hardware resource utilization. These evaluation indexes are also used to evaluate the reasoning speed of the model in specific application scenarios. Inference time is commonly used to evaluate the speed of the model in image recognition and classification, which means the time it takes the model to go from receiving the input image to outputting the prediction.

In speeding up the prediction speed, the one-stage target detection method has more advantages. For the one-stage recognition method based on YOLO, the anchor method should be used for frame selection first, especially for YOLOv2 to YOLOv6, which takes up computing resources. To reduce model size and prediction speed, many researchers proposed an anchor-free method, which takes key points as the core. For example, the target center of the feature map is taken as the key point to locate the target. Based on the number of key points, the free-anchor method can be divided into central-point-based method and multi-key point-based method, such as CenterNet^[84]. From a hardware aspect, different hardware selected for the prediction means different predictive speeds under the same model selection and parameters^[85].

In practical production, a single plant can exhibit concurrent occurrence of multiple diseases, with distinct characteristics of various diseases observed on the same leaf, or the same disease has different characteristics at different times^[86]. This multifaceted infection pattern can be considered a more indicative measure for assessing current crop damage levels, placing higher demands on the accuracy and generalization capacity of disease identification models.

Overall, if the leaf disease identification method is deployed in practical production, the balance between recognition accuracy and recognition speed can not be achieved only by optimizing the model or hardware. However, in terms of the evolution and development of methods for identifying leaf diseases using convolutional neural networks, the balance between accuracy and speed has to be mentioned.

There are still some ways to balance the relationship between speed and accuracy of model inference. These methods can be roughly divided into three general directions: model compression, hardware optimization, and algorithm improvement. Model compression^[87], such as channel pruning^[88] and knowledge distilling^[89], can reduce the number of parameters and complexity of the model, thereby improving the model reasoning speed and maintaining the accuracy of recognition to a certain extent^[87]. Hardware optimization can often significantly increase the speed of model inference. For example, running a model on a GPU, TPU, or professional computing device can significantly increase the speed of the model inference^[90]. There are more methods to accelerate the model inference speed and improve the inference accuracy by improving the algorithm, which are not listed in this paper. All three methods can improve the performance of the model both during training and during inference.

It is noteworthy that the lower the error rate of the training model is not equal to the better the quality of the model when training models and too low a classification error rate usually leads to overfitting problems. For example, such as a fully connected network classifier, one should not simply assume that achieving the best learning quality is synonymous with minimizing the classification error rate. Some researchers have delved into understanding the delicate balance between learning difficulty and learning speed. By utilizing a single-layer perceptron and a double-layer neural network optimized with a gradient descent learning algorithm, the average accuracy typically decreases with training time. The model attains a harmonious equilibrium between training difficulty and learning rate when the training error rate is at 15.87%^[91], resulting in an approximately 85% accuracy.

Hence, in the pursuit of a specific characteristic index for the model, it is imperative to selectively adjust and optimize the model based on the prevailing circumstances or specific requirements. Furthermore, relying on a singular index is inadequate for evaluating the overall quality of a given model.

Problems and prospects of application in tropical environment

In the realm of agriculture, the identification of crop diseases stands as a pivotal task, serving as a key to further assessing the severity of current or potential hazards. The foregoing review elucidates that the application of artificial intelligence (AI) technology in monitoring plant leaf diseases attains commendable levels of recognition accuracy and expeditious identification in the model development and testing. This methodology emerges as a proactive approach to disease identification, conferring the capacity to empower agricultural stakeholders and experts in effectually addressing extant diseases or preemptively mitigating potential threats. Moreover, through the implementation of smart agriculture methodologies, the attainment of sustainable and resilient production is conceivable, thereby mitigating environmental impact and fortifying food security^[92]. The judicious quantification of crop diseases fosters the formulation of precise protection strategies tailored to the dynamic and perpetually changing agricultural milieu. This approach facilitates the adoption of targeted disease prevention and control measures, consequently diminishing the superfluous use of pesticides. The resultant abatement in pesticide application not only serves to curtail production costs but also mitigates environmental pollution arising from pesticide usage^[93]. The application of artificial intelligence (AI) technology for identifying leaf diseases, while promising, is not without potential challenges.

In the application of tropical plant leaf disease recognition, the real-time monitoring and mobile device support characteristics based on deep learning model can greatly solve the characteristics of tropical plant leaf disease difficult to find and observe in time. To further solve the problem of insufficient computing power of mobile hardware devices or high demand for model recognition accuracy, the Master-Slave structure can be used. In this structure, the master model acts as the central node, such as the cloud platform, responsible for coordinating and controlling the operation of the whole system, while the receiver acts as the slave node, such as mobile devices, responsible for receiving and processing the instructions or data of the autonomous model, which can effectively realize the parallel processing and collaboration of tasks.

Current studies have shown that terahertz waves can be used to detect physiological and biochemical parameters in plant leaves, such as water content^[94], chlorophyll content, cell structure, and cell wall thickness, to indirectly reflect the occurrence and development of leaf diseases. Terahertz waves are electromagnetic waves between microwaves and infrared light, with frequencies ranging from 300 GHz to 3 THz. Terahertz waves have a wide range of applications in biomedicine, material science, and safety testing. Terahertz waves have strong penetration in biological materials and are also resonance absorbed by biomolecules, so they can be used to detect changes in the internal structure of plant leaves and biomolecules. It is possible to use deep learning models to analyze the signals of crop leaves fed back by terahertz waves, but the technology is still in the research stage^[95,96]

The rapid development of large language modeling in recent years has made it possible to combine large language modeling with leaf disease detection techniques^[97]. The rapid development of large language modeling in recent years has made it possible to combine large language modeling with leaf disease detection techniques. Large language models have excellent advantages in processing and analyzing literature and data, which can help researchers better understand and grasp the research progress within the field of leaf disease detection. In addition, the powerful text comprehension and text generation capabilities of the big language model can assist in the annotation and enhancement of plant petiole image data, thus improving the efficiency of model training. Similarly, the Big Language Model can assist in reasoning and summarizing the results of leaf disease detection and provide scientific references and bases.

However, it is still due to the complex and changeable climate environment such as tropical high temperatures and high humidity, especially considering the production and investment costs, it is not practical to use mobile devices equipped with identification models for detection. Hot and humid environments tend to damage electronic equipment, which increases maintenance costs after the equipment is deployed to the field.

Conclusions

The application of artificial intelligence (AI) technology in the detection and diagnosis of crop leaf diseases represents an advanced approach in precision agriculture, which particularly in machine learning and deep learning, various methods have proven effective in automating the identification and classification of crop leaf diseases. However, in practical implementation, it is imperative to carefully choose the suitable model and method for deployment based on the specific circumstances and demands. The detection and management of plant diseases in tropical areas remain a multifaceted issue. This technological application aims to swiftly and accurately evaluate the situation, thereby enabling timely interventions to mitigate the adverse impact of diseases on crop productivity.

Author contributions

The authors confirm contribution to the paper as follows: study conception and design: Huang M; manuscript preparation: Yao Z, Huang M. Both authors read and approved the final version of manuscript.

No.	Model name	Technical characteristics	Advantages.	Ref.
1	Apple-Net	The Feature Enhancement Module (FEM) and Coordinate Attention (CA) incorporation. Generative Adversarial Networks (GAN) for interference reduction.	Multi-scale information acquisition, increased diversity, and noise resistance.	[34]
2	Mobile Ghost with Attention YOLO	Ghost modules and separable convolution for reducing model size. The mobile inverted residual bottleneck convolution with Convolutional Block Attention Module (CBAM) for improving feature extraction capability.	Lightweight real-time monitoring (10.34 MB), suitable for mobile terminals.	[35]
3	BTC-YOLOv5s	Bidirectional Feature Pyramid Network (BiFPN) for a fusion of multi-scale features. Transformer attention mechanism for capturing global contextual information and establishing long-range dependencies. CBAM for interference reduction.	Reduces irrelevant information, small model size (15.8 MB).	[36]
4	AlAD-YOLO	The backbone network of TOLOv5s replaced with that of MobileNetV3.	Reduction in parameters and computational complexity during feature extraction.	[37]
5	YOLOX-ASSANano	Asymmetric ShuffleBlock for enhanceing feature fusion. Cross stage partial module with shuffle attention for interference reduction.	Processes complex natural backgrounds and lightweight model.	[38]
6	V-space-based Multi-scale Feature-fusion SSD	Multi-scale attention extremum for automatic lesion detection.	Enhances detection ability for disease lesions, especially small ones.	[39]
7	LAD-Net	Asymmetric and dilated convolution as the convolution to reduce model size. LAD-Inception designed with an attention mechanism for improving multiscale detection capabilities.	Small model size (1.25 MB), high accuracy (97.72%), and implementation of deployment on mobile devices.	[40]
8	Enhanced LSTM-CNN	Majority voting ensemble classifier replaced the classifier. Optimal LSTM layer network applied to select deep features autonomously.	Enhanced feature extraction and classifier modification.	[41]
9	LALNet	EARD module with multi-branch structure and depth separable modules extracts more feature information with fewer parameters and computational complexity. SE attention module for increase the feature extraction capability.	Small size (6.61 MB), fast execution (6.68 ms/photo), and high recognition accuracy.	[42]
10	Two-stage detection system	Three-way classification in the first stage using Xception as the base model. Real-time detection in the second stage.	Detects multiple diseases with 87.9% mean average precision.	[43]
11	Improved Faster R-CNN	Res2Net and feature pyramid network replaced the backbone of Faster R-CNN for batter feature fusion. RoIAlign instead of RoIPool of Faster R-CNN for improving the identification precision.	Extracts multi-dimensional features in natural scenes with complex backgrounds.	[44]
12	BC-YOLOv5	Modify YOLOv5 neck structure with weighted BiFPN and CBAM.	Enhanced feature extraction in the detection layer, reduced irrelevant information for complex backgrounds.	[45]
13	PLPNet	Perceptual adaptive convolution (PAC) for enhancing the network's global sensing capability. location relation attention module (LRAM) for reducing unnecessary information. SD-PFAN structure for fusing features batter.	Recognizes leaf diseases at the edge of the leaf, resist background interference.	[46]
14	DL Technique	U-net with Gradient GSO for leaf segmentation in the first stage. DbneAlexnet trained using proposed GJ-GSO for leaf classification using Gradient Jaya-Golden search optimization in the second stage.	Two-stage approach mitigating background noise. Optimized segmentation and classification through new training methods.	[47]
15	LightMixer	Depth convolution with Phish (DCWP) and light residual (LR) modules to increase feature integration and reduce parameters. Phish activation function for reducing the information loss.	Identifies diseases in complex environments, suitable for mobile deployment.	[48]
16	NanoSegmenter	Transformer structure and sparse attention mechanisms to tackle the instance segmentation task, replacing the CNN backbone. The bottleneck inversion technique to achieve model lightweighting.	High accuracy in instance segmentation, low computational complexity, and small model size.	[49]
17	DMCNN	Multi-scale convolution for disease classification from multiple channels.	Enhancement of accuracy and efficiency through multi-scale detection	[50]
18	CRNN	Combines CNN and RNN for improved sequential features extraction.	Achieves significant improvement in maximum accuracy compared to traditional CNN.	[51]
19	Transfer learning with pre-trained CNN models.	Transfer learning with Faster-RCNN and Inception ResNetv2 models.	High recognition ability on new dataset after transfer learning.	[52]
20	PCA DeepNet	Data enhancement with CycleGAN Feature extraction with PCA Classification with Faster-RCNN.	Innovative PCA method for image extraction, followed by Faster-RCNN for classification.	[53]
21	Four transformer-based models.	Comparative study on four vision transformers (EANet, MaxVit, CCT, PVT) for tomato leaf disease identification.	MaxViT architecture identified as the best for tomato leaf disease identification.	[8]
22	Fine-grained image identification framework	Utilizes OPM, DRM, AADM, and OCB for object identification, feature learning, and severity assessment.	Assess severity based on categorized dataset, captures fine-grained details with DRM.	[54]
23	RiceNet	YOLOX identifies disease sites in the first stage. Siamese Network classifies diseases in the second stage.	Effective two-stage detection, addressing complex backgrounds and limited samples.	[55]
24	RWW-NN	SetNet isolates the rice crop images. RWW algorithm (WWO & ROA), for improved classification.	Two-stage approach mitigating background noise, improved classifier performance.	[56]
25	The domain adaptation networks with novel attention mechanisms	Channel and spatial attention mechanism (CPAM) in DSAN for key feature identification.	Alleviates data distribution differences and small sample problems.	[57]
26	RiceDRA-Net	Res-Attention module based on CBAM for accurate disease identification and localization. DenseNet-121 serves as the backbone network.	Precise disease localization, even in complex backgrounds.	[58]
27	rE-GoogLeNet	ECA attention mechanism in GoogLeNe Residual networks for information loss mitigation.	Improved recognition and performance over alternatives.	[59]
28	ADSNN-BO	Enhanced self-attention mechanism employed along the entire architecture in MobileNetV1, Bayesian optimization for hyperparameter tuning.	Outperforms MobileNet with 3.6% accuracy improvement.	[60]
29	DGLNet	Global attention module (GAM) enhances sensitivity by reducing background noise. Dynamic representation module (DRM) for flexible feature acquisition.	Enhances generalization capability and feature representation in lightweight models.	[61]
30	Novel rice grade model	EfficientNet-B0 architecture as the backbone for better recognition accuracy for spotting diseases. By identifying leaf instances and disease areas, the ratio of the two areas was calculated to estimate the severity of the disease.	Reliable disease spot recognition, quantifies severity of rice disease.	[62]
31	Comparison of pre-trained residual network models	Comparison of ResNet34, ResNet50, ResNet18 with self-attention and ResNet34 with self-attention.	Models with self-attention exhibit improved recognition accuracy during transfer learning.	[63]

{{lists.name}}

Deep learning in tropical leaf disease detection: advantages and applications

Abstract