Search
2024 Volume 3
Article Contents
REVIEW   Open Access    

Deep learning in tropical leaf disease detection: advantages and applications

More Information
  • This paper delves into the realm of artificial intelligence, where an array of deep learning techniques has proven effective in automating crop leaf disease identification and classification. The current paper shows mature detection methodologies for apple, tomato, rice, mango, coconut, and durian leaf diseases with examples while demonstrating research on leaf disease detection in tropical plants. Through this exploration, valuable insights into the benefits and applications of detection techniques based on deep learning methods are provided for leaf disease detection. Highlighting the advantages of deep learning methods are provided for automated feature extraction and disease detection, the paper describes the salient features and challenges of the application of leaf disease detection in the tropics. In this paper, an introductory overview of a leaf disease detection model is offered and delve into the factors influencing detection accuracy and speed while proposing ways to mitigate the inherent trade-offs between these indicators. Furthermore, the challenges, such as multi-scale detection and leaf overlapping, that may occur in plants in the tropics, have been examined, enriching our understanding of deep learning-driven leaf disease detection in tropical agriculture.
  • 加载中
  • [1]

    LeCun Y, Bengio Y, Hinton G. 2015. Deep learning. Nature 521:436−44

    doi: 10.1038/nature14539

    CrossRef   Google Scholar

    [2]

    Kamilaris A, Prenafeta-Boldú FX. 2018. Deep learning in agriculture: A survey. Computers and Electronics in Agriculture 147:70−90

    doi: 10.1016/j.compag.2018.02.016

    CrossRef   Google Scholar

    [3]

    Autor DH, Dorn D. 2013. The growth of low-skill service jobs and the polarization of the US labor market. American Economic Review 103:1553−97

    doi: 10.1257/aer.103.5.1553

    CrossRef   Google Scholar

    [4]

    Sze V, Chen YH, Yang TJ, Emer JS. 2017. Efficient Processing of deep neural networks: a tutorial and survey. Proceedings of the IEEE 105:2295−329

    doi: 10.1109/JPROC.2017.2761740

    CrossRef   Google Scholar

    [5]

    Zhang C, Bengio S, Hardt M, Recht B, Vinyals O. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64:107−15

    doi: 10.1145/3446776

    CrossRef   Google Scholar

    [6]

    Sharma R, Kumar N, Sharma BB. 2022. Applications of Artificial Intelligence in Smart Agriculture: A Review. In Proc. Recent Innovations in Computing, eds. Singh PK, Singh Y, Kolekar MH, Kar AK, Gonçalves PJS. vol 832. Singapore: Springer. pp. 135-42. https://doi.org/10.1007/978-981-16-8248-3_11

    [7]

    Shafik W, Tufail A, Namoun A, De Silva LC, Rosyzie Anna Awg Haji Mohd Apong. 2023. A systematic literature review on plant disease detection: Motivations, classification techniques, datasets, challenges, and future trends. IEEE Access 11:59174−203

    doi: 10.1109/ACCESS.2023.3284760

    CrossRef   Google Scholar

    [8]

    Hossain S, Tanzim Reza M, Chakrabarty A, Jung YJ. 2023. Aggregating different scales of attention on feature variants for tomato leaf disease diagnosis from image data: a transformer driven study. Sensors 23:3751

    doi: 10.3390/s23073751

    CrossRef   Google Scholar

    [9]

    Attri I, Awasthi LK, Sharma TP, Rathee P. 2023. A review of deep learning techniques used in agriculture. Ecological Informatics 77:102217

    doi: 10.1016/j.ecoinf.2023.102217

    CrossRef   Google Scholar

    [10]

    Hannun AY, Rajpurkar P, Haghpanahi M, Tison GH, Bourn C, et al. 2019. Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network. Nature Medicine 25:65−69

    doi: 10.1038/s41591-018-0268-3

    CrossRef   Google Scholar

    [11]

    Sladojevic S, Arsenovic M, Anderla A, Culibrk D, Stefanovic D. 2016. Deep neural networks based recognition of plant diseases by leaf image classification. Computational Intelligence and Neuroscience 2016:3289801

    doi: 10.1155/2016/3289801

    CrossRef   Google Scholar

    [12]

    Ghosal S, Blystone D, Singh AK, Ganapathysubramanian B, Singh A, et al. 2018. An explainable deep machine vision framework for plant stress phenotyping. Proceedings of the National Academy of Sciences of the United States of America 115:4613−18

    doi: 10.1073/pnas.1716999115

    CrossRef   Google Scholar

    [13]

    Sharma R. 2021. Artificial Intelligence in Agriculture: A Review. Proc. 2021 5th International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, India, 6−8 May 2021. USA: IEEE. pp. 937−42. https://doi.org/10.1109/ICICCS51141.2021.9432187

    [14]

    Toniutti L, Breitler JC, Etienne H, Campa C, Doulbeau S, et al. 2017. Influence of environmental conditions and genetic background of Arabica coffee (C. arabica L.) on leaf rust (Hemileia vastatrix) Pathogenesis. Frontiers in Plant Science 8:2025

    doi: 10.3389/fpls.2017.02025

    CrossRef   Google Scholar

    [15]

    Andersen KF, Madden LV, Paul PA. 2015. Fusarium head blight development and deoxynivalenol accumulation in wheat as influenced by post-anthesis moisture patterns. Phytopathology 105:210−19

    doi: 10.1094/PHYTO-04-14-0104-R

    CrossRef   Google Scholar

    [16]

    Singh BK, Delgado-Baquerizo M, Egidi E, Guirado E, Leach JE, et al. 2023. Climate change impacts on plant pathogens, food security and paths forward. Nature Reviews Microbiology 21:640−56

    doi: 10.1038/s41579-023-00900-7

    CrossRef   Google Scholar

    [17]

    Liu L, Ouyang W, Wang X, Fieguth P, Chen J, et al. 2020. Deep Learning for Generic Object Detection: A Survey. International Journal of Computer Vision 128:261−318

    doi: 10.1007/s11263-019-01247-4

    CrossRef   Google Scholar

    [18]

    Karim S, Zhang Y, Yin S, Bibi I, Brohi AA. 2020. A brief review and challenges of object detection in optical remote sensing imagery. Multiagent and Grid Systems 16:227−43

    doi: 10.3233/MGS-200330

    CrossRef   Google Scholar

    [19]

    Arulprakash E, Aruldoss M. 2022. A study on generic object detection with emphasis on future research directions. Journal of King Saud University - Computer and Information Sciences 34:7347−65

    doi: 10.1016/j.jksuci.2021.08.001

    CrossRef   Google Scholar

    [20]

    Huang J, Rathod V, Sun C, Zhu M, Korattikara A, et al. 2017. Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21−26 July 2017. USA: IEEE. pp. 3296−97. https://doi.org/10.1109/CVPR.2017.351

    [21]

    Tan M, Le QV. 2019. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. In Proceedings of the 36th International Conference on Machine Learning, Long Beach, California, USA, 2019. Vol. 97. Proceedings of Machine Learning Research (PMLR). pp. 6105−14. http://proceedings.mlr.press/v97/tan19a.html

    [22]

    He T, Yu S, Wang Z, Li J, Chen Z. 2019. From data quality to model quality: an exploratory study on deep learning. Proceedings of the 11th Asia-Pacific Symposium on Internetware, Fukuoka Japan, October 28−29, 2019. New York, United States: Association for Computing Machinery. https://doi.org/10.1145/3361242.3361260

    [23]

    Bailly A, Blanc C, Francis É, Guillotin T, Jamal F, et al. 2022. Effects of dataset size and interactions on the prediction performance of logistic regression and deep learning models. Computer Methods and Programs in Biomedicine 213:106504

    doi: 10.1016/j.cmpb.2021.106504

    CrossRef   Google Scholar

    [24]

    Recht B, Roelofs R, Schmidt L, Shankar V. 2019. Do ImageNet Classifiers Generalize to ImageNet? In Proceedings of the 36th International Conference on Machine Learning, Long Beach, California, USA, 2019. Vol. 97. Proceedings of Machine Learning Research (PMLR). pp. 5389−400. http://proceedings.mlr.press/v97/recht19a.html

    [25]

    Priestley M, O’donnell F, Simperl E. 2023. A survey of data quality requirements that matter in ML development pipelines. Journal of Data and Information Quality 15:11

    doi: 10.1145/3592616

    CrossRef   Google Scholar

    [26]

    Garcia Arnal Barbedo J, Vieira Koenigkan L, Almeida Halfeld-Vieira B, Veras Costa R, Lima Nechet K, et al. 2018. Annotated Plant Pathology Databases for Image-Based Detection and Recognition of Diseases. IEEE Latin America Transactions 16:1749−57

    doi: 10.1109/TLA.2018.8444395

    CrossRef   Google Scholar

    [27]

    Kaustubh B. 2019. Tomato leaf disease detection. www.kaggle.com/datasets/kaustubhb999/tomatoleaf

    [28]

    Chen L, Yuan Y. 2019. Agricultural Disease Image Dataset for Disease Identification Based on Machine Learning. Proc. Big Scientific Data Management. BigSDM 2018. Lecture Notes in Computer Science, eds. Li J, Meng X, Zhang Y, Cui W, Du Z. Cham: Springer. pp. 263−74. https://doi.org/10.1007/978-3-030-28061-1_26

    [29]

    Francisco AKG. 2019. Rice-Disease-DataSet. https://github.com/aldrin233/RiceDiseases-DataSet

    [30]

    WoAiFeiJiang. 2023. Pathological images of apple leaves. https://aistudio.baidu.com/datasetdetail/11591/0

    [31]

    Thapa R, Zhang K, Snavely N, Belongie S, Khan A. 2021. Plant Pathology 2021 - FGVC8. https://kaggle.com/competitions/plant-pathology-2021-fgvc8

    [32]

    Arun Pandian J, Geetharamani G, Huang ML, Chang YH. 2022. Tomato Disease Multiple Sources. www.kaggle.com/datasets/cookiefinder/tomato-disease-multiple-sources/

    [33]

    Arun Pandian J, Geetharamani G. 2019. Data for: Identification of Plant Leaf Diseases Using a 9-layer Deep Convolutional Neural Network. https://data.mendeley.com/datasets/tywbtsjrjv/1

    [34]

    Zhu R, Zou H, Li Z, Ni R. 2023. Apple-Net: a model based on improved YOLOv5 to detect the apple leaf diseases. Plants 12:169

    doi: 10.3390/plants12010169

    CrossRef   Google Scholar

    [35]

    Wang Y, Wang Y, Zhao J. 2022. MGA-YOLO: A lightweight one-stage network for apple leaf disease detection. Frontiers in Plant Science 13:927424

    doi: 10.3389/fpls.2022.927424

    CrossRef   Google Scholar

    [36]

    Li H, Shi L, Fang S, Yin F. 2023. Real-time detection of apple leaf diseases in natural scenes based on YOLOv5. Agriculture 13:878

    doi: 10.3390/agriculture13040878

    CrossRef   Google Scholar

    [37]

    Xu W, Wang R. 2023. ALAD-YOLO: an lightweight and accurate detector for apple leaf diseases. Frontiers in Plant Science 14:4569

    doi: 10.3389/fpls.2023.1204569

    CrossRef   Google Scholar

    [38]

    Liu S, Qiao Y, Li J, Zhang H, Zhang M, et al. 2022. An Improved Lightweight Network for Real-Time Detection of Apple Leaf Diseases in Natural Scenes. Agronomy 12:2636

    doi: 10.3390/agronomy12102363

    CrossRef   Google Scholar

    [39]

    Tian L, Zhang H, Liu B, Zhang J, Duan N, et al. 2023. VMF-SSD: A novel V-space based multi-scale feature fusion SSD for apple leaf disease detection. IEEE-ACM Transactions on Computational Biology and Bioinformatics 20:2016−28

    doi: 10.1109/TCBB.2022.3229114

    CrossRef   Google Scholar

    [40]

    Zhu X, Li J, Jia R, Liu B, Yao Z, et al. 2023. LAD-Net: A Novel Light Weight Model for Early Apple Leaf Pests and Diseases Classification. Ieee-Acm Transactions on Computational Biology and Bioinformatics 20:1156−69

    doi: 10.1109/TCBB.2022.3191854

    CrossRef   Google Scholar

    [41]

    Shafik W, Tufail A, Liyanage CDS, Apong RAAHM. 2023. Using a novel convolutional neural network for plant pests detection and disease classification. Journal of the Science of Food and Agriculture 103:5849−61

    doi: 10.1002/jsfa.12700

    CrossRef   Google Scholar

    [42]

    Gao A, Ren H, Song Y, Ren L, Zhang Y, et al. 2023. Construction and verification of machine vision algorithm model based on apple leaf disease images. Frontiers in Plant Science 14:1246065

    doi: 10.3389/fpls.2023.1246065

    CrossRef   Google Scholar

    [43]

    Khan AI, Quadri SMK, Banday S, Shah JL. 2022. Deep diagnosis: A real-time apple leaf disease detection system based on deep learning. Computers and Electronics in Agriculture 198:107093

    doi: 10.1016/j.compag.2022.107093

    CrossRef   Google Scholar

    [44]

    Gong X, Zhang S. 2023. A high-precision detection method of apple leaf diseases using improved faster R-CNN. Agriculture 13:240

    doi: 10.3390/agriculture13020240

    CrossRef   Google Scholar

    [45]

    Jing J, Li S, Qiao C, Li K, Zhu X, et al. 2023. A tomato disease identification method based on leaf image automatic labeling algorithm and improved YOLOv5 model. Journal of the Science of Food and Agriculture 103:7070−82

    doi: 10.1002/jsfa.12793

    CrossRef   Google Scholar

    [46]

    Tang Z, He X, Zhou G, Chen A, Wang Y, et al. 2023. A Precise Image-Based Tomato Leaf Disease Detection Approach Using PLPNet. Plant Phenomics 5:0042

    doi: 10.34133/plantphenomics.0042

    CrossRef   Google Scholar

    [47]

    Badiger M, Mathew JA. 2023. Tomato plant leaf disease segmentation and multiclass disease detection using hybrid optimization enabled deep learning. Journal of Biotechnology 374:101−13

    doi: 10.1016/j.jbiotec.2023.07.011

    CrossRef   Google Scholar

    [48]

    Zhong Y, Teng Z, Tong M. 2023. LightMixer: A novel lightweight convolutional neural network for tomato disease detection. Frontiers in Plant Science 14:1166296

    doi: 10.3389/fpls.2023.1166296

    CrossRef   Google Scholar

    [49]

    Liu Y, Song Y, Ye R, Zhu S, Huang Y, et al. 2023. High-Precision Tomato Disease Detection Using NanoSegmenter Based on Transformer and Lightweighting. Plants 12:2559

    doi: 10.3390/plants12132559

    CrossRef   Google Scholar

    [50]

    Elfatimi E, Eryiğit R, Elfatimi L. 2024. Deep multi-scale convolutional neural networks for automated classification of multi-class leaf diseases in tomatoes. Neural Computing and Applications 36:803−22

    doi: 10.1007/s00521-023-09062-2

    CrossRef   Google Scholar

    [51]

    Mondal D, Roy K, Pal D, Kole DK. 2022. Deep learning-based approach to detect and classify signs of crop leaf diseases and pest damage. SN Computer Science 3:433

    doi: 10.1007/s42979-022-01332-5

    CrossRef   Google Scholar

    [52]

    Saeed A, Abdel-Aziz AA, Mossad A, Abdelhamid MA, Alkhaled AY, Mayhoub M. 2023. Smart Detection of Tomato Leaf Diseases Using Transfer Learning-Based Convolutional Neural Networks. Agriculture-Basel 13:14

    Google Scholar

    [53]

    Roy K, Chaudhuri SS, Frnda J, Bandopadhyay S, Ray IJ, et al. 2023. Detection of Tomato Leaf Diseases for Agro-Based Industries Using Novel PCA DeepNet. Ieee Access 11:14983−5001

    doi: 10.1109/ACCESS.2023.3244499

    CrossRef   Google Scholar

    [54]

    Zhang D, Huang Y, Wu C, Ma M. 2023. Detecting tomato disease types and degrees using multi-branch and destruction learning. Computers and Electronics in Agriculture 213:108244

    doi: 10.1016/j.compag.2023.108244

    CrossRef   Google Scholar

    [55]

    Pan J, Wang T, Wu Q. 2023. RiceNet: A two stage machine learning method for rice disease identification. Biosystems Engineering 225:25−40

    doi: 10.1016/j.biosystemseng.2022.11.007

    CrossRef   Google Scholar

    [56]

    Daniya T, Vigneshwari S. 2023. Rider Water Wave-enabled deep learning for disease detection in rice plant. Advances in Engineering Software 182:103472

    doi: 10.1016/j.advengsoft.2023.103472

    CrossRef   Google Scholar

    [57]

    Chen L, Zou J, Yuan Y, He H. 2023. Improved domain adaptive rice disease image recognition based on a novel attention mechanism. Computers and Electronics in Agriculture 208:107806

    doi: 10.1016/j.compag.2023.107806

    CrossRef   Google Scholar

    [58]

    Peng J, Wang Y, Jiang P, Zhang RF, Chen HL. 2023. RiceDRA-Net: Precise Identification of Rice Leaf Diseases with Complex Backgrounds Using a Res-Attention Mechanism. Applied Sciences-Basel 13:4928

    doi: 10.3390/app13084928

    CrossRef   Google Scholar

    [59]

    Yang L, Yu X, Zhang S, Long H, Zhang H, et al. 2023. GoogLeNet based on residual network and attention mechanism identification of rice leaf diseases. Computers and Electronics in Agriculture 204:107543

    doi: 10.1016/j.compag.2022.107543

    CrossRef   Google Scholar

    [60]

    Wang Y, Wang H, Peng Z. 2021. Rice diseases detection and classification using attention based neural network and bayesian optimization. Expert Systems with Applications 178:114770

    doi: 10.1016/j.eswa.2021.114770

    CrossRef   Google Scholar

    [61]

    Yang Y, Jiao G, Liu J, Zhao W, Zheng J. 2023. A lightweight rice disease identification network based on attention mechanism and dynamic convolution. Ecological Informatics 78:102320

    doi: 10.1016/j.ecoinf.2023.102320

    CrossRef   Google Scholar

    [62]

    Patil RR, Kumar S, Chiwhane S, Rani R, Pippal SK. 2023. An Artificial-Intelligence-Based Novel Rice Grade Model for Severity Estimation of Rice Diseases. Agriculture 13:47

    doi: 10.3390/agriculture13010047

    CrossRef   Google Scholar

    [63]

    Stephen A, Punitha A, Chandrasekar A. 2023. Designing self attention-based ResNet architecture for rice leaf disease classification. Neural Computing & Applications 35:6737−51

    doi: 10.1007/s00521-022-07793-2

    CrossRef   Google Scholar

    [64]

    Simhadri CG, Kondaveeti HK. 2023. Automatic Recognition of Rice Leaf Diseases Using Transfer Learning. Agronomy 13:961

    doi: 10.3390/agronomy13040961

    CrossRef   Google Scholar

    [65]

    Thite S, Suryawanshi Y, Patil K, Chumchu P. 2023. Coconut (Cocos nucifera) tree disease dataset: A dataset for disease detection and classification for machine learning applications. Data in Brief 51:109690

    doi: 10.1016/j.dib.2023.109690

    CrossRef   Google Scholar

    [66]

    Maray M, Albraikan AA, Alotaibi SS, Alabdan R, Al Duhayyim M, et al. 2022. Artificial intelligence-enabled coconut tree disease detection and classification model for smart agriculture. Computers and Electrical Engineering 104:108399

    doi: 10.1016/j.compeleceng.2022.108399

    CrossRef   Google Scholar

    [67]

    Mazzia V, Salvetti F, Chiaberge M. 2021. Efficient-CapsNet: capsule network with self-attention routing. Scientific Reports 11:14634

    doi: 10.1038/s41598-021-93977-0

    CrossRef   Google Scholar

    [68]

    Subbaian S, Balasubramanian A, Marimuthu M, Chandrasekaran S, Muthusaravanan G. 2024. Detection of coconut leaf diseases using enhanced deep learning techniques. Journal of Intelligent & Fuzzy Systems 46:5033−45

    doi: 10.3233/JIFS-233831

    CrossRef   Google Scholar

    [69]

    Gallenero JA, Villaverde J. 2023. Identification of Durian Leaf Disease Using Convolutional Neural Network. Proc. 2023 15th International Conference on Computer and Automation Engineering (ICCAE), Sydney, Australia, 3-05 March, 2023. pp. 172−77. https://doi.org/10.1109/ICCAE56788.2023.10111159

    [70]

    Sanath Rao U, Swathi R, Sanjana V, Arpitha L, Chandrasekhar K, et al. 2021. Deep Learning Precision Farming: Grapes and Mango Leaf Disease Detection by Transfer Learning. Global Transitions Proceedings 2:535−44

    doi: 10.1016/j.gltp.2021.08.002

    CrossRef   Google Scholar

    [71]

    Piriyasupakij J, Prasitphan R. 2023. Development of autonomous drones to detect diseases on plant leaves of durian trees. Proc. 2023 27th International Computer Science and Engineering Conference (ICSEC), Samui Island, Thailand, 14−15 September 2023. USA: IEEE. pp. 258−65. https://doi.org/10.1109/ICSEC59635.2023.10329671

    [72]

    Li C, Adhikari R, Yao Y, Miller AG, Kalbaugh K, et al. 2020. Measuring plant growth characteristics using smartphone based image analysis technique in controlled environment agriculture. Computers and Electronics in Agriculture 168:8

    doi: 10.1016/j.compag.2019.105123

    CrossRef   Google Scholar

    [73]

    Du L, Yang H, Song X, Wei N, Yu C, et al. 2022. Estimating leaf area index of maize using UAV-based digital imagery and machine learning methods. Scientific Reports 12:15937

    doi: 10.1038/s41598-022-20299-0

    CrossRef   Google Scholar

    [74]

    Martinez-Guanter J, Ribeiro Á, Peteinatos GG, Pérez-Ruiz M, Gerhards R, et al. 2019. Low-Cost Three-Dimensional Modeling of Crop Plants. Sensors 19:2883

    doi: 10.3390/s19132883

    CrossRef   Google Scholar

    [75]

    Maken P, Gupta A. 2023. 2D-to-3D: A Review for Computational 3D Image Reconstruction from X-ray Images. Archives of Computational Methods in Engineering 30:85−114

    doi: 10.1007/s11831-022-09790-z

    CrossRef   Google Scholar

    [76]

    Marchand É, Bouthemy P, Chaumette F. 2001. A 2D–3D model-based approach to real-time visual tracking. Image and Vision Computing 19:941−55

    doi: 10.1016/S0262-8856(01)00054-3

    CrossRef   Google Scholar

    [77]

    Konrad J, Wang M, Ishwar P. 2012. 2D-to-3D image conversion by learning depth from examples. Proc. 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, Providence, RI, USA, 16−21 June 2012. USA: IEEE. pp. 16−22. https://doi.org/10.1109/CVPRW.2012.6238903

    [78]

    Gao Y, Wang M, Tao D, Ji R, Dai Q. 2012. 3-D Object Retrieval and Recognition With Hypergraph Analysis. IEEE Transactions on Image Processing 21:4290−303

    doi: 10.1109/TIP.2012.2199502

    CrossRef   Google Scholar

    [79]

    Lin TY, Dollár P, Girshick R, He K, Hariharan B, Belongie S. 2017. Feature Pyramid Networks for Object Detection. Proc. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21−26 July 2017. USA: IEEE. pp. 936−44. https://doi.org/10.1109/CVPR.2017.106

    [80]

    Li Y, Sun S, Zhang C, Yang G, Ye Q. 2022. One-Stage Disease Detection Method for Maize Leaf Based on Multi-Scale Feature Fusion. Applied Sciences 12:7960

    doi: 10.3390/app12167960

    CrossRef   Google Scholar

    [81]

    Chen J, Deng X, Wen Y, Chen W, Zeb A, et al. 2023. Weakly-supervised learning method for the recognition of potato leaf diseases. Artificial Intelligence Review 56:7985−8002

    doi: 10.1007/s10462-022-10374-3

    CrossRef   Google Scholar

    [82]

    Woo S, Park J, Lee JY, Kweon IS. 2018. CBAM: Convolutional Block Attention Module. In Computer Vision – ECCV 2018. ECCV 2018. Lecture Notes in Computer Science, eds. Ferrari V, Hebert M, Sminchisescu C, Weiss Y. Cham: Springer International Publishing. pp. 3−19. https://doi.org/10.1007/978-3-030-01234-2_1

    [83]

    Park J, Woo S, Lee JY, Kweon IS. 2020. A simple and light-weight attention module for convolutional neural networks. International Journal of Computer Vision 128:783−98

    doi: 10.1007/s11263-019-01283-0

    CrossRef   Google Scholar

    [84]

    Law H, Deng J. 2020. CornerNet: detecting objects as paired keypoints. International Journal of Computer Vision 128:642−56

    doi: 10.1007/s11263-019-01204-1

    CrossRef   Google Scholar

    [85]

    Justus D, Brennan J, Bonner S, McGough AS. 2018. Predicting the Computational Cost of Deep Learning Models. Proc. 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, USA, 10−13 December 2018. pp. 3873−82. https://doi.org/10.1109/BigData.2018.8622396

    [86]

    Sharma V, Tripathi AK, Mittal H. 2023. DLMC-Net: Deeper lightweight multi-class classification model for plant leaf disease detection. Ecological Informatics 75:2025

    doi: 10.1016/j.ecoinf.2023.102025

    CrossRef   Google Scholar

    [87]

    Li B, Jiang W, Gu J, Liu K. 2020. A Summary of convolution Neural Network Compression and Acceleration Technology. Proc. 2020 International Conference on Intelligent Computing and Human-Computer Interaction (ICHCI), Sanya, China, 4−6 December 2020. USA: IEEE. pp. 269−75. https://doi.org/10.1109/ICHCI51889.2020.00065

    [88]

    He Y, Zhang X, Sun J. 2017. Channel pruning for accelerating very deep neural networks. Proc. 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22−29 October 2017. USA: IEEE. pp. 1398−406. https://doi.org/10.1109/ICCV.2017.155

    [89]

    Hinton G, Vinyals O, Dean J. 2015. Distilling the knowledge in a neural network. ArXiv In Press

    doi: 10.48550/arXiv.1503.02531

    CrossRef   Google Scholar

    [90]

    Keller B, Venkatesan R, Dai S, Tell SG, Zimmer B, et al. 2022. A 17–95.6 TOPS/W deep learning inference accelerator with per-vector scaled 4-bit quantization for transformers in 5nm. Proc. 2022 IEEE Symposium on VLSI Technology and Circuits (VLSI Technology and Circuits), Honolulu, HI, USA, 12-17 June 2022. USA: IEEE. pp. 16−17. https://doi.org/10.1109/VLSITechnologyandCir46769.2022.9830277

    [91]

    Wilson RC, Shenhav A, Straccia M, Cohen JD. 2019. The Eighty Five Percent Rule for optimal learning. Nature Communications 10:4646

    doi: 10.1038/s41467-019-12552-4

    CrossRef   Google Scholar

    [92]

    Hashim N, Ali MM, mahadi MR, Abdullah AF, Wayayok A, et al. 2023. Smart farming for sustainable rice production: an insight into applications, challenges and future prospects. Rice Science 31:47−61

    doi: 10.1016/j.rsci.2023.08.004

    CrossRef   Google Scholar

    [93]

    Mahlein AK. 2015. Plant disease detection by imaging sensors – parallels and specific demands for precision agriculture and plant phenotyping. Plant Disease 100:241−51

    doi: 10.1094/PDIS-03-15-0340-FE

    CrossRef   Google Scholar

    [94]

    Zahid A, Abbas HT, Imran MA, Qaraqe KA, Alomainy A, et al. 2019. Characterization and Water Content Estimation Method of Living Plant Leaves Using Terahertz Waves. Applied Sciences 9:2781

    doi: 10.3390/app9142781

    CrossRef   Google Scholar

    [95]

    Zahid A, Abbas HT, Ren A, Zoha A, Heidari H, et al. 2019. Machine learning driven non-invasive approach of water content estimation in living plant leaves using terahertz waves. Plant Methods 15:138

    doi: 10.1186/s13007-019-0522-9

    CrossRef   Google Scholar

    [96]

    Zahid A, Dashtipour K, Abbas HT, Mabrouk IB, Al-Hasan M, et al. 2022. Machine learning enabled identification and real-time prediction of living plants’ stress using terahertz waves. Defence Technology 18:1330−39

    doi: 10.1016/j.dt.2022.01.003

    CrossRef   Google Scholar

    [97]

    Zhao WX, Zhou K, Li J, Tang T, Wang X, et al. 2023. A survey of large language model. ArXiv In Press

    doi: 10.48550/arXiv.2303.18223

    CrossRef   Google Scholar

  • Cite this article

    Yao Z, Huang M. 2024. Deep learning in tropical leaf disease detection: advantages and applications. Tropical Plants 3: e020 doi: 10.48130/tp-0024-0018
    Yao Z, Huang M. 2024. Deep learning in tropical leaf disease detection: advantages and applications. Tropical Plants 3: e020 doi: 10.48130/tp-0024-0018

Figures(1)  /  Tables(3)

Article Metrics

Article views(241) PDF downloads(157)

Other Articles By Authors

REVIEW   Open Access    

Deep learning in tropical leaf disease detection: advantages and applications

Tropical Plants  3 Article number: e020  (2024)  |  Cite this article

Abstract: This paper delves into the realm of artificial intelligence, where an array of deep learning techniques has proven effective in automating crop leaf disease identification and classification. The current paper shows mature detection methodologies for apple, tomato, rice, mango, coconut, and durian leaf diseases with examples while demonstrating research on leaf disease detection in tropical plants. Through this exploration, valuable insights into the benefits and applications of detection techniques based on deep learning methods are provided for leaf disease detection. Highlighting the advantages of deep learning methods are provided for automated feature extraction and disease detection, the paper describes the salient features and challenges of the application of leaf disease detection in the tropics. In this paper, an introductory overview of a leaf disease detection model is offered and delve into the factors influencing detection accuracy and speed while proposing ways to mitigate the inherent trade-offs between these indicators. Furthermore, the challenges, such as multi-scale detection and leaf overlapping, that may occur in plants in the tropics, have been examined, enriching our understanding of deep learning-driven leaf disease detection in tropical agriculture.

    • Deep learning, a subfield of machine learning, is distinguished by its computational model's capacity to acquire knowledge from abstract data using structures consisting of multiple processing units[1]. Such models use automatic optimisation of model parameters, e.g. stochastic gradient descent, batch gradient descent, Adam optimiser, to systematically optimise the basic parameters of the data computation within the model architecture to achieve the goal of optimising or accelerating the optimization of the model parameters. The automatic optimisation of model parameters eliminates the need for manual design of parameters within the model and reduces the amount of manual work involved in model development. In addition, the trained models can be used to achieve objectives such as object detection, localization, image classification, and predictive analytics based on complex abstract data. One noteworthy advantage of contemporary machine learning methodologies is their inherent capability for automated feature extraction and learning from abstract data, thereby obviating the requirement for manual intervention in guiding the model through the processes of feature extraction and learning, as demonstrated in conventional feature engineering. In contrast, traditional machine-learning approaches necessitate the construction of relevant features by humans, contingent upon the dataset, a process commonly referred to as feature engineering. Performing the task of feature engineering is inherently complex and time-consuming, necessitating iterative human adjustments based on changes in the dataset or design requirements[2]. Traditional feature engineering may be more advantageous when dealing with small datasets is required. But in today's environment of increasing labour costs[3] and decreasing computer arithmetic costs[4], as well as in practice where large neural networks based on deep learning can be better generalized this is more important[5]. Deep learning methods may be more advantageous when dealing with large datasets or when a high degree of automation is required. As a result, the developmental costs associated with models of this kind, which is mainly labour costs, significantly increase, while their generalizability is concurrently limited.

      In contrast, the incorporation and enhancement of network models which involve proficient feature extraction techniques combined with deep learning, are exemplified by Convolutional Neural Networks (CNN), You Only Look Once series (YOLO), Single Shot Multibox Detector (SSD), Residual Network (RstNet), Densely Connected Convolutional Network (DenseNet), GoogleNet, MobileNet, and Xception, have significantly enhanced the automatic feature-learning capabilities of models. These advancements enable deep-learning models to autonomously extract features of varying levels of complexity from raw data, displaying significant potential for improving the reliability of models based on leaf disease image analysis. As a result, models developed within the deep learning framework align with the processing of intricate abstract data, such as images, and it also demonstrates advantages in terms of labour costs, expenditure on productivity costs, and increased model versatility[6].

      Early detection of diseases minimizes the overuse of pesticides in disease prevention[7]. The utilization of deep learning techniques for monitoring crop leaf diseases enable the analysis of various disease types by inputting images of diseased leaves into a model previously trained for these leaf diseases, eliminating the necessity for specialized personnel and allowing for an automated identification process, that enables rapid, timely control[8]. When these trained models are deployed on small mobile terminals, it helps non-professional agriculturists to detect problems in time, even in completely unmanaged farmland, so that preventive measures can be implemented[9]. Due to the good generalisation ability of the model, identification aided by the use of a trained model can provide a more reliable identification reference for experts[10]. For example, in changing environments where some leaf diseases have no obvious symptoms or are difficult to detect, which requires plant pathologists to be well observed[11], deep learning models can extract features that are difficult to observe with the human eye[12]. Thereby, the application of such technologies contributes to the timely detection and prevention of plant leaf diseases, optimization of crop yields, and advancement of precision agriculture. Consequently, this enhances productivity, reduces labour costs, and strengthens sustainability[13].

      Tropical areas hold a significant position as major producers of staple food crops such as rice, corn, and millet, as well as various fruit crops including banana, mango, coconut, and durian. The region's consistently high temperatures lead to shorter pathogen incubation periods, exemplified by the reduced diurnal temperature which accelerates the latency period of pathogens like Hemileia vastatrix, causing rust epidemics in Central America[14] Furthermore, the typically high humidity in tropical regions foster both crop growth and the survival of harmful bacteria[15]. Consequently, tropical regions experience more frequent occurrences of plant diseases, posing a threat to food security[16]. Thus, compared to non-tropical regions, crops in tropical areas have shorter growth cycles and are more susceptible to rapid disease outbreaks, underscoring the necessity for timely and effective plant disease prevention and control measures[16]. Because deep learning-based leaf disease detection technology can be mounted on mobile devices and achieve real-time monitoring, this technology is more suitable for tropical areas.

      The current study focuses on the application of detection technologies for plant leaf diseases and pests, exploring contemporary technological methodologies and attributes underpinning current crop leaf disease and pest detection techniques. This paper describes the technical characteristics and advantages of parallel detection using deep learning models. The development direction and challenges of deploying deep learning-based leaf disease detection technology in tropical regions have also been discussed.

    • Leaf disease detection can be regarded as the detection of leaf disease objects, which is considered as a computer vision task, involving two key targets: object recognition (classification) and object localization. The process of building object detection can be divided into three main parts: data set selection or construction, model selection and training, model evaluation and deployment.

      Choosing the right architecture for the detection model depends primarily on the required level of accuracy, inference speed, and available computational resources. The detection methods are distinguished by whether object localization and classification are performed in a single stage[17]. The one-stage methods directly predict bounding boxes and class probabilities for all objects in a single pass without region proposal[18], such as YOLO series, SSD, CornetNet, et al. However, the tow-stage methods are much more complicated, like CNN, RCNN. The first stage usually proposes potential regions of interest, and the second stage refines these proposals by classifying and adjusting the bounding boxes. This stage difference in practical application leads to the two models having different emphasis on prediction speed and prediction accuracy[19], which the one-stage method emphasizes the prediction speed, correspondingly, and the other emphasizes the prediction accuracy. Popular CNN architecture used as the backbone includes AlexNet and its variants like VGGNet, GoogLeNet, Inception series, ResNet and its variants like ResNet50 and ResNet101, DenseNet, MobileNet.

      Currently, based on the literature summarized in Tables 1 & 2, the authors' models have consistently achieved recognition accuracy exceeding 90% in validation or prediction applications. A 90% recognition accuracy in target recognition models is often seen as high, but its adequacy depends on the specific application and requirements. Thus, there isn't a universal benchmark, and performance evaluation should align with specific tasks and applications[20]. Some models even exhibit recognition accuracy surpassing 99% in test sets. In the realm of laboratory development and certain outdoor applications, achieving high accuracy with simple models is still a prominent challenge[21]. Furthermore, due to the diverse hardware platforms employed by different authors and the specific purposes of their applications, direct comparisons of reasoning speed and computing resource requirements among various models pose challenges. Nevertheless, a delicate balance is required between recognition accuracy, model size, and model inference speed. This becomes particularly critical when contemplating the deployment of models on mobile terminals for practical applications. Achieving this balance represents a formidable challenge in the field.

      Table 1.  Plant disease open data sets.

      Data set nameCropBrief descriptionRef.
      Image Database of Plant Disease Symptoms21 plant speciesIn October of 2016, this database called PDDB, had 2,326 images of 171 diseases and other disorders affecting 21 plant species including soybean, citrus, coconut tree, dry bean, cassava, passion fruit, corn, coffee, cashew tree, grapevine, oil palm, wheat, sugarcane, cotton, black pepper, cabbage, melon, rice, pineapple, papaya, cupuacu.[26]
      Tomato leaf disease detectionTomatoA tomato leaf disease dataset includes tomato mosaic virus, tomato yellow leaf curl virus, late blight, leaf mold, early blight, septoria leaf spot, Spider mites Two-spotted spider mite, healthy tomato.[27]
      Agricultural Disease Image Database for Agricultural Diseases and Pests ResearchRice and wheat, fruits
      and vegetables, etc.
      The dataset currently has about 15,000 high-quality agricultural disease images, including field crops such as rice and wheat, fruits and vegetables such as cucumber and grape, etc.[28]
      Rice disease datasetRiceA rice leaf disease dataset includes bacterial leaf blight, blast and brown spot.[29]
      Pathological images of apple leavesAppleThe apple leaf disease image dataset contains 8 common apple leaf diseases: Mosaic, rust, gray spot, mottle leaf disease, brown spot, black star disease, black rot, healthy leaf disease.[30]
      Plant Pathology 2021 - FGVC8Apple2021-FGVC8 contains approximately 23,000 high-quality RGB images of apple foliar diseases, including a large expert-annotated disease dataset.[31]
      Tomato Disease Multiple SourcesTomatoOver 20 k images of tomato leaves with 10 diseases and one healthy class. Images are collected from both lab scenes and in-the-wild scenes, which includes late blight, healthy, early blight, septoria leaf spot, tomato yellow leaf curl virus, bacterial spot, target spot, tomato mosaic virus, leaf mold, spider mites Two-spotted spider mite, powdery mildew.[32]
      Data for: Identification of Plant Leaf Diseases Using a 9-layer Deep Convolutional Neural Network12 plant speciesThe dataset includes 39 different classes of plant leaf and background images are available. The data-set contain 61,486 images.[33]

      Table 2.  Leaf disease detection technology and corresponding advantages.

      No.Model nameTechnical characteristicsAdvantages.Ref.
      1Apple-NetThe Feature Enhancement Module (FEM) and Coordinate Attention (CA) incorporation.
      Generative Adversarial Networks (GAN) for interference reduction.
      Multi-scale information acquisition, increased diversity, and noise resistance.[34]
      2Mobile Ghost with Attention YOLOGhost modules and separable convolution for reducing model size.
      The mobile inverted residual bottleneck convolution with Convolutional Block Attention Module (CBAM) for improving feature extraction capability.
      Lightweight real-time monitoring (10.34 MB), suitable for mobile terminals.[35]
      3BTC-YOLOv5sBidirectional Feature Pyramid Network (BiFPN) for a fusion of multi-scale features.
      Transformer attention mechanism for capturing global contextual information and establishing long-range dependencies.
      CBAM for interference reduction.
      Reduces irrelevant information, small model size (15.8 MB).[36]
      4AlAD-YOLOThe backbone network of TOLOv5s replaced with that of MobileNetV3.Reduction in parameters and computational complexity during feature extraction.[37]
      5YOLOX-ASSANanoAsymmetric ShuffleBlock for enhanceing feature fusion.
      Cross stage partial module with shuffle attention for interference reduction.
      Processes complex natural backgrounds and lightweight model.[38]
      6V-space-based Multi-scale Feature-fusion SSDMulti-scale attention extremum for automatic lesion detection.Enhances detection ability for disease lesions, especially small ones.[39]
      7LAD-NetAsymmetric and dilated convolution as the convolution to reduce model size.
      LAD-Inception designed with an attention mechanism for improving multiscale detection capabilities.
      Small model size (1.25 MB), high accuracy (97.72%), and implementation of deployment on mobile devices.[40]
      8Enhanced LSTM-CNNMajority voting ensemble classifier replaced the classifier.
      Optimal LSTM layer network applied to select deep features autonomously.
      Enhanced feature extraction and classifier modification.[41]
      9LALNetEARD module with multi-branch structure and depth separable modules extracts more feature information with fewer parameters and computational complexity. SE attention module for increase the feature extraction capability.Small size (6.61 MB), fast execution (6.68 ms/photo), and high recognition accuracy.[42]
      10Two-stage detection systemThree-way classification in the first stage using Xception as the base model.
      Real-time detection in the second stage.
      Detects multiple diseases with 87.9% mean average precision.[43]
      11Improved Faster R-CNNRes2Net and feature pyramid network replaced the backbone of Faster R-CNN for batter feature fusion.
      RoIAlign instead of RoIPool of Faster R-CNN for improving the identification precision.
      Extracts multi-dimensional features in natural scenes with complex backgrounds.[44]
      12BC-YOLOv5Modify YOLOv5 neck structure with weighted BiFPN and CBAM.Enhanced feature extraction in the detection layer, reduced irrelevant information for complex backgrounds.[45]
      13PLPNetPerceptual adaptive convolution (PAC) for enhancing the
      network's global sensing capability.
      location relation attention module (LRAM) for reducing unnecessary information.
      SD-PFAN structure for fusing features batter.
      Recognizes leaf diseases at the edge of the leaf, resist background interference.[46]
      14DL TechniqueU-net with Gradient GSO for leaf segmentation in the first stage.
      DbneAlexnet trained using proposed GJ-GSO for leaf classification using Gradient Jaya-Golden search optimization in the second stage.
      Two-stage approach mitigating background noise. Optimized segmentation and classification through new training methods.[47]
      15LightMixerDepth convolution with Phish (DCWP) and light residual (LR) modules to increase feature integration and reduce parameters.
      Phish activation function for reducing the information loss.
      Identifies diseases in complex environments, suitable for mobile deployment.[48]
      16NanoSegmenterTransformer structure and sparse attention mechanisms to tackle the instance segmentation task, replacing the CNN backbone.
      The bottleneck inversion technique to achieve model lightweighting.
      High accuracy in instance segmentation, low computational complexity, and small model size.[49]
      17DMCNNMulti-scale convolution for disease classification from multiple channels.Enhancement of accuracy and efficiency through multi-scale detection[50]
      18CRNNCombines CNN and RNN for improved sequential features extraction.Achieves significant improvement in maximum accuracy compared to traditional CNN.[51]
      19Transfer learning with pre-trained CNN models.Transfer learning with Faster-RCNN and Inception ResNetv2 models.High recognition ability on new dataset after transfer learning.[52]
      20PCA DeepNetData enhancement with CycleGAN
      Feature extraction with PCA
      Classification with Faster-RCNN.
      Innovative PCA method for image extraction, followed by Faster-RCNN for classification.[53]
      21Four transformer-based models.Comparative study on four vision transformers (EANet, MaxVit, CCT, PVT) for tomato leaf disease identification.MaxViT architecture identified as the best for tomato leaf disease identification.[8]
      22Fine-grained image identification frameworkUtilizes OPM, DRM, AADM, and OCB for object identification, feature learning, and severity assessment.Assess severity based on categorized dataset, captures fine-grained details with DRM.[54]
      23RiceNetYOLOX identifies disease sites in the first stage.
      Siamese Network classifies diseases in the second stage.
      Effective two-stage detection, addressing complex backgrounds and limited samples.[55]
      24RWW-NNSetNet isolates the rice crop images.
      RWW algorithm (WWO & ROA), for improved classification.
      Two-stage approach mitigating background noise, improved classifier performance.[56]
      25The domain adaptation networks with novel attention mechanismsChannel and spatial attention mechanism (CPAM) in DSAN for key feature identification.Alleviates data distribution differences and small sample problems.[57]
      26RiceDRA-NetRes-Attention module based on CBAM for accurate disease identification and localization.
      DenseNet-121 serves as the backbone network.
      Precise disease localization, even in complex backgrounds.[58]
      27rE-GoogLeNetECA attention mechanism in GoogLeNe
      Residual networks for information loss mitigation.
      Improved recognition and performance over alternatives.[59]
      28ADSNN-BOEnhanced self-attention mechanism employed along the entire architecture in MobileNetV1,
      Bayesian optimization for hyperparameter tuning.
      Outperforms MobileNet with 3.6% accuracy improvement.[60]
      29DGLNetGlobal attention module (GAM) enhances sensitivity by reducing background noise.
      Dynamic representation module (DRM) for flexible feature acquisition.
      Enhances generalization capability and feature representation in lightweight models.[61]
      30Novel rice grade modelEfficientNet-B0 architecture as the backbone for better recognition accuracy for spotting diseases.
      By identifying leaf instances and disease areas, the ratio of the two areas was calculated to estimate the severity of the disease.
      Reliable disease spot recognition, quantifies severity of rice disease.[62]
      31Comparison of pre-trained residual network modelsComparison of ResNet34, ResNet50, ResNet18 with self-attention and ResNet34 with self-attention.Models with self-attention exhibit improved recognition accuracy during transfer learning.[63]

      The model's ability to recognize is strongly tied to the quality and specifics of the dataset used for training, validation, and testing. For instance, factors such as the accuracy of object labels, the number of images and labels in the data set, balanced data distribution, and the use of data enhancement methods impact the ultimate prediction accuracy and the model's ability to generalize. He et al.[22] suggested that imbalanced data could compromise model classification accuracy. The size of the dataset is commonly regarded as significant which is the number of images and labels in the data set, yet it is not definitive[23]. While expanding the dataset size may enhance the model's generalization capability and performance to some degree, it does not imply that a larger dataset always leads to better outcomes or more accurate identification under identical conditions[24]. Errors in data labels can significantly impact the accuracy of model testing. These views are further extended and proposed by Priestley et al.[25] and they pointed out differences in data management practices between academia and industry, underlining that data quality should align with diverse user needs. They also proposed that the availability of data can be supported by an infrastructure for data collection and management, particularly in large organizations. In academia, data sets are typically categorized as published or non-public, posing challenges in objectively assessing the quality of datasets used in some researcher-proposed models.

      In practice, an important consideration influencing model performance is whether images in the dataset capture natural environments, encompassing both indoor and outdoor scenes. For example, as shown in Fig. 1ac, are all indoor, d is outdoor. Natural environment photos tend to have complex backgrounds, demanding models with stronger anti-interference capabilities against intricate scenarios. As researchers develop models, it is still suggested that if the researchers want to develop a model for the identification of a leaf disease, it is a very necessary process to collect relevant data and label it. The research focus of tropical leaf disease detection is still based on effective data collection. Based on datasets from Tables 1 & 2 this article organizes the published datasets, including referenced ones, for further reference.

      Figure 1. 

      Photographic examples of a plant disease dataset. (a) Balck rot; (b) Northern leaf blight; (c) Isariopsis leaf spot; (d) Brown spot.

    • A review was conducted on a corpus of scholarly papers abput leaf disease detection published in the year 2023. The focus of the review includes the characteristics and advantages of techniques for detecting leaf disease using deep-learning models. The selected papers collectively encapsulated a broad spectrum of contemporary deep-learning models employed for the detection of leaf diseases. This article provided detailed insights into the characteristics of these models. Additionally, the review meticulously cataloged the salient features utilized of the model, thereby affording a understanding of the state-of-the-art methodologies employed in the domain of leaf disease detection. The models used in this study include models based on mature algorithm technology models such as YOLO, SSD, and CNN. The investigator has undertaken pertinent modifications to these infrastructures, tailoring them to optimize their efficacy for the distinct application conditions posed by the identification of pathological manifestations in foliage, including methods of identifying leaf disease using first and second-stage models.

      These models exemplify superior accuracy, lightweight architecture, and adept deployment on mobile devices, rendering them well-suited for the detection of leaf diseases across diverse scenarios. Notably, enhancements in recognition accuracy are achieved through the replacement of backbone networks or the introduction of innovative modules. These improvements are usually to enhance the ability to extract features from images or the ability to fuse features after extraction, and to reduce interference as elucidated in Table 2 (No. 1, 2, 3, 5, 6, 9, 11, 12, 13, 16, 17, 18, 20, 26, 27, 30). Additionally, attention mechanisms play a pivotal role in refining recognition accuracy by focusing on key information on different channels or convolution kernels at different scales, as evidenced by the methodologies delineated in Table 2 (No. 1, 2, 3, 5, 6, 7, 9, 12, 13, 16, 17, 18, 22, 25, 26, 27, 28, 29, 31). Furthermore, after comparing the four state-of-the-art(SOTA) Vision Transformer models by Hossain et al.[8], it is concluded that MaxViT has better recognition accuracy, which proved that using global attention is more suitable to improve the recognition accuracy of the leaf disease identification.

      Techniques for refining input image quality, exemplified by the implementation of Generative Adversarial Networks (GAN)[34,53], further contribute to the augmentation of accuracy. Finally, introducing other methods to improve the classifier can also increase the accuracy of recognition[41,56] and optimize the training method of the model[47].

      Moreover, as part of the overarching goal to enhance recognition accuracy, the adoption of pre-trained models through transfer learning on newly curated datasets emerges as a commendable paradigm, as advocated by Saeed et al.[52], Simhadri & Kondaveeti[64] and Sudhesh et al.[63]. The imperative for reduction in model size is achieved through the introduction of novel modules or the replacement of the feature extraction network, as evidenced by instances in Table 2 (No. 2, 4, 7, 9, 15, 16).

      Furthermore, certain authors have made notable contributions to practical field applications, exemplified by endeavors in real-time processing, the amelioration of environmental challenges and assessment of disease severity. The two-stage methodology includes classification before detection and detection before classification, demarcating the detection process into initial leaf segmentation and subsequent leaf disease classification, as undertaken by Khan et al.[43] and Badiger & Mathew [47], exhibits a proclivity towards effectively discerning multiple diseases on leaves. Pen et al.[55] and Daniya & Vigneshwari[56] also used a two-stage approach to solve the problem of disease identification from photos with complex background obtained in actual fields, which can overcome the interference of complex background environments to recognition to some extent. At the same time, because a new segmented disease data set is generated in the process, the problem of small samples with fewer original images is also solved.

      Leaf diseases of tropical plants usually have the following characteristics: a variety of leaf diseases, rapid outbreak of leaf diseases, high frequency, and difficult to prevent. In addition, unlike in other regions, tropical plants tend to be relatively tall, with thicker foliage and a faster growth cycle. This makes disease surveillance and management of tropical plants more difficult. For some fruit crops, such as coconut, mango, lychee, and durian, it is difficult to achieve early detection and early treatment of leaf disease. Even though some crops can be reduced in height through dwarfing management, they still have wider leaves for relatively similar crops such as apples and tomatoes. This makes it difficult to use a camera to photograph the diseased leaf in its entirety up close. Therefore, new requirements are put forward for leaf disease detection technology based on deep learning models especially for real-time monitoring.

      Given the difficulties in the detection of leaf diseases of tropical plants like coconut, such as the small number of leaf disease data sets, the mutual occlusion of large leaves, the influence of leaf shadows, and the interference of complex leaf backgrounds, some researchers have conducted studies. Thite et al. have published a dataset named 'Coconut (Cocos nucifera) Tree Disease Dataset', which contains five diseases: Bud Root Dropping, Bud Rot, Gray Leaf Spot, Leaf Rot, and Stem Bleeding[65]. The images in this dataset are centered on disease locations and also include disease photos presented on tree trunks. In addition, researchers have developed a detection model for coconut tree disease. The model uses the newly developed AIE-CTDDC technology[66]. To solve the problem of identifying coconut tree disease in the complex coconut leaf background environment, the model uses CapsNet[67] as the feature extractor, and the data is pre-processed using MF-based enrollment removal technology before this. Similarly, to solve the problem of mutual occlusion of large coconut leaves and the impact of leaf shadows on the recognition effect, Subbaian et al. proposed a coconut leaf disease detection method based on YOLOv4[68]. The method improved the prediction accuracy of the model through multi-scale detection, PANet, and adaptive border improvement.

      In terms of the portability of detection and solving the problem that the plants are too high to observe, some researchers have proposed some methods and applications for the detection of durian leaf disease. Gallenero & Villaverde designed a portable device embedded with the Duri Premium application to identify durian leaf disease. The device was equipped with the Mobilenet-based convolutional network model, which achieved good identification accuracy[69]. Also for portable detection, a mobile application was developed to detect the leaf diseases of mango and grape by Rao et al.[70]. To solve the problem caused by the rapid detection and prevention of durian leaf disease, Piriyasupakij & Prastiphan designed an unmanned aircraft equipped with YOLOv5 for the detection of durian leaves on the tree and realized the effect of automatic cruise shooting and identification of durian leaf disease[71]

      From the above, it can be concluded that when identifying leaf diseases in tropical crops, researchers need to consider two aspects. On the one hand, it is to solve the impact of large blade occlusion and complex leaf surface environment around leaf disease. Another aspect is that to achieve rapid leaf disease detection, it is necessary to carry out portable design of detection equipment, such as mounted on mobile terminals, to cope with complex detection environment or to detect excessively high plant leaf disease.

    • There is a trade-off between the speed and accuracy of model inference. In general, increasing the inference speed of a model may result in decreasing the recognition accuracy of the model, and vice versa. This is because when designing a model, to improve the reasoning speed, it is often necessary to reduce the complexity and the number of parameters of the model, which may lose certain recognition accuracy. On the contrary, to improve the recognition accuracy of the model, it may be necessary to increase the complexity and the number of parameters of the model, resulting in slower inference speed.

      To assess and compare the performance of the models, the model prediction results are commonly used as True Positive (TP), which refers to the number of positive samples correctly identified; False Positive (FP), which refers to the number of negative samples incorrectly identified; True Negative (TN), which refers to the number of negative samples correctly identified; and False Negative (False Negative, FP) refers to the number of negative samples that are incorrectly identified. The correspondence is shown in Table 3.

      Table 3.  Classification of predicted and actual results.

      ActualPredicted
      PositiveNegative
      TrueTrue positiveTrue negative
      FalseFalse positiveFalse negative

      Based on the four results of sample classification, it also sets Accuracy to indicate the samples predicted to be classified correctly among all samples; sets Precision to indicate the proportion of true-positive samples among those predicted to be positive; sets Recall to indicate the proportion of true-positive samples among those classified correctly; and sets the average precision. The corresponding formulas for Accuracy, Precision and Recall are as follows, respectively.

      $ Accuracy=\dfrac{T P}{T P+F P+T N+F P} $ (1)
      $ Precision=\dfrac{T P}{T P+F P}$ (2)
      $ Recall=\dfrac{T P}{T P+F N}$ (3)

      In the assessment of the model classification quality of each category, since the use of quasi-departure rate, checking rate, and recall rate alone cannot be considered together to assess the score, the researcher, therefore, proposes the use of Average Precision (AP) as a measure of the quality of the model classification for a certain category, i.e., integrating a function plotted on a certain category of objects with the Recall (r) as the horizontal axis and the corresponding Precision (p(r)) as a function plotted on the vertical axis for integration; use of mean Average Precision (mAP) for evaluating a model for multiple classes (n) of object classification performance evaluation metrics; F1 Score is used as an assessment of the combined consideration of check accuracy and recall, i.e., the reconciled average of check accuracy and recall. The Average Precision and F1 Score correspond to the following calculation formula:

      $ AP=\underset{0}{\overset{1}{\int }}p\left(r\right)dr $ (4)
      $mAP=\dfrac{1}{n}{\sum }_{i=1}^{n}A{P}_{i} $ (5)
      $ {F}_{1}=\dfrac{2Precision\;\times\; Recall}{Precision\;+\; Recall}$ (6)

      In improving the accuracy of recognition, there are limitations in the identification of leaf disease using two-dimensional image processing. The approaches mentioned in Table 2, which employ convolutional and deep learning networks for image feature extraction, are inherently designed for the analysis of two-dimensional images. In practical applications, foliage afflicted with diseases often exhibits characteristics such as leaf curl, damage, and instances of mutual occlusion during the acquisition of field imagery. Consequently, a nuanced examination of disease severity, based on a model learned on two-dimensional image data alone, predicated on estimating the proportion of the diseased area relative to the entire leaf surface[62], may lead to the inadvertent[72]. Researchers have suggested a method for creating three-dimensional reconstructions using two-dimensional images, aiming to overcome spatial limitations present in these types of pictures. The research shows that it is feasible to deduce crop height and leaf area through 3D modeling[73]. Compared with the 2D RGB image processing method, the 3D method can accurately estimate the number of leaves, avoid the influence of mutual occlusion of leaves to a certain extent, and greatly improves the accuracy of detection[74]. At the same time, it may also solve the problem proposed by Tang et al.[46], that the occurrence of diseases at the edge of leaves in a complex background will interfere with the recognition. Utilizing the approach of reconstructing a three-dimensional model based on two-dimensional images still poses challenges[7577]. These challenges encompass the loss of depth information, compromised accuracy due to low resolution or distorted images, and difficulties in precisely capturing intricate geometric textures, particularly in complex scenes. In terms of computing cost, it cannot be ignored that three-dimensional method consumes more computing cost than two-dimensional method[78].

      In addition, to improve the quality of recognition, the solution of the multi-scale detection problems and the application of the attention mechanism have played a great help. Objects of all sizes (objects proportional to the size of the image) need to be detected, requiring the network to have the ability to recognize objects of different sizes, faced with the challenge of significantly decreasing detection accuracy for very large or very small scale targets[36]. However, the deeper the network, the smaller the size of the feature map, which makes it difficult to detect small objects, which is a problem that cannot be avoided after the model extracts the feature map[79]. This problem can be alleviated in the process of extracting features[39] and in the process of feature fusion[80], to improve the average precision of the model. The attention mechanism is a self-supervised learning method used in the natural language processing, and applied to enable the network to focus on the target region with important information by learning how much the input data contributes to the output data, while suppressing other irrelevant information and reducing the interference caused by irrelevant background on detection results[36,81]. This method can be applied to the model to extract features of different channels, for example, CBAM[82], BAM[83].

      The recognition speed of the evaluation target recognition model usually has the following evaluation indexes, such as inference time, inference throughput, inference frame rate, and hardware resource utilization. These evaluation indexes are also used to evaluate the reasoning speed of the model in specific application scenarios. Inference time is commonly used to evaluate the speed of the model in image recognition and classification, which means the time it takes the model to go from receiving the input image to outputting the prediction.

      In speeding up the prediction speed, the one-stage target detection method has more advantages. For the one-stage recognition method based on YOLO, the anchor method should be used for frame selection first, especially for YOLOv2 to YOLOv6, which takes up computing resources. To reduce model size and prediction speed, many researchers proposed an anchor-free method, which takes key points as the core. For example, the target center of the feature map is taken as the key point to locate the target. Based on the number of key points, the free-anchor method can be divided into central-point-based method and multi-key point-based method, such as CenterNet[84]. From a hardware aspect, different hardware selected for the prediction means different predictive speeds under the same model selection and parameters[85].

      In practical production, a single plant can exhibit concurrent occurrence of multiple diseases, with distinct characteristics of various diseases observed on the same leaf, or the same disease has different characteristics at different times[86]. This multifaceted infection pattern can be considered a more indicative measure for assessing current crop damage levels, placing higher demands on the accuracy and generalization capacity of disease identification models.

      Overall, if the leaf disease identification method is deployed in practical production, the balance between recognition accuracy and recognition speed can not be achieved only by optimizing the model or hardware. However, in terms of the evolution and development of methods for identifying leaf diseases using convolutional neural networks, the balance between accuracy and speed has to be mentioned.

      There are still some ways to balance the relationship between speed and accuracy of model inference. These methods can be roughly divided into three general directions: model compression, hardware optimization, and algorithm improvement. Model compression[87], such as channel pruning[88] and knowledge distilling[89], can reduce the number of parameters and complexity of the model, thereby improving the model reasoning speed and maintaining the accuracy of recognition to a certain extent[87]. Hardware optimization can often significantly increase the speed of model inference. For example, running a model on a GPU, TPU, or professional computing device can significantly increase the speed of the model inference[90]. There are more methods to accelerate the model inference speed and improve the inference accuracy by improving the algorithm, which are not listed in this paper. All three methods can improve the performance of the model both during training and during inference.

      It is noteworthy that the lower the error rate of the training model is not equal to the better the quality of the model when training models and too low a classification error rate usually leads to overfitting problems. For example, such as a fully connected network classifier, one should not simply assume that achieving the best learning quality is synonymous with minimizing the classification error rate. Some researchers have delved into understanding the delicate balance between learning difficulty and learning speed. By utilizing a single-layer perceptron and a double-layer neural network optimized with a gradient descent learning algorithm, the average accuracy typically decreases with training time. The model attains a harmonious equilibrium between training difficulty and learning rate when the training error rate is at 15.87%[91], resulting in an approximately 85% accuracy.

      Hence, in the pursuit of a specific characteristic index for the model, it is imperative to selectively adjust and optimize the model based on the prevailing circumstances or specific requirements. Furthermore, relying on a singular index is inadequate for evaluating the overall quality of a given model.

    • In the realm of agriculture, the identification of crop diseases stands as a pivotal task, serving as a key to further assessing the severity of current or potential hazards. The foregoing review elucidates that the application of artificial intelligence (AI) technology in monitoring plant leaf diseases attains commendable levels of recognition accuracy and expeditious identification in the model development and testing. This methodology emerges as a proactive approach to disease identification, conferring the capacity to empower agricultural stakeholders and experts in effectually addressing extant diseases or preemptively mitigating potential threats. Moreover, through the implementation of smart agriculture methodologies, the attainment of sustainable and resilient production is conceivable, thereby mitigating environmental impact and fortifying food security[92]. The judicious quantification of crop diseases fosters the formulation of precise protection strategies tailored to the dynamic and perpetually changing agricultural milieu. This approach facilitates the adoption of targeted disease prevention and control measures, consequently diminishing the superfluous use of pesticides. The resultant abatement in pesticide application not only serves to curtail production costs but also mitigates environmental pollution arising from pesticide usage[93]. The application of artificial intelligence (AI) technology for identifying leaf diseases, while promising, is not without potential challenges.

      In the application of tropical plant leaf disease recognition, the real-time monitoring and mobile device support characteristics based on deep learning model can greatly solve the characteristics of tropical plant leaf disease difficult to find and observe in time. To further solve the problem of insufficient computing power of mobile hardware devices or high demand for model recognition accuracy, the Master-Slave structure can be used. In this structure, the master model acts as the central node, such as the cloud platform, responsible for coordinating and controlling the operation of the whole system, while the receiver acts as the slave node, such as mobile devices, responsible for receiving and processing the instructions or data of the autonomous model, which can effectively realize the parallel processing and collaboration of tasks.

      Current studies have shown that terahertz waves can be used to detect physiological and biochemical parameters in plant leaves, such as water content[94], chlorophyll content, cell structure, and cell wall thickness, to indirectly reflect the occurrence and development of leaf diseases. Terahertz waves are electromagnetic waves between microwaves and infrared light, with frequencies ranging from 300 GHz to 3 THz. Terahertz waves have a wide range of applications in biomedicine, material science, and safety testing. Terahertz waves have strong penetration in biological materials and are also resonance absorbed by biomolecules, so they can be used to detect changes in the internal structure of plant leaves and biomolecules. It is possible to use deep learning models to analyze the signals of crop leaves fed back by terahertz waves, but the technology is still in the research stage[95,96]

      The rapid development of large language modeling in recent years has made it possible to combine large language modeling with leaf disease detection techniques[97]. The rapid development of large language modeling in recent years has made it possible to combine large language modeling with leaf disease detection techniques. Large language models have excellent advantages in processing and analyzing literature and data, which can help researchers better understand and grasp the research progress within the field of leaf disease detection. In addition, the powerful text comprehension and text generation capabilities of the big language model can assist in the annotation and enhancement of plant petiole image data, thus improving the efficiency of model training. Similarly, the Big Language Model can assist in reasoning and summarizing the results of leaf disease detection and provide scientific references and bases.

      However, it is still due to the complex and changeable climate environment such as tropical high temperatures and high humidity, especially considering the production and investment costs, it is not practical to use mobile devices equipped with identification models for detection. Hot and humid environments tend to damage electronic equipment, which increases maintenance costs after the equipment is deployed to the field.

    • The application of artificial intelligence (AI) technology in the detection and diagnosis of crop leaf diseases represents an advanced approach in precision agriculture, which particularly in machine learning and deep learning, various methods have proven effective in automating the identification and classification of crop leaf diseases. However, in practical implementation, it is imperative to carefully choose the suitable model and method for deployment based on the specific circumstances and demands. The detection and management of plant diseases in tropical areas remain a multifaceted issue. This technological application aims to swiftly and accurately evaluate the situation, thereby enabling timely interventions to mitigate the adverse impact of diseases on crop productivity.

    • The authors confirm contribution to the paper as follows: study conception and design: Huang M; manuscript preparation: Yao Z, Huang M. Both authors read and approved the final version of manuscript.

    • Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.

      • This research was supported by the Major Science and Technology Project of Hainan Province (Grant No. ZDKJ202017-2-2).

      • The authors declare that they have no conflict of interest.

      • Received 6 January 2024; Accepted 15 April 2024; Published online 26 June 2024

      • Copyright: © 2024 by the author(s). Published by Maximum Academic Press on behalf of Hainan University. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.
    Figure (1)  Table (3) References (97)
  • About this article
    Cite this article
    Yao Z, Huang M. 2024. Deep learning in tropical leaf disease detection: advantages and applications. Tropical Plants 3: e020 doi: 10.48130/tp-0024-0018
    Yao Z, Huang M. 2024. Deep learning in tropical leaf disease detection: advantages and applications. Tropical Plants 3: e020 doi: 10.48130/tp-0024-0018

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return