Optimizing crop classification in precision agriculture using AlexNet and high resolution UAV imagery

Oluibukun Gbenga Ajayi; Elisha Iwendi; Oluwatobi Olalekan Adetunji; Oluibukun Gbenga Ajayi; Elisha Iwendi; Oluwatobi Olalekan Adetunji

doi:10.48130/tia-0024-0009

The rapid advancement of artificial intelligence (AI), coupled with the utilization of aerial images from Unmanned Aerial Vehicles (UAVs), presents a significant opportunity to enhance precision agriculture for crop classification. This is vital to meet the rising global food demand. In this study, the effectiveness of 8-layer AlexNet, a Convolutional Neural Network (CNN) variant was investigated for automatic crop classification. A DJI Mavic UAV was employed to capture high-resolution images of a mixed-crop farm while adopting an iterative training approach for both AlexNet and the conventional CNN model. Comparison based on performance was done between these models across various training epochs to assess the impact of training epochs on the model's performance. Findings from this study consistently demonstrated that AlexNet outperformed the conventional CNN throughout all epochs. The conventional CNN achieved its highest performance at 60 epochs, with training and validation accuracies of 62.83% and 46.98%, respectively. In contrast, AlexNet reached peak training and validation accuracies of 99.25% and 71.81% at 50 epochs but exhibited a slight drop at 60 epochs due to overfitting. Remarkably, a strong positive correlation between AlexNet's training and validation accuracies was observed, unlike in the conventional CNN. The research also highlighted AlexNet's potential to generalize its crop classification accuracy to datasets beyond its training domain, with a caution to implement early stopping mechanisms to prevent overfitting. The findings of this study reinforce the role of deep learning and remotely sensed data in precision agriculture.

HTML

Introduction

By 2030, global population growth is expected to have grown close to 9 billion, and this will consequentially increase global demand for food^[1]. Unfortunately, destructive natural disasters such as floods and drought, and climate change impacts are gradually becoming huge threats to food security on local and national scales^[2]. Hence, to ensure food security, there is a need to have timely and reliable information about the location, health, extent, type, and yield of crops^[3−5]. Accurate and timely crop classification can produce basic data for various applications required for sustaining adequate food production. This includes forecasting of crop yield, assessment of food security, and crop area estimation.

Crop classification, an integral part of precision agriculture, consists of the identification of different crop types planted on an agricultural farmland. In precision agriculture, crop classification is necessary for automated crop health and growth monitoring, precision fertilization, yield estimation, and prediction^[6,7]. Unlike the manual and physical contact approaches (expert estimation) of identifying and classifying crop types on a farmland, automated crop classification using remote sensing-based technologies reduces time and labour costs and has higher accuracies^[8,9].

The recent evolution of Remote Sensing (RS) involving the introduction of Unmanned Aerial Vehicles (UAVs) for earth and environmental studies has brought about notable advancement in scientific studies, especially in the field of precision agriculture^[10]. The use of this technology has proved very beneficial by providing significant results for periodic and accurate croplands monitoring, extraction of information about crop phenology, crop health, crop types, and yield estimation over small and large areas^[11−13].

Hyperspectral and multispectral remote sensing has been a popular technique for classifying crops in recent years due to the fine spectral response to crop attributes^[14]. Different analytical and experimental approaches have been used by different authors for achieving this aim. Using time-series UAV images, some studies utilized vegetation index to recognize and classify crop planting area and vegetation^[15,16]. More sophisticated methods involving temporal feature extraction approach such as pre-defined mathematical models are also used in various crop classification studies involving the use of UAV images^[17]. Using these approaches based on temporal feature extraction, remarkable performance in crop classification has been recorded. However, most of the studies indicated some imperfections that hampered the reliability of the outputs. Generally, these approaches usually rely on expert's experience and domain knowledge which often lead to information loss, and hence, limit the reliability of feature extractors and effectiveness of the crop classification processes^[18,19].

The concept of artificial intelligence (AI) in image classification and automatic feature identification and extraction is developing and becoming an important method in a variety of disciplines^[20,21]. Machine Learning (ML) and Deep Learning (DL) are two major interwoven parts of artificial intelligence, and are also referred to as Machine Intelligence (MI), has recently become research topical in many fields concerned with obtaining highest possible accuracy for an effective and informed decision-making^[22]. Though, there is a very thin line between ML and DL, various studies have distinguished their approach of application. ML algorithms have excellent generalization strength and are mostly compatible for tackling nonlinear problems^[17]. Based on their manageable degree of accuracy, studies such as the one by Lu et al.^[23] employed both pixel-based and image-based K-Nearest Neighbour (KNN) algorithms combined with Landsat-8 images to classify different land cover types in China. The results of the study revealed about 90% classification accuracy. To achieve better accuracy Juan et al.^[24] recommended stacking of multiple ML classifiers as this can produce a higher advantage in crop classification. Based on these ensemble classifications, Löw et al.^[25] integrated random forest and SVM models for mapping multiple crops in Uzbekistan. The study involved analyzing multispectral remotely sensed images and the mapping using these models yielded an accuracy of approximately 95%, exemplifying the possibility of achieving higher classification accuracy when models are combined compared to when a single classifier is used. Other studies have also evaluated the performance of various ML models for classifying different crops on large and small extent of land^[26−28]. Efficiency of ML algorithms for classifying different crops is hampered by many factors. For example, classical ML algorithms such as Random Forest, KNN, and SVM depend on feature selection where there is a need to design feature extractors which mostly perform optimally on small databases but fail on larger and varied data^[8]. Also, integrating these algorithms makes computation, training, and other processes more cumbersome, consuming more storage space, and often requires sophisticated computer systems for implementation.

On the other hand, DL algorithms are recognized as a reliable approach for analysing remote sensing data such as UAV images^[29]. In classification studies, remote sensing data has become highly relevant by greatly benefiting from DL algorithms due to their flexibility in feature automation, representation via end-to-end procedure, and automatic feature extraction^[30,31]. Based on these unique characteristics, DL models (different networks and structures) have been utilized for crop type mapping/classification and crop yield monitoring.

Convolutional Neural Networks (CNN) is one of the most popular and utilized DL networks^[32−34]. Recently, because of CNN, DL has become more popular. Compared to earlier modeling systems, the conventional CNN automatically detects significant features without human supervision which makes it more popularly used or implemented^[35]. Conventional CNN-architecture consists of three layers which are the convolve layer, Rectified Linear unit (ReLu), and pooling layer^[36]. Its major role is to track and capture data having similar features to the conventional feed-forward neural network. Each image is submitted through the layers until a loss function is achieved at the top layer^[37]. Feature extractions from an image are usually performed by using image patches and filters that progress over the input image in the convolve layer.

Many studies have employed conventional CNN architecture for classification studies. For example, Zhao et al.^[38] assessed five different DL models for classifying croplands. The findings of the study show the high effectiveness of 1-D CNN for classifying crops. Ji et al.^[39] developed a 3-D CNN model for automatically classifying crops using spatio-temporal remotely sensed images. The developed network was fine-tuned using an active learning technique for increasing the labeling accuracy. Comparing the outputs of the study with a 2-D CNN classifier, the findings established that the proposed classifier performed efficiently and with higher accuracy. Generally, previous studies have established that DL models having CNN-based architectures yield high accuracy for image classification and object detection (which are the major ingredient for crop type classification^[40,41]. Such algorithms or models make use of convolutional filters on an image to extract important features for understanding the object of interest in the image with the help of convolutional operations covering key properties such as local connection, parameters (weight) sharing, and translation equi-variance^[42]. Pandey and Jain ^[43]presented a new conjugated dense CNN (CD-CNN) algorithm having a new activation function tagged SL-ReLU for multiple crop calibration from UAV-captured RGB images. The developed CD-CNN integrates data fusion and feature map extraction process. SL-ReLU, a dense block architecture served as an activation function for the purpose of mitigating the chance of unbounded convolved output and gradient explosion. The proposed CD-CNN achieved a strong distinguishing capability from several classes of crops. Experimental results show that the proposed module achieved an accuracy of 96.2% for the used data. A recent study by Kalita et al.^[44] also established the possibility of obtaining very high crop type classification accuracy (up to 99%) with CNN-based classifiers using UAV-acquired images, especially when two or more of such models are combined (ensembled).

The architecture of ML models is a vital consideration in improving their performance for different applications. In the past decades, several CNN architectures have been presented as an improvement on the conventional CNN structure^[45]. Such modifications of the conventional architecture include parameter optimizations, structural reformulation and regulation, among others. Among the most famous conventional CNN modified models include AlexNet, a high-resolution (HR) three dimensional (in the context of input data structures) CNN model variant specifically developed for image classification tasks, etc.^[35].

AlexNet is an example of CNN-based model structures which have been given very little attention for image classification and recognition, especially in the field of crop type classification. On the other hand, various studies have established the smartness and effectiveness of AlexNet in recognizing and identifying other features of remotely sensed data. Khan et al.^[46] applied this algorithm for classifying plant disease which was introduced as the encoder to encode an image into a compact representation as the graphical features. A prediction model CNN was selected as a decoder which performed the plant diseases classification. AlexNet in this study was trained using stochastic gradient descent which makes the training stage very easy. The output of the study revealed that AlexNet is capable of detecting and recognizing rice diseases and pests with an accuracy of over 96%. This makes the model an important advisory or early warning tool. Also, Pauline et al.^[47] applied AlexNet for classifying vegetation (including weed and Chinese cabbage) to detect weeds for accurate smart spraying solution. This study used UAV-acquired images which were pre-processed and subsequently segmented into crop, soil, and weed classes using simple linear iterative clustering super-pixel algorithm. These segmented images were subsequently used to build AlexNet classifier. The accuracy of this classification was assessed by comparing it with the output from Random Forest classifier which established that the CNN-based classifier achieved a higher overall accuracy (92.4%) than random forest (86.2%). Lv et al.^[48] also implemented AlexNet for identifying maize lead disease.

The aim of this study is to assess CNN-based AlexNet's performance for crop type classification and identification using a UAV image covering a mixed small-scale agricultural farm. Generally, the objectives of the study includes the acquisition of high resolution UAV captured images, setting up, testing, and training the CNN-based AlexNet structure using segmented pre-processed images. Though the aim of this study is built around AlexNet, to assess the effectiveness of its classification process (degree of success and error), conventional CNN architecture was also implemented for crop type classification. Therefore, this study also presents a comparative analysis of the performance of the conventional CNN and AlexNet model for crop classification.

Conclusions

The primary objective of this study was to evaluate the efficacy of AlexNet, a Convolutional Neural Network (CNN)-based model variant, in identifying a particular crop in a mixed crop farm using high-resolution aerial imagery obtained from low-altitude UAVs.

The acquired aerial images were meticulously organized into 32 batches to optimize computational efficiency during data processing. Training and validation of the AlexNet model were conducted following a similar methodology as that of the conventional CNN, allowing for a comparative analysis of the two models.

It is a common expectation that, under well-defined learning parameters, the performance of AI algorithms designed for image classification should exhibit improvement in both training and validation accuracy as the quantity and quality of the training dataset increase. To assess the behaviour of these models during training, four different training epochs were considered, specifically 30, 40, 50, and 60 epochs. The evaluation criteria were based on training and validation loss as well as accuracy.

Throughout all the epochs, AlexNet consistently outperformed the CNN model in terms of both training and validation datasets. As anticipated, the accuracies of AlexNet increased progressively from 30 epochs through 50 epochs, but a decrease in accuracy was observed at 60 epochs, indicating the onset of overfitting. In contrast, while CNN's accuracies were not as high as AlexNet's, they demonstrated a continual increase as the number of epochs increased, suggesting that CNN's architectural design allows for effective learning from extensive data without suffering from the complexities of overfitting.

This study underscores AlexNet's capability to accurately classify crops within a mixed vegetation field, particularly when utilizing high-resolution UAV images. AlexNet achieved a remarkable maximum training accuracy of 99.25% and, when tested on datasets outside its training domain (validation dataset), attained a maximum accuracy of 71.81%, notably at 50 epochs.

The findings of this study align closely with the research conducted by Arya et al.^[51], which also compared the performance of conventional CNN and AlexNet in detecting diseases in potato leaves. Additionally, a recent study by Arya & Singh^[52] has reported findings that are consistent with the results presented in this current work.

In light of the observed overfitting, we strongly recommend implementing early stopping techniques, as demonstrated in this study at 50 epochs, or modifying classification hyperparameters to optimize AlexNet's performance whenever overfitting is detected. Further research efforts will be invested in the investigation of the performance of AlexNet in classifying crop plantations beyond two categories. It will involve optimizing the pre-processing stage and refining hyperparameter definitions to ensure that the model can be trained with the maximum available dataset and iterations, thereby advancing the field of crop classification in precision agriculture.

Author contributions

The authors confirm contribution to the paper as follows: study conception and design: Ajayi OG; data collection: Ajayi OG; analysis and interpretation of results: Iwendi, E, Adetunji OO; draft manuscript preparation: Iwendi, E, Adetunji OO. All authors reviewed the results and approved the final version of the manuscript.

[1]	Sheikh M, Fakhrul I, Zahurul K. 2020 World's demand for food and water. In Desalination - Challenges and Opportunities, eds. Hossein Davood Abadi Farahani M, Vahid Vatanpour, Amir Taheri. IntechOpen. https://doi.org/10.5772/intechopen.85919
[2]	Khan MA, Tahir A, Khurshid N, Husnain MIu, Ahmed M, et al. 2020. Economic effects of climate change-induced loss of agricultural production by 2050: A case study of Pakistan. Sustainability 12:1216 doi: 10.3390/su12031216 CrossRef Google Scholar
[3]	Agovino M, Casaccia M, Ciommi M, Ferrara M, Marchesano K. 2019. Agriculture, climate change and sustainability: The case of EU-28. Journal of Ecological Indicators 105:525−543 doi: 10.1016/j.ecolind.2018.04.064 CrossRef Google Scholar
[4]	Seydi ST, Amani M, Ghorbanian A. 2022. A dual attention Convolutional Neural Network for Crop classification Using Time-Series Sentinel-2 Imagery. Journal of Remote Sensing 14:498 doi: 10.3390/rs14030498 CrossRef Google Scholar
[5]	Khan HR, Gillani Z, Jamal MH, Athar A, Chaudhry MT, et al. 2023. Early identification of crop type for smallholder farming systems using deep learning on time-series Sentinel-2 Imagery. Sensors 52;3(4):1779 doi: 10.3390/s23041779 CrossRef Google Scholar
[6]	Ashapure A, Jung J, Yeom J, Chang A, Maeda M, et al. 2019. A novel framework to detect conventional tillage and no-tillage cropping system effect on cotton growth and development using multi-temporal UAS data. ISPRS Journal of Photogrammetry and Remote Sensing 152:49−64 doi: 10.1016/j.isprsjprs.2019.04.003 CrossRef Google Scholar
[7]	Ajayi OG, Ashi J, Guda B. 2023. Performance evaluation of YOLO v5 model for automatic crop and weed classification on UAV images. Smart Agricultural Technology 5:100231 doi: 10.1016/j.atech.2023.100231 CrossRef Google Scholar
[8]	Reedha R, Dericquebourg E, Canals R, Hafiane A. 2022. Transformer neural network and crop classification of high resolution UAV images. Journal of Remote Sensing 14(3):592 doi: 10.3390/rs14030592 CrossRef Google Scholar
[9]	Bhuyar N, Acharya S, Theng D. 2020. Crop classification with multi-temporal satellite image data. International Journal of Engineering Research & Technology 9(6):221−25 doi: 10.17577/ijertv9is060208 CrossRef Google Scholar
[10]	Weiss M, Jacob F, Duveiller G. 2020. Remote sensing for agricultural applications: A meta-review. Remote Sensing of Environment 236:111402 doi: 10.1016/j.rse.2019.111402 CrossRef Google Scholar
[11]	Donohue RJ, Lawes RA, Mata G, Gobbett D, Ouzman J. 2018. Towards a national, remote-sensing-based model for predicting field-scale crop yield. Field Crops Research 227:79−90 doi: 10.1016/j.fcr.2018.08.005 CrossRef Google Scholar
[12]	Kern A, Barcza Z, Marjanović H, Árendás T, Fodor N, Bónis P, Bognár P, Lichtenberger J. 2018. Statistical modelling of crop yield in Central Europe using climate data and remote sensing vegetation indices. Agricultural and Forest Meteorology 260-261:300−20 doi: 10.1016/j.agrformet.2018.06.009 CrossRef Google Scholar
[13]	Ajayi OG, Ashi J. 2023. Effect of varying training epochs of a faster region-based convolutional neural network on the accuracy of an automatic weed classification scheme. Smart Agricultural Technology 3:100128 doi: 10.1016/j.atech.2022.100128 CrossRef Google Scholar
[14]	Yang L, Chen J, Zhang R, Yang S, Zhang X, et al. 2023. Precise crop classification of UAV hyperspectral imagery using kernel tensor slice sparse coding based classifier. Neurocomputing 551:126487 doi: 10.1016/j.neucom.2023.126487 CrossRef Google Scholar
[15]	Wu Z, Zhang J, Deng F, Zhang S, Zhang D, et al. 2021. Fusion of GF and MODIS data for regional-scale grassland community classification with EVI2 time-series and phenological features. Journal of Remote Sensing 13(5):835 doi: 10.3390/rs13050835 CrossRef Google Scholar
[16]	Wang J, Wu B, Kohnen MV, Lin D, Yang C, et al. 2021. Classification of rice yield using UAV-based hyperspectral imagery and lodging feature. Plant Phenomics 2021:9765952 doi: 10.34133/2021/9765952 CrossRef Google Scholar
[17]	Wang X, Zhang J, Xun L, Wang J, Wu Z, et al. 2022. Evaluating the effectiveness of machine learning and deep learning models combined time-series satellite data for multiple crop types classification over a large-scale region. Remote Sensing 14(10):2341 doi: 10.3390/rs14102341 CrossRef Google Scholar
[18]	Zhong L, Hu L, Zhou H. 2019. Deep learning based multi-temporal crop classification. Remote Sensing of Environment 221:430−43 doi: 10.1016/j.rse.2018.11.032 CrossRef Google Scholar
[19]	Kumar S, Jain A, Shukla AP, Singh S, Raja R, et al. 2021. A comparative analysis of machine learning algorithms for detection of organic and nonorganic cotton diseases. Mathematical Problems in Engineering 2021:1790171 doi: 10.1155/2021/1790171 CrossRef Google Scholar
[20]	Fielding B, Zhang L. 2018. Evolving Image Classification Architectures with Enhanced Particle Swarm Optimisation. IEEE Access 6:68560−75 doi: 10.1109/ACCESS.2018.2880416 CrossRef Google Scholar
[21]	Ajayi OG, Oruma E. 2022. On the applicability of integrated UAV photogrammetry and automatic feature extraction for cadastral mapping. Advances in Geodesy and Geoinformation 71(1):1−24 doi: 10.24425/gac.2022.141172 CrossRef Google Scholar
[22]	Ajayi OG. 2023. Application of Machine intelligence in Smart Societies: A critical review of the opportunities and risks. In Machine Intelligence for Smart Applications. Studies in Computational Intelligence, eds. Adadi A, Motahhir S. vol. 1105. Cham: Springer. pp. 1−17. https://doi.org/10.1007/978-3-031-37454-8_1
[23]	Lu HX, He J, Liu L. 2019. Discussion on multispectral remote sensing image classification integrating object-oriented image analysis and KNN algorithm. Technological Innovation and Application 11:27−30 doi: 10.3969/j.issn.2095-2945.2019.11.007 CrossRef Google Scholar
[24]	Yuan PS, Yang CL, Song YH, Zhai ZY, Xu HL. 2019. Classification of rice phenotypic omics entities based on stacking integrated learning. Transactions of the Chinese Society for Agricultural Machinery 50(11):144−52 doi: 10.6041/j.issn.1000-1298.2019.11.016 CrossRef Google Scholar
[25]	Löw F, Michel U, Dech S, Conrad C. 2013. Impact of feature selection on the accuracy and spatial uncertainty of per-field crop classification using Support Vector Machines. ISPRS Journal of Photogrammetry and Remote Sensing 85:102−119 doi: 10.1016/j.isprsjprs.2013.08.007 CrossRef Google Scholar
[26]	Saini R, Ghosh SK. 2018. Crop classiﬁcation on single date sentinel-2 imagery using random forest and suppor vector machine. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences 42:683−88 doi: 10.5194/isprs-archives-xlii-5-683-2018 CrossRef Google Scholar
[27]	Maponya MG, van Niekerk A, Mashimbye ZE. 2020. Pre-harvest classification of crop types using a Sentinel-2 time-series and machine learning. Journal of Computers and Electronics in Agriculture 169:105164 doi: 10.1016/j.compag.2019.105164 CrossRef Google Scholar
[28]	Ajayi OG, Opaluwa YD, Ashi J, Zikirullahi WM. 2022. Applicability of artificial neural network for automatic crop type classification on UAV-based images. Environmental Technology and Science Journal 13(1):57−72 doi: 10.4314/etsj.v13i1.5 CrossRef Google Scholar
[29]	Seydi ST, Hasanlou M, Amani M, Huang W. 2021. Oil spill detection based on multiscale multidimensional residual CNN for optical remote sensing imagery. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 14:10941−52 doi: 10.1109/JSTARS.2021.3123163 CrossRef Google Scholar
[30]	Koirala A, Walsh KB, Wang Z, McCarthy C. 2019. Deep learning – Method overview and review of use for fruit detection and yield estimation. Computers and Electronics in Agriculture 162:219−34 doi: 10.1016/j.compag.2019.04.017 CrossRef Google Scholar
[31]	Wan X, Zhao C, Wang Y, Liu W. 2017. Stacked sparse autoencoder in hyperspectral data classification using spectral-spatial, higher order statistics and multifractal spectrum features. Infrared Physics & Technology 86:77−89 doi: 10.1016/j.infrared.2017.08.021 CrossRef Google Scholar
[32]	Yao G, Lei T, Zhong J. 2019. A review of convolutional-neural-network-based action recognition. Pattern Recognition Letters 118:14−22 doi: 10.1016/j.patrec.2018.05.018 CrossRef Google Scholar
[33]	Dhillon A, Verma GK. 2020. Convolutional neural network: a review of models, methodologies and applications to object detection. Progress in Artificial Intelligence 9(2):85−112 doi: 10.1007/s13748-019-00203-0 CrossRef Google Scholar
[34]	Ajayi OG, Ojima A. 2022. Performance evaluation of selected cloud occlusion removal algorithms on remote sensing imagery. Remote Sensing Applications: Society and Environment 25:100700 doi: 10.1016/j.rsase.2022.100700 CrossRef Google Scholar
[35]	Alzubaidi L, Zhang J, Humaidi AJ, Al-Dujaili A, Duan Y, et al. 2021. Review of deep learning: concepts, CNN architectures, challenges, applications, future directions. Journal of Big Data 8:53 doi: 10.1186/s40537-021-00444-8 CrossRef Google Scholar
[36]	Muhammad NA, Nasir AA, Ibrahim Z, Sabri N. 2018. Evaluation of CNN, AlexNet and GoogleNet for fruit recognition. ndonesian Journal of Electrical Engineering and Computer Science 12(2):468−75 doi: 10.11591/ijeecs.v12.i2.pp468-475 CrossRef Google Scholar
[37]	Sabri N, AbdulAziz Z, Ibrahim Z, Akmal Rasydan Bin Mohd Rosni M, Hafiz bin Abd Ghapul A. 2018. Comparing convolution neural network models for leaf recognition. International Journal of Engineering and Technology (IJET) 7:141−44 doi: 10.14419/ijet.v7i3.15.17518 CrossRef Google Scholar
[38]	Zhao H, Duan S, Liu J, Sun L, Reymondin L. 2021. Evaluation of five deep learning models for crop type mapping using sentinel-2 time series images with missing information. Remote Sensing 13(14):2790 doi: 10.3390/rs13142790 CrossRef Google Scholar
[39]	Ji S, Zhang C, Xu A, Shi Y, Duan Y. 2018. 3D convolutional neural networks for crop classification with multi-temporal remote sensing images. Remote Sensing 10:75 doi: 10.3390/rs10010075 CrossRef Google Scholar
[40]	Liu N, Zhao Q, Williams R, Barrett B. 2023. Enhanced crop classification through integrated optical and SAR data: a deep learning approach for multi-source image fusion. International Journal of Remote Sensing 00:1−29 doi: 10.1080/01431161.2023.2232552 CrossRef Google Scholar
[41]	Ajayi OG, Olufade OO. 2023. Drone-based crop type identification with convolutional neural networks: an evaluation of the performance of RESNET architectures. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences X-1/W1-2023:991−98 doi: 10.5194/isprs-annals-X-1-W1-2023-991-2023 CrossRef Google Scholar
[42]	LeCun Y, Bengio Y, Hinton G. 2015. Deep learning. Nature 521:436−44 doi: 10.1038/nature14539 CrossRef Google Scholar
[43]	Pandey A, Jain K. 2022. An intelligent system for crop identification and classification from UAV images using conjugated dense convolutional neural network. Journal of Computers and Electronics in Agriculture 192:106543 doi: 10.1016/j.compag.2021.106543 CrossRef Google Scholar
[44]	Kalita I, Singh GP, Roy M. 2023. Crop classification using aerial images by analyzing an ensemble of DCNNs under multi-filter & multi-scale framework. Multimedia Tools and Applications 82:18409−33 doi: 10.1007/s11042-022-13946-1 CrossRef Google Scholar
[45]	Khan A, Sohail A, Zahoora U, Qureshi AS. 2020. A survey of the recent architectures of deep convolutional neural networks. Artificial Intelligence Review 53(8):5455−516 doi: 10.1007/s10462-020-09825-6 CrossRef Google Scholar
[46]	Krishna K. 2023. Plant disease classification using Alex Net. Research Square Preprint doi: 10.21203/rs.3.rs-2612739/v1 CrossRef Google Scholar
[47]	Ong P, Teo KS, Sia CK. 2023. UAV-based weed detection in Chinese cabbage using deep learning. Smart Agricultural Technology 4:100181 doi: 10.1016/j.atech.2023.100181 CrossRef Google Scholar
[48]	Lv M, Zhou G, He M, Chen A, Zhang W, et al. 2020. Maize leaf disease identification based on feature enhancement and DMS-Robust Alexnet. IEEE Access 8:57952−57966 doi: 10.1109/ACCESS.2020.2982443 CrossRef Google Scholar
[49]	Krizhevsky A, Sutskever I, Hinton GE. 2017. ImageNet classification with deep convolutional neural networks. Communications of the ACM 60(6):84−90 doi: 10.1145/3065386 CrossRef Google Scholar
[50]	Lakshmanarao A, Babu MR, Kiran TSR. 2021. Plant disease prediction and classification using deep learning ConvNets. International Conference on Artificial Intelligence and Machine Vision (AIMV), Gandhinagar, India, 24-26 September 2021. pp. 1−6. https://doi.org/10.1109/AIMV53313.2021.9670918
[51]	Arya S, Singh R. 2019. A comparative study of CNN and AlexNet for detection of disease in potato and mango leaf. International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT), Ghaziabad, India, 27−28 September 2019. USA: IEEE. pp. 1−6. https://doi.org/10.1109/ICICT46931.2019.8977648
[52]	Kayadibi I, Güraksın GE, Ergün U, Özmen Süzme N. 2022. An eye state recognition system using transfer learning: AlexNet-based deep convolutional neural network. International Journal of Computational Intelligence Systems 15:49 doi: 10.1007/s44196-022-00108-2 CrossRef Google Scholar

Parameters	Specifications	Remark
Number of rotors	4	Indirectly ensures no omission/gap in captured images
GSD	22.2 mm	This helped to achieve a better image resolution
Mission time	94:17 min	This is the total time taken for the drone to take off, cover the area of interest and return back
Battery	4,000 mAh	Capacity of the drone's battery – has a direct impact on cost and time of the flight
Flight direction	−50 °C
Number of batteries used	6	The life cycle of a fully charged battery of the drone is less than 16 min, hence, six were used for the flight
Flying altitude	30 m	Selected to ensure desired high spatial resolution is achieved
Flying velocity	5 ms⁻¹	Velocity set in agreement with the desired image quality
Flight date	August, 2019

Parameters	Specifications
Model	Mavic 1
Sensor	94:17 min
Resolution Sensor type	CMOS
Resolution	1.0 cm/px
F-stop	f/2.8
ISO	100
Focal length	3.5 mm

Hyperparameters	AlexNet	CNN
Depth	8 layers	5 layers
Image size	224 × 224	224 × 224
Batch size	32	32
No of epochs	30−60	30−60
Learning rate	0.0001	0.0001

Epochs	Time taken (h)		Training accuracy (%)		Training loss		Validation Accuracy (%)		Validation loss
Epochs	AlexNet	CNN	AlexNet	CNN	AlexNet	CNN	AlexNet	CNN	AlexNet	CNN
30	2.73	2.04	93.7%	44.3%	2.043	2.045	61%	21%	1.262	1.682
40	4.15	4.17	94.8%	54.2%	0.071	1.408	68%	39%	0.988	1.482
50	5.58	5.45	99.3%	54.5%	0.025	1.208	72%	42%	1.745	1.435
60	6.85	6.67	98.6%	63%	0.079	1.020	66%	47%	1.730	1.424

{{lists.name}}

Optimizing crop classification in precision agriculture using AlexNet and high resolution UAV imagery

Abstract

Rights and permissions

References

About this article

Cite this article

Article Metrics

Access History

Other Articles By Authors