-
The rapid advancement of artificial intelligence, big data, and sensor technology has propelled crop phenomics research into a phase of accelerated development[1,2]. Various sensor modalities are essential for acquiring phenotypic data; however, traditional methods relying on manual observation and limited datasets suffer from subjectivity, low throughput, and insufficient dimensionality. To overcome these limitations, automated phenotyping platforms have been introduced. In recent years, diverse crop phenotyping systems have been developed globally, including rail-based systems[3,4], unmanned vehicles[5], robotic platforms[6], and drones[7]. Each system presents unique challenges: unmanned vehicles risk damaging crops during operation, increasing data uncertainty; drones are limited by low ground resolution and challenges in managing large data volumes; and rail-based platforms are costly and confined to restricted operational areas. In contrast, robotic phenotyping platforms, with their compact size, adaptability, and high levels of automation, effectively navigate complex environments with minimal crop disturbance. These features enhance the efficiency of data acquisition and improve analytical precision.
Recent advancements in multi-source sensor technologies, including LiDAR, thermal infrared, multispectral, and RGB cameras, have significantly enhanced the collection of comprehensive phenotypic information[2]. RGB cameras are used to estimate textural features and vegetation indices[8,9]. Thermal infrared cameras assess plant transpiration and identify genotypes with heat tolerance and drought resistance[10]. Multispectral cameras provide insights into chlorophyll content, vegetation indices, and nitrogen concentration[11]. LiDAR technology is primarily applied to three-dimensional structural modeling and biomass estimation of crops[2]. Despite their utility, phenotypic parameters derived from individual sensors are inherently limited, requiring image or point cloud segmentation to isolate crop-specific regions of interest. However, each sensor type has limitations: infrared images often have indistinct contours, multispectral images may lack color texture, and LiDAR point clouds rely on structural information that necessitates complex segmentation algorithms[12]. In contrast, RGB images offer detailed color and texture information, which improves segmentation performance in complex scenarios, such as overlapping leaves[13]. Moreover, RGB image segmentation algorithms are computationally efficient and suitable for high-throughput applications with limited resources[13]. Given these limitations, a critical research focus is the rapid and effective integration of multimodal data into unified, feature-enhanced phenotypic datasets, thereby facilitating the extraction of crop-specific regions of interest and addressing gaps in traditional phenotypic analysis.
In the field of multi-source sensor data fusion, numerous researchers have made significant advancements. Teng et al. developed a multimodal sensor fusion framework to construct multimodal maps[14]. Pire et al. introduced agricultural robot datasets that integrate odometry and sensor data layers[15]. Yin et al.[16] and Das et al.[7] presented multi-sensor datasets for ground robots and drones, respectively, enabling the extraction of key plant attributes. Sagan et al.[17] utilized support vector regression and deep neural networks to combine multimodal phenotypic information, such as canopy spectra and textures, for yield prediction. Xie et al. applied accelerated robust features to fuse close-range depth and snapshot spectral images, generating 3D multispectral point clouds[18]. Zhang et al. proposed a thermal direct method, utilizing infrared features to fuse visible, infrared, and depth images with localization algorithms, thereby creating continuous thermal clouds[19]. Li et al. demonstrated that fusing LiDAR and RGB camera data enhances temporal phenotyping accuracy[20]. Sun et al. introduced a Fourier transform-based image registration technique for multispectral reflectance, which was further extended with posture estimation and multi-view RGB-D image reconstruction to produce multispectral 3D point clouds[21]. Correa et al. applied pattern recognition and statistical optimization to fuse thermal and near-infrared images into a unified multispectral 3D image[22]. Lin et al. utilized structure-from-motion (SfM) technology to generate RGB and thermal point clouds, employing the Fast Global Registration (FGR) algorithm for precise integration into RGB-T point clouds[23]. Existing research predominantly focuses on multi-sensor crop information acquisition, with several teams sharing multi-source datasets and fusion methods. Some studies leverage intrinsic and extrinsic parameters for data fusion[19,21,22], while others conduct feature-level fusion on drones[17]. However, outdoor environments introduce challenges such as wind interference, low resolution, and flight altitude variability. Similarly, 3D reconstruction methods[21,23] face limitations, including lengthy processing times and data loss. Despite advancements in multi-source sensor data fusion, the integration of thermal infrared and multispectral data remains underexplored; yet, this fusion holds significant potential for enhancing phenotypic analysis. Thermal infrared data provide valuable insights into plant transpiration and stress responses, while multispectral data reveal important information about plant chlorophyll content, water stress, and nitrogen levels. By combining these two data sources, a more comprehensive understanding of plant health and growth dynamics can be achieved.
To address these gaps, this study proposes a multi-source sensor fusion system implemented on a robotic phenotyping platform within a controlled greenhouse environment. The key contributions include high-throughput data acquisition, multi-dimensional phenotyping, and optimization of extrinsic parameters. Additionally, standardized data structures, 2D-3D mapping for crop region of interest (ROI) extraction, and rapid algorithms for measuring multi-dimensional phenotypic parameters are also developed. The system demonstrates superior performance compared to existing UGV-LiDAR platforms. This study aims to develop a low-cost, practical, and high-throughput crop phenotyping system for non-destructive, automatic measurement, and rapid analysis of phenotypic parameters.
The primary goals of this study are to:
● Improve the efficiency of data acquisition in greenhouse environments by automating the collection and integration of multi-source sensor data;
● Enhance the accuracy and reliability of phenotypic parameter extraction through the fusion of multi-modal sensor data;
● Develop a standardized data structure that facilitates seamless integration and analysis of multi-source sensor data for phenotypic assessment;
● Validate the performance of the proposed system by comparing its efficiency and accuracy to existing systems, particularly in terms of cost and scalability.
-
This study presents a phenotyping robotic platform designed for high-throughput collection of phenotypic raw data and the development of analytic algorithms to establish a comprehensive phenotyping pipeline for greenhouse strawberries (see Fig. 1). The hardware for data collection includes a four-wheel-drive agricultural robot developed by the Digital Plant Key Laboratory at the Information Technology Research Center of the Beijing Academy of Agricultural and Forestry Sciences. This robot features a lifting mechanism and an array of multi-source sensors, including a LiDAR, a high-frame-rate multispectral camera, an RGB-D camera, and a thermal infrared camera, together with a microcontroller. A complete workflow was developed, encompassing all stages from data acquisition to processing (illustrated in Fig. 1). This workflow includes sensor data collection, segmentation of plant regions of interest (ROI) from RGB images, calibration of intrinsic and extrinsic parameters of the sensors, fusion of multi-source sensor data into a standardized data structure, extraction of phenotypic parameters, and evaluation of phenotypic resolution effects during analysis. This integrated approach enhances the efficiency and accuracy of data acquisition, providing a robust foundation for advanced data analysis and plant phenotyping.
Experimental design
-
The experimental phase of this study was conducted in a linked greenhouse at the Beijing Academy of Agricultural and Forestry Sciences, Beijing, China (39°56' N, 116°16' E), a controlled environment tailored for advanced agricultural research. As shown in Fig. 2a, the strawberry cultivation area consisted of ten columns, each containing 36 pots, with three pots assigned to each variety, resulting in a total of 360 plants for phenotyping analysis. Data were collected on March 27, 2024, using a multi-source sensor system integrated into a robotic platform (Fig. 2b). This platform was designed to navigate between rows and capture high-quality phenotypic data across various environmental and plant parameters. Data were gathered from two columns of healthy strawberry plants, representing 72 pots and 24 distinct varieties, offering a diverse dataset for evaluating the system's robustness and scalability. The selection of 24 distinct strawberry varieties was based on the need to capture a diverse set of phenotypic traits across a range of genotypes. The operational procedures and technical specifications of the multi-source sensor system are outlined in Fig. 2c, which demonstrates the integration of RGB, infrared, and multi-spectral cameras, as well as a LiDAR sensor for precise data acquisition. For comparative performance evaluation, the UGV-LiDAR[5] system was also used to collect additional data on the same strawberry plants. This system allowed for the assessment of the proposed robotic platform's effectiveness in terms of data consistency, acquisition speed, and multi-dimensional phenotyping accuracy.
Figure 2.
(a) Strawberry planting distribution map; (b) Our platform; (c) Multi-source sensor layout.
Hardware designs
-
The hardware architecture consists of two primary components: the robotic platform and the multi-source sensor array. The robot's wheelbase is 340 mm, which is smaller than the spacing between two rows of pots in the greenhouse (approximately 400 mm), allowing the robot to move freely between the rows. It is equipped with five servo motors and corresponding motor drivers, four of which control the wheels, while the fifth operates the elevation system. The drive system utilizes pulse-width modulation (PWM) for communication. The elevation system adjusts the platform's height to accommodate different growth stages, ensuring all parts of the plants remain within the sensors' field of view. Additionally, the elevation system is designed to support the weight of the onboard sensors and the microcontroller. The specific relevant parameters of the platform are shown in Table 1.
Table 1. Robotics platform related parameters.
Size: 1,006 mm × 340 mm × 521 mm
Motor specification: 200 W servo motor
Battery specification: 48 V, 35 Ah
Weight: 80 kg
Running speed: 0.5 m/s
Maximum load of the lifting system: 10 kg
Height of the top from the ground: 1.5−3.1 m

The multi-source sensor system, shown in Fig. 2c, was developed using SolidWorks 2023 software and integrates advanced imaging technologies, including the Azure Kinect DK RGB-D camera, the CropEye-A1 high-frame-rate four-channel multispectral camera, the IrayT3Pro long-wave infrared (LWIR) camera, and the Livox Avia LiDAR. The Azure Kinect DK captures RGB, depth, and near-infrared images simultaneously. The CropEye-A1 facilitates the acquisition of four spectral bands—green, red, red-edge, and near-infrared—at 30 frames per second. The IrayT3Pro LWIR camera provides a relative accuracy of 0.1 °C and a 62.9° field of view, while the Livox Avia LiDAR offers non-repeating and repeating scanning modes, enabling adaptability to various scenarios.
As illustrated in Fig. 3, the platform connects to an industrial personal computer (IPC) via a remote control system, allowing indirect control of the robot's movement and lift bar. Power is supplied to the microcontroller and LiDAR through a built-in lithium-ion battery using a 48 V to 12 V step-down module. A laptop connected to the same local area network (LAN) as the microcontroller enables remote data acquisition control. The platform employs a four-wheel-drive system, where each wheel is powered by an independent brushless motor and steered via a four-wheel differential mechanism. The differential ratio, defined as the speed difference between the right and left wheels, determines the turning radius; a higher differential ratio results in a smaller turning radius. The control module includes an Arduino control board, five encoders, and an industrial computer. This closed-loop system utilizes RS485 communication to regulate wheel movement, steering, and the lift bar's height adjustment.
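To make the steering relationship concrete, the sketch below computes left and right wheel speeds for a desired turning radius under standard differential-drive kinematics. It is an illustration only, not the platform's controller code, and the 0.34 m track width is taken from the chassis dimension quoted above.

```python
# Illustrative differential-drive kinematics, not the platform's actual controller.
# The 0.34 m track width is taken from the chassis dimension quoted above.
TRACK_WIDTH_M = 0.340

def wheel_speeds(linear_speed_mps, turn_radius_m):
    """Return (left, right) wheel speeds for a desired turning radius.

    A larger left-right speed difference (the differential ratio) yields a
    smaller turning radius; an infinite radius corresponds to driving straight.
    """
    if turn_radius_m == float("inf"):
        return linear_speed_mps, linear_speed_mps
    half_track = TRACK_WIDTH_M / 2.0
    left = linear_speed_mps * (1.0 - half_track / turn_radius_m)
    right = linear_speed_mps * (1.0 + half_track / turn_radius_m)
    return left, right

# Example: at the platform's 0.5 m/s running speed, a 1 m turning radius
print(wheel_speeds(0.5, 1.0))  # (0.415, 0.585): the right wheel runs faster
```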
Data acquisition
-
The hardware design described above enables efficient data acquisition through its integrated sensor array and precise control mechanisms. However, the use of depth cameras in brightly lit greenhouse environments often leads to incomplete depth information. To overcome this challenge, LiDAR scanning technology was used to capture three-dimensional point cloud data of the crops. Additionally, a long-wave infrared (LWIR) camera was used to capture thermal infrared images and their corresponding pixel temperature matrices; an RGB camera was utilized for RGB image acquisition; and a four-band multispectral camera was employed to collect multispectral images.
The microcontroller, running the Ubuntu 20.04 operating system with ROS (Robot Operating System) Noetic, served as the central node for managing the RGB-D camera, LWIR camera, multispectral camera, and LiDAR. These sensors primarily published image and point cloud data as node messages. The ROS framework was selected for its capability to support real-time multi-source sensor data fusion in future applications. While the RGB-D camera and LiDAR utilized official publishing nodes, custom nodes were developed for the multispectral and thermal infrared cameras. These custom nodes defined specific message formats, including temperature matrices, ISO settings for multispectral images, and auto-exposure times, which were subsequently published.
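As an illustration of such a custom node, the following sketch publishes an LWIR image together with its temperature matrix under ROS Noetic. The topic names, the message layout (a plain Float32MultiArray rather than the platform's custom message type), and the read_lwir_frame() helper are hypothetical.

```python
#!/usr/bin/env python3
# Minimal sketch of a custom ROS Noetic publisher for the LWIR camera.
# Topic names, message layout, and read_lwir_frame() are hypothetical;
# the platform's real node uses its own custom message definitions.
import rospy
import numpy as np
from sensor_msgs.msg import Image
from std_msgs.msg import Float32MultiArray
from cv_bridge import CvBridge

def read_lwir_frame():
    """Placeholder for the camera SDK call: returns an 8-bit image and the per-pixel temperatures (degC)."""
    frame = np.zeros((512, 640), dtype=np.uint8)
    temps = np.zeros((512, 640), dtype=np.float32)
    return frame, temps

def main():
    rospy.init_node("lwir_publisher")
    img_pub = rospy.Publisher("/lwir/image_raw", Image, queue_size=1)
    temp_pub = rospy.Publisher("/lwir/temperature_matrix", Float32MultiArray, queue_size=1)
    bridge = CvBridge()
    rate = rospy.Rate(10)  # publish at 10 Hz
    while not rospy.is_shutdown():
        frame, temps = read_lwir_frame()
        img_pub.publish(bridge.cv2_to_imgmsg(frame, encoding="mono8"))
        temp_pub.publish(Float32MultiArray(data=temps.flatten().tolist()))
        rate.sleep()

if __name__ == "__main__":
    main()
```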
For data storage, nodes published by each sensor were subscribed to and saved. Data acquisition was conducted at fixed points: the robot paused at designated positions, adjusted the elevation column to align with the crop height, and remained stationary for approximately 10 s. During this time, LiDAR point cloud data were collected, and data from all cameras were simultaneously saved. This duration was carefully calibrated to balance data quality and efficiency; a shorter duration could compromise point cloud accuracy, while a longer duration might increase data volume, reduce efficiency, and complicate subsequent storage and fusion processes.
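The fixed-point capture logic might be sketched as follows: the camera topics are time-synchronized with message_filters, and every LiDAR frame received during the 10-s dwell is written out for later merging. The topic names, the rosbag output, and the dwell handling are illustrative assumptions rather than the platform's exact implementation.

```python
#!/usr/bin/env python3
# Sketch of the fixed-point capture step: time-synchronize the camera streams
# and accumulate ~10 s of LiDAR frames at each stop. Topic names and file
# paths are illustrative assumptions, not the paper's exact setup.
import rospy
import rosbag
import message_filters
from sensor_msgs.msg import Image, PointCloud2

CAPTURE_SECONDS = 10.0  # stationary dwell time used to densify the LiDAR cloud

def capture_one_station(out_path="station_000.bag"):
    bag = rosbag.Bag(out_path, "w")

    def cameras_cb(rgb, lwir, ms_green):
        # Every synchronized camera triple within the dwell window is written;
        # one representative frame per station is selected later.
        bag.write("/rgb/image_raw", rgb)
        bag.write("/lwir/image_raw", lwir)
        bag.write("/multispectral/green", ms_green)

    def lidar_cb(cloud):
        # All LiDAR frames from the dwell window are kept and later merged
        # into a single dense point cloud.
        bag.write("/livox/lidar", cloud)

    subs = [message_filters.Subscriber(t, Image)
            for t in ("/rgb/image_raw", "/lwir/image_raw", "/multispectral/green")]
    sync = message_filters.ApproximateTimeSynchronizer(subs, queue_size=10, slop=0.1)
    sync.registerCallback(cameras_cb)
    lidar_sub = rospy.Subscriber("/livox/lidar", PointCloud2, lidar_cb)

    rospy.sleep(CAPTURE_SECONDS)
    lidar_sub.unregister()
    bag.close()

if __name__ == "__main__":
    rospy.init_node("station_capture")
    capture_one_station()
```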
Multi-source sensor calibration
-
To accurately determine the intrinsic and extrinsic parameters of various multi-source sensors, a calibration process was conducted involving the capture of images from visual sensors using a calibration plate, along with the corresponding three-dimensional point cloud data from the LiDAR. The calibration procedure was organized into three main stages to ensure accurate alignment: first, the intrinsic parameters and distortion coefficients of all cameras were calibrated; next, the extrinsic parameters between the LiDAR and the selected cameras were calibrated; and finally, the extrinsic parameters were optimized. The cameras used in this study include RGB-D cameras, multispectral cameras, and long-wave infrared (LWIR) cameras. The outcome of the calibration process includes the internal parameters and distortion coefficients for each camera lens, as well as the extrinsic parameters relating each camera to the LiDAR.
The calibration of internal parameters is essential for ensuring precise alignment and integration of visual sensors with other sensor modalities. In this study, the camera calibration toolbox in MATLAB (MathWorks, R2023a, USA) was employed to determine the internal parameters of the relevant visual sensors. For the calibration of thermal infrared cameras, a specialized checkerboard grid made of iron, with alternating black and white squares of materials with different thermal capacities, was used. This design facilitated the identification of feature points on the checkerboard, whereas conventional calibration boards often fail to accurately detect the checkerboard feature points. After obtaining the internal camera parameters, we proceeded with calibrating of the extrinsic parameters between the camera and the LiDAR. This was achieved by combining the calibration plate method[24] with a targetless calibration approach[25], establishing an initial external parameter. To account for potential inaccuracies in the external parameters, a manual calibration technique was employed to refine the transformation matrix[26]. Using the real-time transformed external parameters, the point cloud was projected onto the image, and the alignment of the projected image with the original image was continuously monitored and adjusted until optimal alignment was reached. This meticulous process ensures precise sensor data fusion and provides a robust foundation for subsequent data analysis and processing.
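The projection-and-inspection loop used during this manual refinement can be sketched as below, assuming a pinhole model with intrinsic matrix K, distortion coefficients, and a candidate 4 × 4 LiDAR-to-camera transform; the variable values are placeholders for the calibrated parameters described above.

```python
# Sketch of the manual extrinsic refinement check: project the LiDAR cloud into
# a camera image with the current parameters and inspect the overlay.
# K, dist, and T_cam_from_lidar are placeholders for the calibrated values.
import cv2
import numpy as np

def project_cloud_to_image(points_xyz, K, dist, T_cam_from_lidar, image):
    """Overlay LiDAR points on the camera image using the candidate extrinsics."""
    R = T_cam_from_lidar[:3, :3]
    t = T_cam_from_lidar[:3, 3]
    rvec, _ = cv2.Rodrigues(R)                 # rotation matrix -> rotation vector
    pts_2d, _ = cv2.projectPoints(points_xyz.astype(np.float64), rvec,
                                  t.astype(np.float64), K, dist)
    overlay = image.copy()
    h, w = image.shape[:2]
    for u, v in pts_2d.reshape(-1, 2):
        if 0 <= u < w and 0 <= v < h:
            cv2.circle(overlay, (int(u), int(v)), 1, (0, 0, 255), -1)
    return overlay

# During refinement, T_cam_from_lidar is nudged by small rotations/translations
# until the projected calibration plate coincides with its outline in the image.
```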
Multi-source sensor data fusion
-
Multi-source sensor data fusion can be primarily divided into two components: image preprocessing and data fusion. The image preprocessing phase focuses on extracting the region of interest (ROI) masks for plant areas from RGB images using segmentation techniques and color reduction methods. Since LiDAR's accuracy in single-plant segmentation is generally lower than that of image segmentation, and considering the superior efficiency of RGB image segmentation, this study employs the Fast-SAM[27] algorithm to segment crop regions within RGB images. This method not only generates crop masks but also facilitates the transfer of color information from the strawberry contour regions of the original images to the masks, achieving color restoration. The two-dimensional data from each camera serves as the basis for data fusion. Our goal is to align this two-dimensional information with the three-dimensional coordinate system of the LiDAR, integrating color data from RGB images, the pixel temperature matrices from thermal infrared images, and spectral reflectance data (including green, red, red-edge, and near-infrared bands) from multispectral images.
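A minimal sketch of this ROI step is given below, assuming the ultralytics FastSAM interface and a generic pretrained weights file; the segment-selection rule (here simply the union of returned masks) is a simplification of the pipeline's plant-mask logic.

```python
# Sketch of the ROI extraction step, assuming the ultralytics FastSAM interface
# and a generic weights file; the paper's exact model configuration may differ.
import cv2
import numpy as np
from ultralytics import FastSAM

model = FastSAM("FastSAM-s.pt")  # assumed pretrained weights

def plant_roi_mask(rgb_bgr):
    """Return a binary plant mask and the color-restored ROI image."""
    results = model(rgb_bgr, retina_masks=True, imgsz=1024, conf=0.4, iou=0.9)
    masks = results[0].masks.data.cpu().numpy()       # (n, H, W) candidate masks
    mask = (masks.sum(axis=0) > 0).astype(np.uint8)   # union of segments (simplified:
                                                      # the real pipeline keeps only plant segments)
    mask = cv2.resize(mask, (rgb_bgr.shape[1], rgb_bgr.shape[0]))
    roi = rgb_bgr * mask[..., None]                   # restore original colors inside the mask,
    return mask, roi                                  # black (0) everywhere else
```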
The comprehensive data fusion process, illustrated in Fig. 4, involves cameras with varying resolutions, fields of view, and coordinate systems. Using previously acquired external parameters, we standardize the camera data within the LiDAR coordinate system, enabling the fusion of positional, color, temperature, and spectral reflectance information. This methodology addresses challenges arising from the fusion of data from cameras with different resolutions, positional discrepancies, and variations in fields of view. To support this approach, we developed a standardized data structure capable of recording the fused multi-source sensor data, offering significant scalability to accommodate future sensor integrations.
Figure 4.
The synchronized data fusion process involves the pixel matrices m and n of the thermal infrared image, which represent the image width and height. Transformation matrices T1, T2, and T3 correspond to the transformations from the RGB camera, thermal infrared camera, and multispectral camera, respectively, to the LiDAR coordinate system.
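The per-point fusion and the standardized record can be sketched as follows. The structured dtype, field names, and nearest-pixel sampling are illustrative choices; the projections use the inverses of the camera-to-LiDAR transforms T1, T2, and T3 defined above, together with assumed pinhole intrinsics.

```python
# Sketch of the fusion step and the standardized per-point record. Assumes pinhole
# intrinsics K per camera and LiDAR-to-camera transforms (inverses of T1, T2, T3).
# Field names and nearest-pixel sampling are illustrative.
import numpy as np

FUSED_DTYPE = np.dtype([
    ("xyz", np.float32, 3),         # LiDAR coordinates (m)
    ("rgb", np.uint8, 3),           # color from the RGB camera
    ("temperature", np.float32),    # degC from the LWIR temperature matrix
    ("reflectance", np.float32, 4), # green, red, red-edge, NIR bands
])

def sample_camera(points_lidar, T_cam_from_lidar, K, attribute_map):
    """Project LiDAR points into one camera and sample its attribute map per point."""
    pts_h = np.hstack([points_lidar, np.ones((len(points_lidar), 1))])
    pts_cam = (T_cam_from_lidar @ pts_h.T).T[:, :3]
    uv = (K @ pts_cam.T).T
    uv = uv[:, :2] / uv[:, 2:3]                              # perspective division
    u = np.clip(uv[:, 0].astype(int), 0, attribute_map.shape[1] - 1)
    v = np.clip(uv[:, 1].astype(int), 0, attribute_map.shape[0] - 1)
    valid = pts_cam[:, 2] > 0                                # points in front of the camera
    return attribute_map[v, u], valid                        # validity mask ignored below for brevity

def fuse(points_lidar, cams):
    """cams: dict name -> (T_cam_from_lidar, K, attribute_map)."""
    fused = np.zeros(len(points_lidar), dtype=FUSED_DTYPE)
    fused["xyz"] = points_lidar
    fused["rgb"], _ = sample_camera(points_lidar, *cams["rgb"])
    fused["temperature"], _ = sample_camera(points_lidar, *cams["lwir"])
    for i, band in enumerate(("green", "red", "red_edge", "nir")):
        fused["reflectance"][:, i], _ = sample_camera(points_lidar, *cams[band])
    return fused
```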
Extraction of phenotypic parameters
-
To validate the multi-dimensional phenotypic indices of plants, we classified them into four distinct types: canopy width, Normalized Difference Water Index (NDWI), Normalized Difference Vegetation Index (NDVI), and Normalized Relative Canopy Temperature (NRCT), along with two ratio indices, P1 and P2. The P1 and P2 values are comprehensive indices used to assess the water regulation capacity and overall plant health of strawberry varieties. These indices were used to analyze variations in water availability and overall plant health across different species.
Canopy width is defined as the maximum distance between any two points on the plant's projection onto the ground, serving as an indicator of its growth status. The NDVI[28,29] is used to assess plant health by evaluating chlorophyll content through differential reflectance in the near-infrared (NIR) and red-light wavebands. The NDWI[29] is an important index for identifying water bodies, monitoring vegetation moisture content, and evaluating plant health. The NRCT[17] is similar to the Crop Water Stress Index (CWSI)[29], which measures water stress levels in plants and reflects their transpiration and water content. Finally, we developed the P1 and P2 ratio indices to concurrently monitor water usage and health variations across different strawberry cultivars. In this study, we employed data obtained through the fusion method described above, extracted the relevant attributes, and successfully constructed the phenotypic parameters.
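As an illustration, the sketch below derives canopy width and the indices of Eqns (1)−(5) below from the fused ROI record introduced earlier; the field names follow the fusion sketch above, and the per-plant aggregation (e.g., applying NRCT to the canopy-mean temperature) is an assumed simplification.

```python
# Illustrative per-plant parameter extraction from the fused ROI point cloud
# (see Eqns (1)-(5) below). The record layout follows the fusion sketch above
# and is an assumption, not the study's exact code.
import numpy as np
from scipy.spatial import ConvexHull
from scipy.spatial.distance import pdist

def canopy_width(fused):
    """Maximum distance between any two points of the ground-plane projection."""
    xy = fused["xyz"][:, :2]                 # project onto the ground plane (z assumed vertical)
    hull = ConvexHull(xy)                    # only hull vertices can realize the maximum
    return pdist(xy[hull.vertices]).max()    # width in metres

def plant_indices(fused):
    green, red, _red_edge, nir = fused["reflectance"].mean(axis=0)  # red-edge unused by these indices
    t = fused["temperature"]
    ndvi = (nir - red) / (nir + red)
    ndwi = (green - nir) / (green + nir)
    nrct = (t.mean() - t.min()) / (t.max() - t.min())  # Eqn (3) applied to the canopy-mean temperature
    return {"NDVI": ndvi, "NDWI": ndwi, "NRCT": nrct,
            "P1": ndvi / nrct, "P2": ndwi / nrct,
            "canopy_width": canopy_width(fused)}
```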
NDVI, NDWI, NRCT, P1, P2 are calculated as shown in Eqns (1)−(5):
${\rm NDVI} = \dfrac{NIR - Red}{NIR + Red}$ (1)
${\rm NDWI} = \dfrac{Green - NIR}{Green + NIR}$ (2)
${\rm NRCT} = \dfrac{T_i - T_{min}}{T_{max} - T_{min}}$ (3)
${\rm P1} = \dfrac{NDVI}{NRCT}$ (4)
${\rm P2} = \dfrac{NDWI}{NRCT}$ (5)
where $NIR$, $Red$, and $Green$ are the reflectance values of the near-infrared, red, and green light bands, respectively; $T_i$ is the temperature value at any position of the plant canopy; and $T_{min}$ and $T_{max}$ are the minimum and maximum temperature values of a specific region of the plant canopy.
Evaluation indicators
-
The efficacy of the algorithms in predicting maximum canopy size and mean temperature was assessed using the Root Mean Square Error (RMSE) and the Coefficient of Determination (R²), with comparisons made against manually recorded data. The mean temperature measurements were obtained using a FOTRIC-280 thermal imager. Additionally, the performance of the clustering algorithm was evaluated using the Adjusted Rand Index (ARI), a modified version of the Rand Index (RI) that quantifies clustering effectiveness by partitioning sample points into multiple clusters. The ARI accounts for the likelihood of random assignments and yields values ranging from −1 to 1. A value close to 1 indicates strong alignment between the clustering results and the actual conditions, while a value near 0 suggests that the clustering outcome is comparable to random assignment. A value close to −1 signifies a complete discordance between the clustering results and the actual conditions. These metrics allow for a comprehensive evaluation and validation of the algorithm's accuracy and reliability.
RMSE, R2, RI, ARI are calculated as shown in Eqns (6)−(9):
${\rm RMSE} = \sqrt{\dfrac{1}{n}\sum_{i=1}^{n}\left(P_i - M_i\right)^2}$ (6)
${\rm R^2} = \dfrac{\sum_{i=1}^{n}(P_i - P_a)(M_i - M_a)}{\sqrt{\sum_{i=1}^{n}(P_i - P_a)^2 \sum_{i=1}^{n}(M_i - M_a)^2}}$ (7)
${\rm RI} = \dfrac{a + b}{C_n^2}$ (8)
${\rm ARI} = \dfrac{RI - E(RI)}{max(RI) - E(RI)}$ (9)
In this context, $P_i$ and $M_i$ represent the algorithmic and actual measurement values, respectively, while $P_a$ and $M_a$ denote the means of the algorithmic and actual measurements, respectively. The variable $a$ represents the number of pairs of points classified within the same cluster in both the real and experimental scenarios, while $b$ indicates the number of pairs classified in different clusters in both cases. Furthermore, $C_n^2$ represents the total number of combinations of any two samples drawn from a set of $n$ samples, where $n$ refers to the number of samples within a specific category ($n$ = 3 in this study). Additionally, $E(RI)$ denotes the expected Rand index under random conditions, while $max(RI)$ represents the Rand index in an optimal scenario.
-
In this study, we systematically optimized the external parameters between each camera and the LiDAR system, with the results presented in Fig. 5. The calibration of external parameters for multispectral and thermal infrared images is more complex than for RGB images due to their lower resolution and lack of distinct contours. As a result, accurately optimizing the initial external parameters of these sensors is crucial.
Figure 5.
Schematic diagram of the fine-tuned external parameter: (a) before adjustment; (b) after adjustment; (c) local zoom in before adjustment; (d) local zoom in after adjustment.
We examined the external parameters between the green-band lens of the multispectral camera and the LiDAR to demonstrate the optimization process. Initially, we projected the point cloud onto the image using the transformation matrix. During this projection, a noticeable offset appeared in the red-framed calibration plate (Fig. 5a); Fig. 5c provides a magnified view of this region. After adjustment, the offset was nearly eliminated (Fig. 5b), as confirmed by the enlarged view of the corrected area in Fig. 5d. This optimization process yielded more accurate external parameters, establishing a solid foundation for subsequent multi-source sensor data fusion. This improvement not only enhances the accuracy of data fusion but also supports further data analysis and processing.
Visualization of multi-source sensor data fusion
-
In the data fusion process, we first employed the Fast-SAM model to segment the RGB images, isolating the region of interest (ROI) corresponding to the plants. This step ensured accurate ROI extraction before integrating data from lower-resolution sensors. After segmentation, the RGB images were integrated with data from additional sensors within a unified field of view using the fusion methodology described above. During this step, the areas outside the plant mask were standardized to black, and these regions were removed during the fusion process, resulting in the generation of ROI point clouds for the strawberry plants. These point clouds included thermal, green, red, red-edge, and near-infrared spectral bands, which were enhanced using pseudo-color processing (see Fig. 6). The resulting fused point cloud data were organized into a customized, standardized data structure, consolidating location, color, temperature, and spectral data. This approach not only improves the data's usability but also provides a comprehensive foundation for subsequent analysis and applications.
Figure 6.
Acquisition of plant ROI by RGB image segmentation after fusion of individual component point clouds: (a) RGB; (b) thermal; (c) green light band; (d) red light band; (e) red edge band; (f) NIR band.
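A minimal sketch of the pseudo-color rendering used for such inspection is shown below, assuming Open3D for point-cloud I/O and a Matplotlib colormap; the attribute scaling and colormap choice are illustrative.

```python
# Minimal pseudo-color rendering of one fused attribute (e.g., the temperature
# field) for visual inspection. Open3D and the "inferno" colormap are assumed
# tooling choices, not the study's exact visualization code.
import numpy as np
import open3d as o3d
from matplotlib import colormaps

def save_pseudocolor_cloud(fused, attribute="temperature", path="roi_thermal.ply"):
    values = fused[attribute].astype(np.float64)
    span = values.max() - values.min()
    norm = (values - values.min()) / (span + 1e-9)         # scale attribute to [0, 1]
    colors = colormaps["inferno"](norm)[:, :3]             # RGB pseudo-colors per point
    pcd = o3d.geometry.PointCloud()
    pcd.points = o3d.utility.Vector3dVector(fused["xyz"].astype(np.float64))
    pcd.colors = o3d.utility.Vector3dVector(colors)
    o3d.io.write_point_cloud(path, pcd)
```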
Phenotype extraction accuracy
-
This study examined 72 strawberry plants, correlating the crown width and average temperature extracted from point clouds fused with multi-source sensor data with the corresponding manually measured values. A linear regression model was used for the analysis, with the horizontal axis representing manually obtained values and the vertical axis representing point cloud-derived values. The RMSE and R² were computed for both parameters. As shown in Fig. 7a, the algorithm's estimate of crown width demonstrated high accuracy, with an R² value of 0.9864 and an RMSE of 0.0185 m. In contrast, as shown in Fig. 7b, the estimate of average temperature exhibited lower accuracy, with an R² value of 0.8324 and an RMSE of 0.1732 °C. Despite this, the linear correlation between the manually measured and algorithm-derived crown widths remained strong, indicating that our methodology can reliably estimate strawberry crown dimensions. However, the relatively high RMSE in the algorithmic temperature measurements compared to the actual values may be due to the averaging process used for the temperature readings of the potted strawberries and their surrounding environment, which likely introduced some error.
Figure 7.
Linear fit of the algorithm to the manually measured values of canopy width, and mean temperature for 72 strawberry plants.
Inversion of phenotypic parameters
-
This study conducted a comprehensive analysis of 72 strawberry plants to assess their physiological and health status. The analysis included the calculation of several indices—such as the NDVI, NDWI, NRCT, and composite ratio indices P1 and P2—to examine varietal differences in moisture content and health status.
As shown in Fig. 8, the NDVI values for strawberry plants ranged from 0.2 to 0.8, which is typical for healthy plants, with notable differences in health status observed across varieties. The NDWI values ranged from −0.7 to −0.1, indicating that most plants maintained moderate water content. Negative NDWI values reflected lower reflectance in the green band compared to the near-infrared (NIR) band, and the significant inter-varietal differences in NDWI highlighted variations in water retention capacity. In comparison to NDVI, NDWI showed less variability and greater consistency within individual cultivars. The NRCT values ranged from 0.1 to 0.7, with considerable differences across cultivars, reflecting their varying capacities for water stress adaptation and thermoregulation. While no distinct patterns were observed in crown distribution among varieties, crown consistency within individual cultivars outperformed NDVI, NDWI, and NRCT, suggesting a high degree of uniformity within each variety.
Figure 8.
Variability of NDVI, NDWI, NRCT, and width among varieties with consistency among the same varieties.
A higher P1 value combined with a lower P2 value indicates superior performance in both aspects. As shown in Fig. 9, significant differences in the mean P1 and P2 values were observed across varieties, highlighting variability in water-use efficiency and health status. Analysis of the distribution on the left side of Fig. 9 reveals that variety 13 exhibited the highest P1 value and the lowest P2 value, suggesting optimal water regulation and plant health. The weight distribution on the right side of the figure further reveals that the color intensity of the three plants within each variety reflects their contributions to the P1 and P2 indices. Darker colors indicate greater contributions, allowing for the identification of standout plants within each variety. This weighting analysis plays a critical role in guiding targeted selection and optimization processes.
In summary, the selection of optimal varieties can be based on the combination of the highest P1 value and the lowest P2 value. The weight distribution graph offers a refined understanding of intra-varietal performance differences, providing valuable insights for informed decisions in strawberry breeding and cultivation management.
Identification of strawberry varieties using multiple phenotypic parameters
-
As shown in Fig. 10, an automatic cluster analysis was conducted using the K-Means algorithm to assess crown width, P1 value, and P2 value across 72 strawberry plants. The results revealed that clustering based solely on crown width identified significantly fewer varieties than the anticipated 24, with an adjusted Rand index (ARI) of only 0.05. In contrast, when the P1 and P2 values—derived from the integration of NDVI, NDWI, and NRCT—were used, the number of identified varieties increased to 18 and 14, respectively, with corresponding ARI values of 0.69 and 0.46. However, these results still fell short of the target of 24 varieties.
As shown in Fig. 11, we examined various combinations of phenotypic parameters for a more comprehensive clustering analysis. When the P1 and P2 values were subjected to two-dimensional automatic clustering, the number of identified varieties increased to 20, and the ARI improved to 0.75. By integrating crown width with the P1 and P2 values and applying principal component analysis (PCA), the number of varieties further increased to 23, achieving an ARI of 0.80. Although this result approached the expected 24 varieties, it indicated room for further improvement.
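The combined clustering experiment can be sketched as follows, using scikit-learn's StandardScaler, PCA, K-Means, and adjusted Rand score. Fixing the number of clusters and the PCA dimensionality here is a simplification, since the study's automatic clustering selects the number of varieties itself.

```python
# Sketch of the combined clustering experiment: standardize the per-plant
# parameters, reduce with PCA, cluster with K-Means, and score against the
# true variety labels with the adjusted Rand index. The fixed cluster count
# and PCA dimensionality are simplifying assumptions.
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
from sklearn.cluster import KMeans
from sklearn.metrics import adjusted_rand_score

def cluster_varieties(features, true_labels, n_clusters=24, n_components=2):
    """features: (n_plants, n_parameters) array, e.g. [NDVI, NDWI, NRCT, width, P1, P2]."""
    X = StandardScaler().fit_transform(features)
    X = PCA(n_components=n_components).fit_transform(X)
    pred = KMeans(n_clusters=n_clusters, n_init=10, random_state=0).fit_predict(X)
    return pred, adjusted_rand_score(true_labels, pred)

# Example with six parameters for 72 plants (24 varieties x 3 pots):
# labels_pred, ari = cluster_varieties(param_matrix, variety_labels)
```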
Ultimately, by combining multi-dimensional parameters (NDVI, NDWI, NRCT, crown width, P1, and P2), we successfully classified the plants into 23 distinct varieties through a comprehensive clustering analysis, achieving a high ARI of 0.94. Nonetheless, a slight discrepancy remained, likely due to insufficient differentiation among certain varieties, which prevented the clustering results from fully aligning with expectations. Improving the resolution of phenotypic measurements or incorporating additional parameters could enhance the model's ability to resolve finer distinctions between varieties, potentially achieving more accurate results.
-
In this study, we conducted a comparative analysis of the performance of the robotic phenotyping platform and the UGV-LiDAR system previously established by our team. The UGV-LiDAR system requires a pause in operation until the slide process is completed, achieving a data acquisition rate of 810 plants/h. In contrast, our platform adjusts the height of the elevator bar to accommodate two plant pots, optimizing the sensor's field of view. Due to the sparse nature of the LiDAR single-frame point cloud, our system necessitates a 10-s stationary period to accumulate a denser point cloud, resulting in a slightly lower acquisition speed of 600 plants/h. Despite this minor reduction in speed, our platform is more cost-effective, with an estimated cost of approximately USD 8,200, significantly lower than the USD 11,780 required for the UGV-LiDAR system. As shown in Table 2, although our method may not be the fastest in terms of data acquisition speed, it outperforms the UGV-LiDAR system in point cloud processing time and provides overall cost efficiency. Furthermore, our platform demonstrates superior performance in multi-dimensional phenotyping resolution efficiency, data consistency, and data scalability. Our method enables the rapid extraction of individual components from the fused point cloud and provides efficient resolution of phenotypic parameters. Meanwhile, it also ensures data consistency by aligning all camera data with the LiDAR coordinate system and simplifies the integration of new sensor data by requiring only a single calibration of internal and external parameters. This facilitates the seamless incorporation of new data into the existing system and enhances the scalability of data dimensionality. In contrast, while the UGV-LiDAR system can collect RGB, thermal infrared, multispectral, and LiDAR data, it lacks integration and requires additional, time-consuming data fusion, which compromises data consistency and reduces the efficiency of subsequent multi-dimensional phenotypic parameter analyses.
Table 2. Comparison of various performance aspects of the robotic phenotyping platform with the UGV-LiDAR.
Comparison term: Ours / UGV-LiDAR[5]
Efficiency of data acquisition: 600 plants/h / 810 plants/h
Pipeline processing time: 2,894 ms/plant / 13,672 ms/plant
Cost (USD): 8,200 / 11,780
Efficiency of phenotyping: High / Low
Data consistency: High / Low
Data extension: High / Low
Storage space usage: Small / Large

Moreover, our customized standardized data structure offers significant advantages in storage space efficiency. Typically, the multi-sensor image data, temperature matrix files, and point cloud files for a single strawberry plant occupy approximately 30 MB of storage, with this requirement increasing with image resolution and point cloud quality. In contrast, our tailored data structure consolidates all sensor attributes and aligns the data within a unified coordinate system, occupying only about 5 MB of storage space. This size is primarily determined by the point cloud file, with minimal impact from the other image and temperature matrix files, demonstrating exceptional storage performance. This advantage is crucial for future data storage efforts.
Advancement of methodology
-
This study presents a multi-source sensor data fusion system developed for robotic phenotyping platforms, designed to acquire high-throughput, multi-dimensional phenotyping data and facilitate the rapid analysis of various phenotypic parameters. The system integrates multiple sensors, including an RGB-D camera, a thermal infrared camera, a four-channel high-frame-rate multispectral camera, and LiDAR, all mounted on the elevation system of the robotic platform. The platform's elevation and robot movement are controlled via a remote control system, facilitating the efficient acquisition of multi-dimensional phenotypic data.
Additionally, the system includes a multi-dimensional phenotype analysis pipeline for automated phenotypic assessment. A Fast-SAM image segmentation model is employed to extract regions of interest (ROIs) from RGB images, which are then aligned with data from other sensors in relation to the LiDAR coordinate system. To facilitate data fusion, a standardized data structure encompassing positional, color, thermal, and spectral information was developed.
The study also examines multi-dimensional phenotypic parameters derived from multi-source data collected from 72 potted strawberry plants grown in a greenhouse. The estimates of canopy width and mean temperature showed high accuracy when compared against manual measurements. Vegetation indices, including NDVI, NDWI, and NRCT, were used to construct ratio indices P1 and P2. Both individual and combined clustering analyses were performed on these parameters, with the latter proving more effective. The optimal clustering outcome identified 23 out of the 24 strawberry varieties, achieving an ARI of 0.94. These results suggest that a more comprehensive combination of phenotypic parameters improves clustering accuracy and highlight the effectiveness of multi-source sensor data fusion in enhancing variety identification.
In conclusion, compared to the previously used UGV-LiDAR system, the proposed system offers advantages in terms of cost-effectiveness, efficient pipelined acquisition of plant ROI point clouds, and improved multi-dimensional phenotypic resolution while also demonstrating strong data consistency and scalability.
Limitations of method
-
The primary limitation of this study lies in the quality of the ground soil, which restricts the robot's movement. For instance, uneven terrain or low-lying areas can cause the robot to lurch during motion, leading to poor data acquisition. Additionally, the robot's anti-vibration system requires further optimization, as its jerky movements may complicate future efforts to construct maps using odometry. Furthermore, this study only integrates multi-source sensor data for a single frame, without constructing a real-time map during the robot's operation. Moreover, we focused on extracting primary multi-dimensional phenotypic indices without delving into the impact of spectral and thermal data on the plant's entire growth cycle. Thus, this study serves primarily as a reference for multi-source sensor data fusion in multi-dimensional phenotyping research. Additionally, the RMSE between the algorithmic and actual temperature measurements was substantial. This discrepancy may be attributed to the use of the average temperature of the potted strawberries and their surroundings in the actual measurements, which introduced some errors. Finally, this work was conducted exclusively under controlled greenhouse conditions, with plans for future experiments in outdoor environments.
Future research will aim to implement real-time fusion of multi-source sensors. By applying the odometry data for position tracking, we will stitch together each frame of time-synchronized point clouds from multi-source sensor data, remove the robot body from the field of view in each frame, and construct a map of the fused point cloud. High temporal alignment of the fused point clouds collected over time will facilitate multi-dimensional monitoring of crop growth states. Specifically, the research will focus on the following areas:
1. Further optimization of image segmentation and data fusion algorithms to enhance real-time processing speed and potentially using custom-labeled crop datasets;
2. Optimization of the temperature measurement method to reduce errors and improve average temperature estimation accuracy, possibly by utilizing high-precision temperature sensors;
3. Integration of additional sensors, such as hyperspectral cameras, to capture richer spectral information and improve crop phenotype detection;
4. Large-scale validation under diverse crop types and outdoor field environments to ensure the method's broad applicability and robustness;
5. Development of an integrated agricultural management platform to streamline the acquisition, processing, analysis, and multi-dimensional phenotypic parameter resolution, enabling user-friendly operation and management.
-
This study presents a novel robotic phenotyping platform that integrates multi-source sensor data fusion for high-throughput acquisition and analysis of multi-dimensional plant phenotypic data. A comparison with the previous UGV-LiDAR system reveals that our platform offers significant advantages in data acquisition efficiency, cost-effectiveness, and multi-dimensional phenotyping. While the acquisition speed is slightly lower, the system ensures superior data consistency, scalability, and storage efficiency, thanks to a customized data structure and the ease of integrating new sensors. Additionally, the platform accurately extracts and analyzes phenotypic parameters, such as canopy width, mean temperature, and various vegetation indices, and constructs relevant composite indices (P1 and P2) for cluster analysis. The results demonstrate that including more phenotypic parameters in the analysis yields more accurate clustering results, further highlighting the value of multi-source sensor data fusion in enhancing the accuracy of strawberry variety identification. Future research will focus on optimizing real-time sensor fusion, expanding sensor types, developing multimodal crop population models, and conducting large-scale validation in diverse environments, to create an integrated agricultural management platform for practical crop monitoring.
This work was partially supported by the National Key R&D Program of China (2022YFD2002300), Construction of Collaborative Innovation Center of Beijing Academy of Agricultural and Forestry Sciences (KJCX20240406), and the Natural Science Research Project of Anhui Provincial Education Department (2023AH040009).
-
The authors confirm contributions to the paper as follows: study design: Guo X, Gou W, Li Y; developing the rail-based phenotyping platform: Tan X, Xing X; data acquisition: Zuo Q, Tan X; data analysis: Li Y, Wen W, Yang S, Gou W, Guo X; realizing the data fusion method: Tan X, Guo X, Liang D, Huang L; manuscript writing and revision: Li Y, Gou W, Tan X. All authors reviewed the results and approved the final version of the manuscript.
-
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
-
The authors declare that they have no conflict of interest.
-
# Authors contributed equally: Xi Tan, Yinglun Li, Wenbo Gou
- Copyright: © 2025 by the author(s). Published by Maximum Academic Press, Fayetteville, GA. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.
-
About this article
Cite this article
Tan X, Li Y, Gou W, Yang S, Wen W, et al. 2025. Design and application of a multi-source sensor data fusion system based on a robot phenotype platform. Fruit Research 5: e032 doi: 10.48130/frures-0025-0023
- Received: 20 January 2025
- Revised: 17 March 2025
- Accepted: 09 April 2025
- Published online: 20 August 2025
Abstract: The compact, high-throughput phenotyping platform, characterized by its portability and small size, is well-suited for crop phenotyping across diverse environments. However, integrating multi-source sensors to achieve synchronized data acquisition and analysis poses significant challenges due to constraints in load capacity and available space. To address these issues, we developed a robotic platform specifically designed for phenotyping greenhouse strawberries. This system integrates an RGB-D camera, a multispectral camera, a thermal camera, and a LiDAR sensor, enabling the unified analysis of data from these sources. The platform accurately extracted key phenotypic parameters, including canopy width (R² = 0.9864, RMSE = 0.0185 m) and average temperature (R² = 0.8056, RMSE = 0.1732 °C), with errors maintained below 5%. Furthermore, it effectively distinguished between different strawberry varieties, achieving an Adjusted Rand Index of 0.94, underscoring the value of detailed phenotyping in variety differentiation. Compared to conventional UGV-LiDAR systems, the proposed platform is more cost-effective, efficient, and scalable, with enhanced data consistency, making it a promising solution for agricultural applications.
-
Key words:
- Robotics /
- Phenotyping platform /
- Multi-source sensors /
- Data fusion /
- Phenotyping /
- Variety identification