Estimating end-effector pose for pruning tall-spindle apple trees using RGB-D images

KANG Feng; WANG Jiacheng; WANG Yaxiong; WANG Ning

doi:10.11975/j.issn.1002-6819.202502099

Transactions of the Chinese Society of Agricultural Engineering > 2025 > 41(12): 77-85. > DOI: 10.11975/j.issn.1002-6819.202502099

KANG Feng, WANG Jiacheng, WANG Yaxiong, et al. Estimating end-effector pose for pruning tall-spindle apple trees using RGB-D images[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2025, 41(12): 77-85. DOI: 10.11975/j.issn.1002-6819.202502099

Citation:

PDF (4081 KB)

Estimating end-effector pose for pruning tall-spindle apple trees using RGB-D images

1.
School of Technology, Beijing Forestry University, Beijing 100083, China
2.
Key Lab of State Forestry and Grassland Administration on Forestry Equipment and Automation, Beijing 100083, China

More Information

Received Date: February 19, 2025
Revised Date: May 21, 2025
Available Online: June 11, 2025

Graphical Abstract

Abstract

Abstract

Pruning is one of the most critical steps in the cultivation of fruit trees. Current pruning robots have realized to recognize the side branch, and then locate the pruning points in recent years. However, it is still lacking in the effective end-effector pose estimation in intelligent selective pruning. This study aims to propose the pruning point localization and end-effector pose estimation using RGB-D images. The research object was also selected as the dormant high spindle-shaped apple trees. A depth camera (Intel RealSense D435i) was utilized to capture the RGB and depth data. A point-to-plane mapping was introduced to derive the 3D orientation and position of the pruning pose from the detected pixel coordinates and depth information. The spatial location was predicted for the cutting plane’s orientation relative to the pruning point — a key requirement for autonomous robotic pruning. In the perception pipeline, an improved version of the YOLOv8-seg model was employed to segment the trunk and primary branch regions from the RGB images. Furthermore, it was often lacking on the clear boundary features of the branch base masks, due to the unconventional annotation. The original YOLOv8-seg model failed to accurately locate and then segment these regions. A Global Attention Mechanism (GAM) module was introduced into the neck network of YOLOv8-seg. Each C2f block was then integrated across all feature levels. The feature maps were also recalibrated using channel-wise multiplication, in order to enhance the salient features while suppressing the irrelevant ones. The multi-scale information and reasoning were significantly enhanced for the high accuracy of the segmentation. The improved YOLOv8-seg model was achieved in a mask-level precision of 95.31%, recall of 93.79%, and an mAP_0.5 of 93.86%, thus outperforming the original YOLOv8-seg by 0.79 percentage points in precision, 2.63 percentage points in recall, and 1.47 percentage points in mAP_0.5. Once the trunk and primary branches were segmented, the OpenCV-based image processing was applied to calculate the diameters and spacing of the branches. The potential pruning points were identified to fit the rectangles around the base regions of the side branches, according to the empirical pruning. Field trials were carried out to validate the effectiveness of this approach. A better performance was achieved, with a decision accuracy of 88.3% and an average processing speed of 2.1 seconds per image. Extensive testing showed that the point-to-plane mapping of the pose estimation was achieved with a success rate of 89.9%, with an average computation time of 3.3 s per image. In conclusion, a framework was presented for the intelligent selective pruning of the apple trees using RGB-D input, in order to realize the accurate pruning point localization and end-effector pose estimation. Advanced deep learning models were also integrated with the image processing. The pruning pose can be expected to align with the specific angles for the tree's health. The point-to-plane mapping can be expected to determine the spatial location of the pruning points. The optimal orientation of the cutting plane can also be calculated to fully meet the horticultural requirements of the pruning actions. Specifically, the normal vector of the cutting plane was derived, according to the detected pruning points and surrounding branch structures. The manipulator's reachability and safety distances can be considered to generate feasible pruning poses for practical execution. The pruning end-effector pose estimation can also provide strong support for developing robotic pruning.
- image processing,
- instance segmentation,
- RGB-D,
- YOLOv8-seg,
- apple trees,
- pruning,
- pose estimation

FullText(HTML)

References (29)

References

[1]	李双双,刘卫柏,蒋健. 农业机械化可以解决农业劳动力短缺吗?[J]. 中国农机化学报,2024,45(7):316-322. LI Shuangshuang, LIU Weibo, JIANG Jian. Can agricultural mechanization solve the shortage of agricultural labor?[J]. Journal of Chinese Agricultural Mechanization, 2024, 45(7): 316-322. (in Chinese with English abstract)
[2]	梁海忠,范崇辉,王琰,等. 苹果高纺锤形树体枝量、果实产量与品质的研究[J]. 西北农林科技大学学报(自然科学版),2010,38(7):123-128. LIANG Hailzhong, FAN Chonghui, WANG Yan, et al. Research on shoot number, fruit yield and quality of high-spindle apple trees[J]. Journal of Northwest A& F University(Natural Science Edition), 2010, 38(7): 123-128. (in Chinese with English abstract)
[3]	CHEN Z, TING D, NEWBURY R, et al. Semantic segmentation for partially occluded apple trees based on deep learning[J]. Computers and Electronics in Agriculture, 2021, 181: 105952. DOI: 10.1016/j.compag.2020.105952
[4]	SAPKOTA R, AHMED D, KARKEE M. Comparing YOLOv8 and Mask R-CNN for instance segmentation in complex orchard environments[J]. Artificial Intelligence in Agriculture, 2024, 13: 84-99. DOI: 10.1016/j.aiia.2024.07.001
[5]	梁喜凤,章鑫宇,王永维. 基于改进Mask R-CNN的番茄侧枝修剪点识别方法[J]. 农业工程学报,2022,38(23):112-121. DOI: 10.11975/j.issn.1002-6819.2022.23.012 LIANG Xifeng, ZHANG Xinyu, WANG Yongwei. Recognition method for the pruning points of tomato lateral branches using improved Mask R-CNN[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2022, 38(23): 112-121. (in Chinese with English abstract) DOI: 10.11975/j.issn.1002-6819.2022.23.012
[6]	YOU A, GRIMM C, SILWAL A, et al. Semantics-guided skeletonization of upright fruiting offshoot trees for robotic pruning[J]. Computers and Electronics in Agriculture, 2022, 192: 106622. DOI: 10.1016/j.compag.2021.106622
[7]	MA B, DU J, WANG L, et al. Automatic branch detection of jujube trees based on 3D reconstruction for dormant pruning using the deep learning-based method[J]. Computers and Electronics in Agriculture, 2021, 190: 106484. DOI: 10.1016/j.compag.2021.106484
[8]	付昱兴,李承明,朱江,等. Alpha-shape算法构建枣树点云三维模型[J]. 农业工程学报,2020,36(22):214-221. FU Yuxing, LI Chengming, ZHU Jiang, et al. Three-dimensional model construction method and experiment of jujube tree point cloud using Alpha-shape algorithm[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2020, 36(22): 214-221. (in Chinese with English abstract)
[9]	马保建,鄢金山,王乐,等. 基于语义分割的矮化密植枣树修剪枝识别与骨架提取[J]. 农业机械学报,2022,53(8):313-319. MA Baojian, YAN Jinshan, WANG Le, et al. Method for detection and skeleton of pruning branch of Jujube tree based on semantic segmentation for dormant pruning[J]. Transactions of the Chinese Society for Agricultural Machinery, 2022, 53(8): 313-319. (in Chinese with English abstract)
[10]	TONG S, ZHANG J, LI W, et al. An image-based system for locating pruning points in apple trees using instance segmentation and RGB-D images[J]. Biosystems Engineering, 2023, 236: 277-286. DOI: 10.1016/j.biosystemseng.2023.11.006
[11]	TONG S, YUE Y, LI W, et al. Branch identification and junction points location for apple trees based on deep learning[J]. Remote Sensing, 2022, 14(18): 4495.
[12]	王志富,马文强,项斌斌,等. 基于改进UNet模型的核桃树枝条分叉点定位与修剪位置选择[J]. 农业工程学报,2025,41(7):165-172. DOI: 10.11975/j.issn.1002-6819.202410149 WANG Zhifu, MA Wenqiang, XIANG Binbin, et al. Locating branch bifurcation points to select the pruning position of walnut trees using improved UNet model[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2025, 41(7): 165-172. (in Chinese with English abstract) DOI: 10.11975/j.issn.1002-6819.202410149
[13]	ZAHID A, MAHMUD M S, HE L, et al. Technological advancements towards developing a robotic pruner for apple trees: A review[J]. Computers and Electronics in Agriculture, 2021, 189: 106383. DOI: 10.1016/j.compag.2021.106383
[14]	郑太雄,江明哲,冯明驰. 基于视觉的采摘机器人目标识别与定位方法研究综述[J]. 仪器仪表学报,2021,42(9):28-51. ZHENG Taixiong, JIANG Mingzhe, FENG Mingchi. Vision based target recognition and location for picking robot: A review[J]. Chinese Journal of Scientific Instrument, 2021, 42(9): 28-51. (in Chinese with English abstract)
[15]	杨震宇,汪小旵,祁子涵,等. 基于改进YOLOv8的草莓识别与果梗采摘关键点检测[J]. 农业工程学报,2024,40(18):167-175. DOI: 10.11975/j.issn.1002-6819.202405044 YANG Zhenyu, WANG Xiaochan, QI Zihan, et al. Recognizing strawberry to detect the key points for peduncle picking using improved YOLOv8 model[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2024, 40(18): 167-175. (in Chinese with English abstract) DOI: 10.11975/j.issn.1002-6819.202405044
[16]	张勤,庞月生,李彬. 基于实例分割的番茄串视觉定位与采摘姿态估算方法[J]. 农业机械学报,2023,54(10):205-215. DOI: 10.6041/j.issn.1000-1298.2023.10.020 ZHANG Qin, PANG Yuesheng, LI Bin. Visual positioning and picking pose estimation of tomato clusters based on instance segmentation[J]. Transactions of the Chinese Society for Agricultural Machinery, 2023, 54(10): 205-215. (in Chinese with English abstract) DOI: 10.6041/j.issn.1000-1298.2023.10.020
[17]	ZHOU L, JIN S, WANG J, et al. 3D positioning of Camellia oleifera fruit-grabbing points for robotic harvesting[J]. Biosystems Engineering, 2024, 246: 110-121. DOI: 10.1016/j.biosystemseng.2024.07.019
[18]	高嘉正,李文涛,罗陈迪,等. 基于语义分割和几何分析的火龙果果实与枝条的三维重建[J]. 农业工程学报,2024,40(12):157-164. DOI: 10.11975/j.issn.1002-6819.202402104 GAO Jiazheng, LI Wentao, LUO Chendi, et al. Three-dimensional reconstruction for dragon fruits and branches using semantic segmentation and geometric analysis[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2024, 40(12): 157-164. (in Chinese with English abstract) DOI: 10.11975/j.issn.1002-6819.202402104
[19]	INTEL REALSENSE. Depth Camera D435i[EB/OL]. (2018-11-28)[2024-10-06]. https://www.intelrealsense.com/depth-camera-d435i/
[20]	ULTRALYTICS. YOLOv8[EB/OL]. (2023-11-12)[2024-04-09]. https://github.com/ultralytics/ultralytics
[21]	LIU Y, SHAO Z, HOFFMANN N. Global attention mechanism: Retain information to enhance channel-spatial interactions[EB/OL]. (2021-12-10)[2024-10-06]. https://doi.org/10.48550/arXiv.2112.05561
[22]	张林森,马锋旺,李丙智,等. 国外苹果高纺锤形整形技术与应用[J]. 中国果树,2007(6):69-70. DOI: 10.3969/j.issn.1000-8047.2007.06.043 ZHANG Linsen, MA Fengwang, LI Bingzhi, et al. High spindle training systems for apple trees: Techniques and applications[J]. China Fruits, 2007(6): 69-70. (in Chinese with English abstract) DOI: 10.3969/j.issn.1000-8047.2007.06.043
[23]	陈亚娅,张斌,付昱兴,等. 枣树修剪机械臂的设计与运动学分析[J]. 农机化研究,2021,43(10):7-11. DOI: 10.3969/j.issn.1003-188X.2021.10.002 CHEN Yaya, ZHANG Bin, FU Yuxing, et al. Design and kinematics analysis of jujube pruning manipulator[J]. Journal of Agricultural Mechanization Research, 2021, 43(10): 7-11. (in Chinese with English abstract) DOI: 10.3969/j.issn.1003-188X.2021.10.002
[24]	何雪涛,程源,黄钟,等. 齐次坐标变换在空间机构分析中的应用[J]. 北京化工大学学报(自然科学版),1999,26(1):41-44. HE Xuetao, CHENG Yuan, HUANG Zhong, et al. Transformation of homogeneous coordinates and its application in the analysis of spatial mechanism[J]. Journal of Beijing University of Chemical Technology(Natural Science Edition), 1999, 26(1): 41-44. (in Chinese with English abstract)
[25]	BOLYA D, ZHOU C, XIAO F, et al. YOLACT++ Better Real-time instance segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44(2): 1108-1121. DOI: 10.1109/TPAMI.2020.3014297
[26]	HE K, GKIOXARI G, DOLLáR P, et al. Mask R-CNN[C]//Proceedings of 2017 Conference on Computer Vision (ICCV). Venice: IEEE, 2017: 2980-2988.
[27]	WANG X, ZHANG R, KONG T, et al. SOLOv2: Dynamic and Fast Instance Segmentation[EB/OL]. (2020-10-23)[2023-12-04]. https://doi.org/10.48550/arXiv.2003.10152
[28]	SELVARAJU R R, COGSWELL M, DAS A, et al. Grad-CAM: Visual explanations from deep networks via gradient-based localization[J]. International Journal of Computer Vision, 2020, 128(2): 336-359. DOI: 10.1007/s11263-019-01228-7
[29]	宋振帅,周艳,钟灵,等. 果树枝条识别与修剪点坐标确定方法[J]. 湖北农业科学,2024,63(8):39-46. SONG Zhenshuai, ZHOU Yan, ZHONG Ling, et al. Identification of branches of fruit trees and deter- mination of coordinates of pruning points[J]. Hubei Agricultural Sciences, 2024, 63(8): 39-46. (in Chinese with English abstract)