LIANG Xifeng, WEI Zhiwei. Segmenting tomato stems and branches at night time using improved CycleGAN and YOLOv8[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2025, 41(8): 147-155. DOI: 10.11975/j.issn.1002-6819.202410214
Citation: LIANG Xifeng, WEI Zhiwei. Segmenting tomato stems and branches at night time using improved CycleGAN and YOLOv8[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2025, 41(8): 147-155. DOI: 10.11975/j.issn.1002-6819.202410214

Segmenting tomato stems and branches at night time using improved CycleGAN and YOLOv8

  • Tomato is one of the most widely grown vegetables in the world. However, many challenges still remain in the actual cultivation in the broad market prospects of tomato production. Among them, the pruning of branches and leaves has been one of the most important steps during tomato growth and fruiting. However, manual pruning cannot fully meet the large-scale production in recent years, due to the high labor intensity and cost. Particularly, interval pruning is often required to work consistently for long periods of time. Fortunately, the tomato pruning robot can be expected to work all the whole day and night. It is an urgent need for the tomato pruning robots to accurately and efficiently identify the tomato stems and branches. Tomato stems and branches can often be recognized well during the daytime. But the low accuracy and missed detection of tomato branches can occur in a night environment at present. In this study, a segmentation model (NTS-YOLO) was proposed for the tomato stems and branches in night environment using improved CycleGAN and YOLOv8. The feature extraction of the CycleGAN module was improved to solve the color distortion and blurring of images. The low- and high-level features were then fused to introduce the efficient channel attention (ECA) mechanism in the network. The YOLOv8 backbone network was replaced by the lightweight backbone network (StarNet), in order to reduce the complexity of the improved model. The YOLOv8 neck network was also replaced by the Gold-YOLO. The convolutional block attention module (CBAM) attention mechanism was embedded in the head network, in order to improve the accuracy of the improved model. The results showed that the FID, LPIPS of the images generated by the improved CycleGAN were reduced by 12.23 and 0.07 and PSNR increased by 2.96 dB, respectively, compared to the original CycleGAN model. The NTS-YOLO improved mAP by 19.8 percentage points using the data-enhanced datasets. The ablation experiments indicated that the mean values of precision, recall, and average accuracy of the NTS-YOLO model were achieved at 92.5%, 86.1%, and 93.3%, respectively, which were improved by 3.8, 2.4, and 4.5 percentage points, respectively, compared with the original network. The frame rate of detection increased from 70.9 to 75.3 frame per second. The effectiveness of the NTS-YOLO model was validated using ablation experiments. The NTS-YOLO model achieved 95.3%, 92.4%, and 92.2% in the AP of the stem, the lateral branch, and the fruit branch, respectively, which were improved by 4.1, 4.1, and 5.3 percentage points, respectively, compared with the original network. Furthermore, the mean average accuracy of the NTS-YOLO model reached 93.3%, which was an increase of 15.0%, 18.8%, 5.7% and 4.5%, respectively, compared with the mainstream segmentation models, such as Mask R-CNN, YOLACT, YOLOv5l-seg and YOLOv8l-seg. The leakage rate reached 4.2%, which was reduced by 15.9, 18.2, 10.5, and 5.9 percentage points, respectively. The FPS reached 75.3 frame per second, which was faster than the rest networks by 58.2, 49.4, 2.7 and 4.4 frame per second, respectively. The NTS-YOLO network was more robust and faster than the rest of mainstream segmentation in segmenting tomato stems, lateral branches, and fruit branches in a night environment. This finding can also provide technical support for automatic and intelligent pruning in the tomato-growing industry.
  • loading

Catalog

    Turn off MathJax
    Article Contents

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return