SHI Yi, WANG Yingkuan, WANG Fei, et al. Recognizing young apples using improved YOLOv8n[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2025, 41(8): 204-210. DOI: 10.11975/j.issn.1002-6819.2024082228
Citation: SHI Yi, WANG Yingkuan, WANG Fei, et al. Recognizing young apples using improved YOLOv8n[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2025, 41(8): 204-210. DOI: 10.11975/j.issn.1002-6819.2024082228

Recognizing young apples using improved YOLOv8n

  • To enhance the precision and efficiency of young apple fruit recognition in agricultural settings, where timely identification is crucial for effective crop management and yield optimization, this paper introduces an innovative approach utilizing the YOLOv8n model, bolstered by transfer learning techniques. The recognition of young apple fruits presents unique challenges due to the variability in size, shape, and maturity stages, as well as the dynamic environmental conditions within orchards, such as varying lighting and occlusion by leaves or branches. To tackle these challenges, the study embarked on constructing a comprehensive and diverse dataset that captures the essence of young apple fruits under a wide range of scenarios. High-resolution digital cameras were employed to gather images of young apple fruits at different times of the day and throughout various stages of growth. Special attention was given to incorporating a spectrum of lighting conditions, from bright sunlight to shaded areas, to ensure the model's robustness against illumination changes. Furthermore, fruits of different sizes and maturity levels were included to reflect the natural heterogeneity within an orchard. These images underwent rigorous preprocessing, which involved meticulous cropping to focus solely on the fruits and augmentation techniques like rotation, flipping, and color adjustment to artificially diversify the dataset and prevent overfitting. The study then compared the efficacy of the transfer learning approach against the conventional method of random weight initialization across several state-of-the-art object detection models, including RetinaNet, EfficientDet, YOLOv5n, YOLOv8n, and YOLOv10n. The objective was to assess which model, when equipped with pre-trained weights from a related domain, could best adapt to the task of identifying young apple fruits with high accuracy and efficiency. Notably, the YOLOv8n model, known for its balance between performance and computational efficiency, emerged as the top performer when enhanced with transfer learning. It achieved remarkable detection precision of 99.3%, indicating a high degree of correctness in identifying young apple fruits, coupled with a recall rate of 94.2%, which underscores its ability to capture most of the relevant instances in the dataset. The average precision (AP) of 97.0% and extended average precision (AP) across multiple IoU thresholds (83.1%) further solidified its superiority in this specific detection task.To push the boundaries of detection accuracy even further, this research introduced the EMCA (Efficient Multiscale Channel Attention) module, integrating it into the YOLOv8n framework to create the YOLOv8n-EMCA model. The EMCA module is designed to enhance the model's capability to extract and process multi-scale features, enabling it to attend more effectively to critical details across different spatial resolutions. This refinement led to slight improvements in precision (99.6%) and a notable jump in recall to 95.6%, indicating fewer missed detections. Additionally, the mean Average Precision at 50% IoU (mAP50) reached 97.3%, and the more stringent mAP50-95 metric improved to 88.2%, demonstrating the model's robustness across a range of IoU thresholds. This research not only contributes a novel methodology tailored for the identification of young apple fruits but also serves as a valuable reference for the development of similar systems for other fruit trees. The findings underscore the transformative potential of transfer learning and advanced attention mechanisms in bolstering object detection capabilities, with profound implications for advancing agricultural automation and machine vision applications. By leveraging these techniques, the agriculture sector can move closer to achieving precision farming, where real-time, accurate monitoring and decision-making become the norm.
  • loading

Catalog

    Turn off MathJax
    Article Contents

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return