高级检索+

基于改进YOLOv8s与ByteTrack的养殖海参计数方法

A counting method for farmed sea cucumbers based on improved YOLOv8s and ByteTrack

  • 摘要: 为解决海参养殖过程中人工采样计数方法成本高、效率低、误差大等问题,该研究提出一种基于改进YOLOv8s和ByteTrack的自动化养殖海参计数方法。该方法由检测、跟踪和计数3个部分组成:在检测部分,针对YOLO系列检测器在水下环境中检测性能不足问题,提出改进模型YOLOv8s-BB。通过在Backbone和Neck分别引入BCAM(BiFormer convolutional attention module)和BiFormer注意力模块增强其特征提取和融合能力,提升检测精度;在跟踪部分,针对ByteTrack算法在水下环境中对目标关联匹配性能不佳问题,提出基于三级级联匹配的TriSORT跟踪算法,提升跟踪稳定性;在计数部分,设计了未激活轨迹去除计数法,对比分析了其与画线计数的性能差异。结果表明:YOLOv8s-BB检测器的平均精度达88.9%,召回率为77.8%,F1值为84.2%,相较于YOLOv8s、YOLOv7-tiny、YOLOv9s和YOLOv11s检测模型,均保持领先优势;TriSORT的多目标跟踪准确度(MOTA)和IDF1提升至74.00%和85.03%,较ByteTrack分别提高6.55和5.54个百分点;未激活轨迹去除计数法平均计数精度达95.46,绝对误差为1.90,显著优于画线计数法。该研究通过检测-跟踪-计数的全流程优化,实现了高效、准确的自动化养殖海参计数,为海参养殖的生物量估算、投喂管理、销售决策等关键环节提供可靠的数据支持。

     

    Abstract: Population census of sea cucumbers serves as a fundamental operational metric in contemporary aquaculture, providing essential data for production management, feed administration, and commercial transactions. Despite its critical importance, the industry continues to rely predominantly on manual sampling counting methods, which present multiple operational challenges. These conventional approaches suffer from three principal limitations: 1) exorbitant labor requirements leading to unsustainable operational costs; 2) suboptimal counting speeds resulting in processing bottlenecks during high-volume operations; 3) inherent human error factors causing significant measurement inaccuracies. The continuous expansion of sea cucumber farming has exacerbated these issues, with current production scales rendering manual methods economically and logistically impractical. This technological gap creates an urgent industry demand for automated, intelligent counting solutions capable of delivering accurate, real-time population data. To address these critical challenges, we present an innovative intelligent counting framework that leverages advanced computer vision technologies deployed on unmanned surface vehicles (USVs). Our integrated solution framework comprises three synergistic technical modules: object detection, target tracking, and automated counting. For the detection module, we developed YOLOv8s-BB, an optimized detection model that significantly improves underwater object recognition through two key architectural innovations. First, we designed the BiFormer Convolutional Attention Module (BCAM) by strategically combining a bi-Level routing attention mechanism with spatial attention mechanism. This hybrid attention module was then integrated at the terminal layer of the YOLOv8s Backbone network. Second, we inserted dedicated BiFormer modules after each C2f block in the Neck architecture. These modifications collectively enhance feature extraction and fusion capabilities, particularly for challenging underwater environments with variable lighting conditions and turbidity. In the tracking module, we introduce TriSORT, an advanced tracking algorithm featuring triple-matching that effectively addresses the target association limitations of conventional ByteTrack in aquatic settings. Specifically, the TriSORT algorithm implements a three-stage cascade matching process between detection boxes and predicted boxes, effectively leveraging high-confidence detection results to enhance tracking accuracy and robustness. The counting module implemented and rigorously compared two counting methodologies: traditional line-crossing method and our innovative inactive trajectory removal method. The latter method demonstrates superior performance by dynamically filtering transient detections and maintaining only validated trajectories, thereby eliminating common counting artifacts prevalent in aquatic environments. Comprehensive experimental evaluations demonstrate the superiority of our framwork: (1) The YOLOv8s-BB detector achieves state-of-the-art performance with 88.9% mean average precision (mAP), 77.8% recall, and 84.2% F1 score, outperforming existing YOLO model variants: YOLOv8s, YOLOv7-tiny, YOLOv9s, and YOLOv11s models; (2) The TriSORT tracking algorithm exhibits enhanced tracking robustness, attaining 74.00% multiple object tracking accuracy (MOTA) and 85.03% IDF1 score, representing improvements of 6.55 and 5.54 percentage points respectively over ByteTrack; (3)The inactive trajectory removal counting method achieved superior performance with an average counting accuracy of 95.46% and a mean absolute error of 1.90, demonstrating superior performance over conventional line-crossing counting methods in all evaluated metrics. This study makes an important contribution to aquaculture technology by establishing a fully optimized detection-tracking-counting pipeline that enables precise, efficient, and automated sea cucumber population assessment. The developed system provides reliable, data-driven support for critical aquaculture operations including biomass estimation, feeding optimization, and harvest planning, with potential applications extending to other marine species monitoring.

     

/

返回文章
返回