基于改进YOLOv10s的海洋牧场水下海参检测方法

刘侦龙; 王骥; 麦仁贵

doi:10.11975/j.issn.1002-6819.202412109

基于改进YOLOv10s的海洋牧场水下海参检测方法

Detecting underwater sea cucumber in marine ranching using improved YOLOv10s

摘要

摘要: 针对海洋牧场水下复杂环境中海参目标小且体表与背景区分难度大，光线高度弱化，图像存在大量噪声以及海参堆叠遮挡导致检测精度低的问题，该研究提出了一种基于改进YOLOv10s模型的水下海参检测方法YOLOv10-MECAS。该方法设计了中值增强的通道和空间注意力模块MECAS（median-enhanced channel and spatial），保留目标特征的同时减少图像噪声并通过多尺度深度卷积提升海参图像特征捕捉能力；引入可切换空洞卷积SAConv（switchable atrous convolution）模块替换SCDown（spatial-channel decoupling downsampling）模块中的3×3普通卷积模块，在无需增大卷积核尺寸的前提下扩大了感受野，增强模型捕获遮挡目标的特征能力；采用基于暗通道先验的水下图像增强算法UDCP（underwater dark channel prior）对数据集图像进行增强，优化对比度，提高图像质量；使用MPDIoU（minimum points distance intersection over union）损失函数，减少由于样本差异性大引起的检测框失真，提高模型鲁棒性。试验利用水下真实场景下采样的海参数据集对模型的性能进行了评价。结果显示，在常规数据集上，该模型的精确率、召回率、mAP_0.5分别达到85.7%、81.5%、89.7%，相比基线模型分别提高了6.4、4.4、5.0个百分点；在增强数据集上，该模型的精确率、召回率、mAP_0.5分别达到86.4%、82.6%、90.4%，相比基线模型分别提高了3.3、2.1、4.8个百分点。研究结果表明，该研究提出的模型在复杂海洋牧场水下环境中，能有效提高小目标海参的检测精度，可为海参自动化捕捞提供理论支持。

Abstract: Sea cucumber is required to accurately and rapidly detect under the underwater complex environments of ocean Sea cucumber is required to accurately and rapidly detect under the underwater complex environments of ocean ranching. However, the small size of the sea cucumbers target is difficult to distinguish from the background. Particularly, the challenge of detection can also be found under the weak lighting, serious noise, and occlusion due to the overlapping sea cucumbers. Therefore, this study aims to propose the YOLOv10-MECAS model improved by the YOLOv10s baseline to enhance the performance of detection. A median pooling enhanced channel attention and a spatial attention were used to design the MECAS (median-enhanced channel and spatial attention) module. MECAS effectively retained the target features and reduced the image noise. Sea cucumber features were then captured using multiscale depth wise convolution. Additionally, the SAConv (switchable atrous convolution) module was introduced to replace the standard 3×3 convolutional module in the SCDown (spatial-channel decoupling downsampling) module. The receptive field was expanded without increasing the convolution kernel size. Thereby the model was improved to capture the features of occluded targets. An enhancement algorithm was employed on the underwater image using UDCP (the underwater dark channel prior). The dataset images were enhanced to significantly optimize the contrast for the high image quality. Furthermore, the MPDIoU (minimum points distance intersection over union) regression loss function was adopted to reduce the distortion of detection boxes caused by large sample variability. Thereby the robustness of the model was enhanced. An experiment was carried out to evaluate the performance of the improved model. A dataset of sea cucumbers was sampled from real underwater scenarios. The experimental results show that the better performance of the improved model was achieved on the original dataset, with a prediction precision of 85.7%, recall of 81.5%, and the mean average precision at IoU (intersection over union) 0.5 of 89.7%, indicting the improvement by 6.4%, 4.4%, and 5.0% over the baseline model. Compared with the comparison models Faster-RCNN, SSD, YOLOv5s, YOLOv7, YOLOv8s, YOLOv9s and YOLOv11s, while maintaining advantages in terms of the number of parameters, GFLOPs (giga floating-point operations per second), and FPS (frames per seconds), the mean average precision at IoU 0.5 had been improved by 16.5, 15.4, 5.5, 6.1, 5.3, 5.8, and 4.6 percentage points. On the enhanced dataset by UDCP algorithm, the improved model was achieved in a prediction precision of 86.4%, recall of 82.6%, and the mean average precision at IoU 0.5 of 90.4%, indicating the improvement of 3.3%, 2.1%, and 4.8% over the baseline model. Compared with the comparison models Faster-RCNN, SSD, YOLOv5s, YOLOv7, YOLOv8s, YOLOv9s and YOLOv11s, the mean average precision at IoU 0.5 had been improved by 16.1, 15.4, 5.5, 5.9, 5.2, 6.1, and 4.3 percentage points. The MECAS module into the YOLOv10s also outperformed the combination of current mainstream attention modules, such as LSKA (large selective kernel attention), CA (coordinate attention) and ECA (efficient channel attention) in underwater sea cucumber detection. Finally, the experiment verified that the YOLOv10s with the MPDIoU also performed better than that with CIoU (complete intersection over union), EIoU (enhanced intersection over union), GIoU(generalized intersection over union), DIoU (distance intersection over union), and SIoU (scaled intersection over union). Consequently, the detection accuracy of small target sea cucumbers was effectively improved in complex underwater environments. The finding can provide a theoretical basis to detect the sea cucumber during harvesting.

HTML全文

参考文献(30)

施引文献

资源附件(0)