基于改进PSPNet的莲藕残留表皮自识别与补削

张国忠; 王雨; 黄成龙; 曾荣; 兰凌霄; 陈立明; 安玉德

doi:10.11975/j.issn.1002-6819.202510233

基于改进PSPNet的莲藕残留表皮自识别与补削

Automatic identification and targeted peeling of residual epidermis on lotus root based on an improved PSPNet

摘要

摘要: 针对莲藕机械去皮存在漏削、去净率检测依赖人工的问题，该研究提出一种基于机器视觉与改进PSPNet（pyramid scene parsing network）语义分割网络的残留藕皮识别与补削方法。首先构建改进PSPNet语义分割模型，通过在ResNet50主干网络中引入空洞卷积扩大感受野以捕获藕皮-果肉上下文差异，采用动态上采样器DySample优化边缘细节恢复，在解码阶段嵌入频域特征融合模块FreqFusion增强藕皮-果肉边缘特征提取能力。然后基于全局时序图像检测方法定位残留藕皮位置，以便于结合旋转路径规划算法控制刀具进行精准补削，进而基于线性扫描展开图，利用像素比值法实现去净率自动计算。在自行研制的刀具轴向送进式莲藕去皮机上布置补削检测平台进行试验验证，结果显示，改进的PSPNet模型平均交并比、精确率和召回率分别达92.15%、95.74%和95.85%，较基准模型分别提升1.53、0.92和0.75个百分点，残留藕皮角度定位平均误差为4.6°，残留藕皮区域定点补削去除率达87.50%，总体去净率提升至91.16%，较补削前提升17.54个百分点。研究结果可为莲藕去皮加工装备智能化改进提供重要指导。

Abstract: To address the issues of missed peeling and reliance on manual inspection for peeling rate assessment in mechanical lotus root peeling, this study proposes a method for identifying and targeted trimming of residual peel based on machine vision and an improved Pyramid Scene Parsing Network (PSPNet) for semantic segmentation. Given the complex curved surface of lotus root and the low contrast between peel and flesh, conventional mechanical peeling often leaves residual regions, which currently require manual trimming. The proposed method integrates visual recognition, precise localization, and automated trimming to overcome these limitations. First, an improved PSPNet semantic segmentation model is developed. Dilated convolutions with a dilation rate of 2 are incorporated into stage4 and stage5 of the backbone network ResNet50. This modification expands the receptive field without increasing the number of parameters, enabling the model to capture long-range contextual dependencies and better distinguish the subtle textural differences between lotus root peel and flesh. The original bilinear upsampler is replaced with a DySample dynamic upsampler, which uses 3×3 convolutions to generate spatially adaptive sampling weights based on the input feature map. This allows the decoder to reconstruct fine edge structures of residual peel more accurately, especially in irregularly shaped regions. Furthermore, a FreqFusion module is introduced into the decoder path. It adaptively fuses high-frequency edge details and low-frequency semantic information from multi-scale feature maps, significantly enhancing the model’s sensitivity to peel–flesh boundaries. Residual peel regions are localized using a global temporal image detection method. A fixed industrial camera captures the global video stream of the lotus root rotating 360° around the axis of the clamping mechanism. The trained improved PSPNet model processes the video stream frame by frame to generate binary segmentation masks of residual peel. Key frames are extracted when the Euclidean distance between the centroid of a detected residual region and the midline of the predefined region of interest (ROI) is minimized. This criterion ensures that the residual region is positioned directly facing the trimming cutter. Based on frame sequence analysis, the rotation angles corresponding to all residual regions are computed, and an optimized rotation path is planned. The stepper motor of the clamping mechanism then drives the lotus root to rotate incrementally, sequentially aligning each residual region with the trimming cutter. The trimming mechanism subsequently performs continuous feed motion to complete the targeted trimming operation with consistent cutting depth. An automatic calculation of the lotus root peeling rate is achieved using a pixel ratio method based on linear scan unfolded images. A line-scan camera acquires high-resolution images of the lotus root surface during a full rotation; these images are processed using the Hikvision industrial camera client software and distortion-corrected in conjunction with the captured front-view images. This yields an accurate circumferential unfolded image of the entire lotus root surface. The improved PSPNet model then detects residual peel regions in the unfolded image and generates a precise segmentation mask. Finally, the peeling rate is calculated by determining the pixel ratio of residual peel to the total lotus root surface area, providing a quantitative and objective cleanliness assessment. Experiments were conducted on a self-developed axial-feed lotus root peeling machine equipped with a trimming detection platform. The improved PSPNet model achieves a mean intersection over union (mIoU) of 92.15%, precision of 95.74%, and recall of 95.85%, representing improvements of 1.53, 0.92, and 0.75 percentage points, respectively, over the baseline PSPNet with ResNet50. The average angular positioning error of residual peel regions is as low as 4.6°, and the targeted trimming removal rate of these regions reaches 87.50%. As a result, the overall peeling rate is improved from 73.62% before trimming to 91.16% after trimming, an increase of 17.54 percentage points. These results demonstrate that the proposed method effectively eliminates residual peel with high precision and significantly enhances peeling quality. The research findings provide important guidance for the intelligent upgrading of lotus root peeling and processing equipment, and the proposed technical framework can be extended to other fruits and vegetables with similar peeling challenges.

HTML全文

参考文献(30)

施引文献

资源附件(0)