基于改进SqueezeNet网络模型的破碎玉米籽粒识别方法

姚艳春; 崔春晓; 耿端阳; 赵博

doi:10.11975/j.issn.1002-6819.202412248

摘要: 为解决SqueezeNet网络模型识别玉米等小籽粒目标存在网络层次深，卷积计算量大等问题，该研究提出了一种改进SqueezeNet网络模型的破碎玉米籽粒识别方法。首先，为优化网络结构并降低计算量，设计了SqueezeNet-dw2网络模型，改变SqueezeNet经典模型Fire层数，并修改了末尾卷积层的输入通道参数，修改普通卷积为深度可分离卷积；其次，利用Ghost模块设计了Fire模块expand层里的3×3卷积，改进SqueezeNet-dw2网络模型为SqueezeNet-dw2-gh网络模型，降低了模型计算量和参数量；最后，优选网络激活函数为具有参数化修正线性单元的变体激活函数PReLU，改进SqueezeNet-dw2-gh网络模型为SqueezeNet-dw2-gh-P网络模型，减小了因轻量化改进造成的准确率损失。结果表明，改进后的SqueezeNet-dw2-gh-P网络模型参数量仅为0.60 MB，比原始模型降低了51.61%，模型浮点运算量为36.71 MFLOPs，降低了48.54%，验证集准确率为93.98%，测试集准确率为92.33%，同时保证了破碎玉米籽粒识别精度。本文提出的改进SqueezeNet网络模型显著减少了参数量和浮点运算量，能够实现在移动端等资源受限的嵌入式设备上部署模型，对在线实时准确识别破碎玉米籽粒具有重要参考价值。

Abstract: An accurate and rapid detection is highly required for broken or damaged maize kernels in modern agriculture. However, the conventional manual approaches cannot fully meet the large-scale applications in recent years, due to the inherently error-prone tasks, labor-intensity, and time-consuming. Moreover, significant constraints have been posed on the high efficiency and scalability of modern farming. In contrast, image recognition can be expected to substantially enhance the accuracy and efficiency of broken kernel detection using deep learning, such as the SqueezeNet network. Some challenges of the SqueezeNet model still remained in the identification of small targets, such as the maize kernels. In the depth of the network, complex and multi-layered convolutions are better required to effectively process the input images. Since the deeper architectures can enhance feature extraction, substantial computational demands have also been imposed on the processing power, memory, and storage. Particularly, the real-time applications cannot fully meet the resource-constrained environments, such as the mobile devices or embedded systems that are commonly deployed in agricultural settings. In this study, an optimized variant of the SqueezeNet model was introduced to specifically detect the broken maize kernels. The architecture (termed SqueezeNet-dw2) was used to enhance the original SqueezeNet framework. The computational complexity was reduced to improve the efficiency more suitable for real-time agricultural applications. Several key modifications were also introduced into the classic SqueezeNet architecture, in order to enhance the efficiency with less computational complexity. Firstly, the number of fire layers was reduced to the input channels of the final convolutional layer. Additionally, the standard convolutions were replaced with the depthwise separable ones. The feature extraction was preserved to significantly lower the computational costs. Furthermore, the Ghost module was integrated to refine the expanding layer of the Fire module. A 3×3 convolution was also incorporated to effectively reduce the computational demands and the number of parameters. The enhanced architecture was termed SqueezeNet-dw2-gh, indicating the integration of the Ghost module. A more efficient network was obtained after refinement and is better suitable for real-time agricultural applications, compared with the original SqueezeNet. The parametric rectified linear unit (PReLU) was employed as the activation function, in order to adaptively learn the activation parameters during training. The degradation of the accuracy after network simplification was mitigated to maintain high performance with less computational complexity. The final model after optimization was termed SqueezeNet-dw2-gh-P. Experimental results show that the parameter count was reduced to 0.60 MB—a 51.61% decrease, compared with the original architecture—while the computational cost was lowered by 48.54%, with an operation count of 36.71 MFLOPs. Notably, the optimal network shared the validation and test accuracies of 93.98% and 92.33%, respectively, indicating the effectiveness and efficiency in the accurate detection of the broken maize kernels. In conclusion, the improved SqueezeNet architecture achieved substantial reductions in the parameter count, memory footprint, and computational demands. The suitability of the improved model was obtained for the deployment of resource-constrained mobile and embedded devices. The real-time detection of broken maize kernels can also offer a practical solution in modern agriculture.

基于改进SqueezeNet网络模型的破碎玉米籽粒识别方法

Broken maize kernel recognition method based on improved SqueezeNet network model