高级检索+

基于近红外光谱的灌浆期玉米籽粒水分定量分析通用模型

Generalized model for the quantitative moisture analysis of maize grains during filling stage based on near-infrared spectroscopy

  • 摘要: 玉米育种过程中,灌浆期籽粒含水率检测时,通常需要脱粒,采集穗中间200粒为检测样本。为了保护亲本,避免破坏性检测,该研究提出一种基于近红外光谱的灌浆期玉米籽粒水分定量分析通用模型,用于灌浆期玉米籽粒水分的田间原位检测。首先构建GA-IRIV-DS光谱数据处理策略。利用遗传算法(genetic algorithm,GA)和迭代保留信息变量(iterative retention of information variables,IRIV)二次波长筛选方法,提取光谱数据中有效的水分变量信息,减小特征空间维度的同时提高模型预测精度;再结合直接校正算法(direct standardization,DS),降低预测样本与建模样本的差异性,将玉米灌浆期穗尖部籽粒光谱数据校正为中间200籽粒的光谱,使水分定量分析模型能够具备中间200籽粒和穗尖部籽粒2种检测样本的通用性。在GA-IRIV-DS光谱数据处理策略的基础上,构建基于偏最小二乘法(partial lpeast squares regression,PLSR)的水分定量分析通用模型。经过验证,GA-IRIV-DS光谱数据处理策略校正后的光谱差异性降低了59.4%。为了进一步验证GA-IRIV-DS光谱数据处理策略的有效性,分析了GA+IRIVN组合波长筛选提取光谱特征,并分别与全光谱、多种典型波长筛选方法结合DS方法构建基于偏最小二乘法(PLSR)的水分定量分析模型结果相比较。试验结果表明,两种样本预测集GA-IRIVN-DS-PLSR模型效果均优于全光谱和其他模型,中间籽粒样本和穗尖部籽粒样本的预测决定系数(R2)达到了0.971 5和0.901 2,均方根误差(RMSEP)较全光谱下降了80.10%和64.60%。证明基于GA-IRIVN-DS光谱数据处理策略建立的近红外光谱水分定量分析模型具有一定泛化能力,可以为玉米育种过程中,减少检测过程中的样本破坏和提高检测效率提供可行的参考方法。

     

    Abstract: Grain moisture content at the filling stage is one of the most important indicators during maize breeding. However, the destructive sample (such as threshing) has been widely used to measure the moisture content of 200 grains in the middle of the ear. This study aims to address the challenge of threshing in moisture detection. A general model was proposed for the quantitative analysis and in-situ detection of maize grain moisture at the grain-filling stage using near-infrared spectroscopy. The GA-IRIV-DS strategy was established for spectral data processing. The genetic algorithm (GA) and the iteratively retained information variable (IRIV) secondary wavelength screening were combined to extract the effective moisture variables from the spectral data. The direct standardization algorithm (DS) was combined to correct the spectral data of maize spike tip grains at the filling stage. The spectrum of the middle 200 grains was also used as the baseline. The moisture quantitative analysis model was established on the two test samples of the middle 200 grains and the ear tip grains. As such, the general model was established for quantitative analysis of maize grain moisture at the filling stage. After the GA + IRIVN wavelength screening, the effective variables were extracted to reduce the feature space dimension for high prediction accuracy. In the parameter setting of the GA algorithm, the individual fitness increased with the increase of crossover rate and mutation rate. The highest value was achieved when the crossover rate was 0.5 and the mutation rate was 0.2. The variable threshold of 0.5 was set to retain the important features without too many redundance, in order to enhance the prediction accuracy of the improved model. In the IRIV algorithm, the residual variance was no longer significantly reduced, after the number of principal components increased to 10. The 5-fold cross-validation of the variables was provided for a better balance between the calculation cost and the evaluation accuracy. At the same time, the algorithm performed 10 iterations. As such, more effective moisture variables were extracted for the high accuracy of the improved model. Then the conversion matrix of DS was used to reduce the difference between the prediction and the modeling sample. Moreover, the spectral difference was reduced by 59.4 % after correction. The GA + IRIVN with the wavelength screening was combined to extract the spectral features, in order to further verify the effectiveness of the GA-IRIV-DS spectral data processing. The moisture quantitative model was constructed using partial least squares (PLSR). The DS was also combined with the full spectrum and various typical wavelength screenings. The test results show that the GA-IRIVN-DS-PLSR model on the two-sample prediction sets performed better than the full spectrum than the rest. The coefficient of determination (R2) reached 0.971 5 and 0.901 2, respectively, for the prediction of the middle and spike tip grain samples. The root mean square error of prediction (RMSEP) decreased by 80.10 % and 64.60 %, respectively, compared with the full spectrum. The generalization was improved in the moisture quantitative model with the near-infrared spectral using GA-IRIVN-DS spectral data processing. A feasible reference can also be provided to reduce the sample damage for the high efficiency of detection in the process of maize breeding.

     

/

返回文章
返回