高级检索+

mRMR-GBDT 结合高光谱成像检测混合菌体系中食源性致病菌

Detection of foodborne pathogens in mixed bacterial systems by mRMR-GBDT combined with hyperspectral imaging

  • 摘要: 针对现有食源性致病菌检测方法样本前处理复杂、耗时,且试剂依赖度高等问题,该文利用高光谱(hyperspectral imaging, HSI)结合最小冗余最大相关 - 梯度提升决策树(minimum redundancy maximum relevance - gradient boosting decision tree, mRMR-GBDT)系列特征选择方法开展羊肉源混合菌体系中大肠埃希氏菌、沙门氏菌和金黄色葡萄球菌定性检测研究。首先,通过分析新鲜羊肉的菌群组成结构,探明羊肉源共生菌群结构并构建与实际检测状况相吻合的多菌混合体系;其次,获取由羊肉源共生菌群和食源性致病菌组成的多菌混合体系的HSI数据,并通过生成掩膜和形态学方法提取一维光谱,然后,采用mRMR-GBDT系列方法(包括mRMR-XGBoost、mRMR-LightGBM和mRMR-CatBoost)对预处理后的光谱数据开展致病菌相关的特征波长筛选;最后,系统比较不同特征选择方法与致病菌分类模型的性能并确定最佳特征提取方法和分类模型。研究结果显示:90.5% 的羊肉样本菌落总数分布在102~104 CFU/g区间,核心优势菌群的相对丰度分别为35.67%(假单胞菌属)和35.56%(不动杆菌属);mRMR-GBDT系列方法分别获取特征17、21和12个,其中mRMR-CatBoost方法筛选的12个特征波长兼具最优光谱关联性和最小冗余度;基于该特征子集分别构建SVM、LightGBM和BPNN分类模型,经对比分析,BPNN模型表现最优,其验证集和测试集准确率分别达0.97630.9649。结果表明,HSI结合mRMR-CatBoost-BPNN能够实现真实来源的羊肉混合菌体系中食源性致病菌高效检测。该研究可为羊肉等动物源食品中致病菌的高效准确检测提供理论依据与技术参考。

     

    Abstract: Aiming at the limitations of existing foodborne pathogen detection methods, such as complex and time-consuming sample pretreatment and high reagent dependency, this study utilizes hyperspectral imaging (HSI) combined with the minimum redundancy maximum relevance - gradient boosting decision tree (mRMR-GBDT) series feature selection methods to conduct qualitative detection research on Escherichia coli, Salmonella, and Staphylococcus aureus in a mutton-derived mixed bacterial system. Firstly, by analyzing the composition structure of the microbial community in fresh mutton, the symbiotic bacterial structure of mutton origin was elucidated, and a multi-bacterial mixed system consistent with actual detection conditions was constructed. Secondly, HSI data of the mixed system composed of mutton-derived symbiotic bacteria and foodborne pathogens were acquired. One-dimensional spectra were extracted through mask generation and morphological methods. Then, the mRMR-GBDT series methods (including mRMR-XGBoost, mRMR-LightGBM, and mRMR-CatBoost) were used to perform feature wavelength screening related to pathogens on the preprocessed spectral data. Finally, the performance of different feature selection methods and classification models was systematically compared to determine the optimal feature extraction method and classification model. The research results showed that: the total bacterial colony count in 90.5% of mutton samples was distributed in the range of 102~104 CFU/g, with the relative abundances of the core dominant microbiota being 35.67% (Pseudomonas) and 35.56% (Acinetobacter), respectively. The mRMR-GBDT series methods obtained 17, 21, and 12 feature wavelengths, respectively. Among them, the 12 feature wavelengths selected by the mRMR-CatBoost method possessed optimal spectral relevance and minimal redundancy. Classification models (SVM, LightGBM, and BPNN) were constructed based on this feature subset. Through comparative analysis, the BPNN model performed best, achieving validation set and test set accuracies of 0.9763 and 0.9655, respectively. The results indicate that HSI combined with mRMR-CatBoost-BPNN enables efficient detection of foodborne pathogens in a mixed bacterial system derived from authentic mutton. This research can provide a theoretical basis and technical reference for the efficient and accurate detection of pathogenic bacteria in mutton and other animal-derived foods.

     

/

返回文章
返回