高级检索+

基于标签相关性的标签特定特征多标签学习

Multi-label learning with label-specific features via label correlations

  • 摘要: 针对标签特定特征多标签学习算法(multi-label learning with label-specific features, LIFT)未能在聚类以及分类阶段考虑标签相关性问题,提出一种基于标签相关性的标签特定特征多标签学习算法(multi-label learning with label-specific features via label correlations, LFLC).将标签空间加入特征空间进行聚类构建分类模型,采用考虑标签相关性的聚类集成技术为每个标签构造标签特定特征,使用相关性矩阵构建无向完全图并挖掘图中标签集合相关性,通过树集成表达标签间多种不同结构的强相关性.在试验部分,采用涵盖不同领域的10个数据集,以Hamming Loss、Ranking Loss、One-error、Coverage、Average Precision和macroAUC为评估指标,进行了参数敏感性分析和统计假设检验.结果表明:结合聚类集成与标签间强相关性的LFLC算法较其他对比多标签算法整体上能取得较好的效果.

     

    Abstract: To solve the problem that multi-label learning with label specific features(LIFT) could not consider label correlation in the clustering and classification stages, a method for multi-label learning with label-specific features via label correlations(LFLC) was proposed. The label space was added to the feature space for clustering to construct the classification model, and the clustering ensemble with considering label correlation was used to construct label-specific features for each label. The correlation matrix was used to construct undirected complete graph and mine the correlation of label sets in the graph. The strong correlation of multiple different structures between labels was expressed by tree ensemble. In the experiment, 10 data sets covering different fields were used, and Hamming Loss, Ranking Loss, One-error, Coverage, Average Precision and macroAUC were used as evaluation indexes to carry out parameter sensitivity analysis and statistical hypothesis test. The results show that the LFLC algorithm combined with clustering ensemble and strong correlation between labels can obtain better performance generally.

     

/

返回文章
返回