基于语义知识图谱的农业知识智能检索方法

张海瑜; 陈庆龙; 张斯静; 张子怡; 杨帆; 李鑫星

基于语义知识图谱的农业知识智能检索方法

Intelligent Retrieval Method of Agricultural Knowledge Based on Semantic Knowledge Graph

摘要

摘要: 针对我国现存网络农业数据库同质异构、知识零散化、一物多词、歧义解析缺乏规范等问题,提出了基于语义知识图谱的农业知识智能检索方法。本文方法围绕农作物品种、农作物病虫害、农作物简介、模型方法4个要素,自顶向下构建模式层;通过本体建模形成知识图谱的概念框架,自底向上构建数据层;通过数据获取、知识抽取、融合、存储建立实体间关联关系。针对语料中歧义字段问题,本文方法在构建知识图谱中收集大量专有词汇,并对其进行分词及词性标注。为了解决在农业知识中一物多词的问题,收集了数量庞大的主要农作物别名,并作为实体赋予明确属性,采用Bi-LSTM-CRF进行实体识别,并通过LSTM将问题进行分类,利用TF-IDF进行关键字提取,最后将知识存储于Neo4j图数据库中,从而对相关农业知识数据做规范分类,解决一物多词、一义多解问题。

Abstract: Aiming at the problems of huge agricultural data, low utilization rate, complex structure and fragmented knowledge in China, a top-down and bottom-up agricultural knowledge map construction method was proposed. Focusing on the four elements of crop varieties, crop diseases and insect pests, crop introduction, and model methods, the model layer was constructed from the top down, and the conceptual framework of the knowledge graph was formed through ontology modeling, the data layer was constructed from the bottom up, through data acquisition, knowledge extraction, and fusion, storing and establishing the relationship between entities. Aiming at the problem of ambiguous fields in the corpus, this method collects large number of proprietary vocabularies in the construction of knowledge graphs to segment and mark them. In order to solve the problem of multi-word in agricultural knowledge, many main crop aliases were collected and assigned as entities. Bi-LSTM-CRF was used for named entity recognition, and LSTM was used to classify the problem, and TF-IDF was used for keyword extraction, and finally the knowledge was stored in the Neo4 j graph database. The research can be used for agricultural knowledge intelligent retrieval systems, intelligent search systems and other applications to improve user experience.

HTML全文

参考文献(33)

施引文献

资源附件(0)