基于双注意力机制的常识引导归纳关系预测

基于双注意力机制的常识引导归纳关系预测: History

View Latest Version

Please note this is an old version of this entry, which may differ significantly from the current revision.

Subjects: Computer Science, Artificial Intelligence

Contributor:

Changsen Liu

知识图谱的归纳关系预测作为一个重要的研究课题，旨在预测未知实体与许多实际应用之间的缺失关系。现有的解决该问题的方法大多使用封闭子图来提取目标节点的特征进行预测;然而，有一种趋势是忽略封闭子图之外的相邻关系，从而导致预测不准确。此外，他们还忽略了丰富的常识性信息，这些信息可以帮助过滤掉不太令人信服的结果。

inductive relation prediction
commonsense
dual attention
contrastive learning

1. Introduction

Knowledge graphs (KGs) are composed of organized knowledge in the form of factual triples (entity, relation, entity), and they form a collection of interrelated knowledge, thereby facilitating downstream tasks such as question answering [1], relation extraction [2], and recommendation systems [3]. However, even state-of-the-art KGs suffer from an issue of incompleteness [4,5], such as FreeBase [6] and WikiData [7]. To solve this issue, many studies have been proposed mining missing triples in KGs, in which the embedding-based methods become a dominant paradigm, such as TransE [8], ComplEx [9], RGCN [10], and CompGCN [11]. In particular, certain scholars have explored knowledge graph completion under low-data regime conditions [12]. In actuality, the aforementioned methods are often only suitable for transductive scenarios, which assumes that the set of entities in KGs is fixed.

However, KGs undergo continuous updates, whereby new entities and triples are incorporated to store additional factual knowledge, such as new users and products on e-commerce platforms. Predicting the relation links between new entities requires inductive reasoning capabilities, which implies that generality should be derived from existing datasets and extended to a broader spectrum of fields, as shown in Figure 1. The crux of the inductive relation prediction [13] resides in utilizing information that is not specifically tied to a particular entity. A representative strategy in inductive relation prediction techniques is rule mining [14], which extracts first-order logic rules from a given KG and employs weighted combinations of these rules for inference. Each rule can be regarded as a relational path, comprising a set of relations from the head entity to the tail entity, which signifies the presence of a target relationship between two entities. For example, consider the straightforward rule (X, part_of, Y) ∧ (Y, located_in, Z) → (X, lives_in, Z), which was derived from the KG depicted in Figure 1a. These relational paths exist in symbolic forms and are independent of particular entities, thus rendering them inductive and highly interpretable.

Figure 1. An explanatory case in inductive relation prediction, which learned from a (a) training graph, and generalizes to be (b) without any shared entities for inference. A red dashed line denotes the relation to be predicted.

Motivated by graph neural networks (GNNs) that have the ability of aggregating local information, researchers have recently proposed GNN-based inductive models. GraIL [15] models the subgraphs of target triples to capture topologies. Based on GraIL, some works [16,17,18] have further utilized enclosing subgraphs for inductive prediction. Recent research has also considered few-shot settings for handling unseen entities [19,20]. SNRI [21] extracts the neighbor features of target node and path features, solves the problem of sparse subgraphs, and introduces mutual information (MI) maximization to model from a global perspective, which improves the prediction effect of inductive relationships.

2. Relation Prediction Methods

为了提高KG的完整性，最先进的方法使用KG内部结构[8,15]或外部KG信息[22,23]。

转导方法。转导方法用于学习每个节点的实体特定嵌入，它们有一个共同点：对原始 KG 进行推理。然而，很难预测看不见的节点之间缺失的环节。例如，TransE [8] 基于翻译，而 RGCN [10] 和 CompGCN [11] 基于 GNN。它们之间的主要区别在于评分函数和是否利用了KG中的结构信息。Wang等提出，利用全局邻域聚合器获取实体的全局结构信息，以解决某些快照下局部结构信息稀疏的问题[24]。Meng等提出了一种基于稀疏时间知识图谱的多跳路径推理模型[25]。最近，Wang等人提出了多层次交互的知识图谱补全，其中实体和关系在细粒度和粗粒度上同时交互[26]。

归纳法。归纳法可用于学习如何在看不见的节点上进行推理。方法主要分为两类：基于规则的方法和基于图形的方法。基于规则的方法旨在学习与实体无关的逻辑推理规则。例如，NeuralLP [14] 和 DRUM [27] 将神经网络与符号规则集成在一起，以端到端的微观方式学习逻辑规则和规则置信度。

在基于图的方法方面，近年来，研究人员从图神经网络的局部信息聚合能力中汲取灵感，并将图神经网络（GNN）纳入其模型中。GraIL[15]通过提取目标三元组的封闭子图来获取目标节点的拓扑结构，从而表现出归纳预测能力。TACT[16]在此模型的基础上，引入了子图中关系的相关性，并构建了关系相关网络（RCN）来增强子图的编码。CoMPLIE[17]提出了一种节点-边缘通信消息传播网络，以增强节点与边缘之间的交互，并自然地处理非对称或反对称关系，以增强关系信息的充分流动。ConGLR [13] 制定了一个上下文图来表示子图中的关系路径，其中应用两个 GCN 来处理封闭的子图和上下文，其中不同的层以交互方式利用对应的输出来更好地表示特征。RE-PORT [28] 聚合关系路径和上下文，通过统一的分层转换框架捕获实体的联系和内在性质。然而，由于RE-PORT的实验指标与其他先进模型不同，因此在本文的实验部分没有选择RE-PORT模型作为对比模型。RMPI[29]使用一种新颖的关系消息传递网络进行完全归纳知识图谱的完成。SNRI[21]提取节点的邻居关系特征和路径嵌入，充分利用实体的完整邻居关系信息，以获得更好的归纳效果。然而，所有这些方法都只是增加了额外的简单处理负荷，并没有充分利用KGs的整体结构特征，与SNRI不同，CNIA保留了集成的邻关系，利用双注意力机制对子图的结构特征进行处理，并引入了常识性重新排序。

3. 常识性知识

常识性知识是解决人工智能和知识工程技术瓶颈的关键环节。而常识知识的获取是该领域的基本问题。最早的构建方法涉及专家手动定义知识库的架构和关系类型。Lenat [30] 在 1980 年代构建了最古老的知识库之一 CYC。然而，专家的施工方法需要大量的人力和物力资源。因此，研究人员开始开发半结构化和非结构化文本提取方法。YAGO[31]构建的常识性知识库包含超过100万个实体和500万个事实，这些实体和事实来自半结构化的维基百科数据，并通过精心设计的基于规则和启发式方法的组合与WordNet协调一致。

上述方法通过建立定义明确的实体空间和相应的关系系统来优先考虑百科全书式的知识和结构存储。然而，实际的一般知识结构更加松散，并且很难应用于具有已知关系的两个实体的模型。因此，现有的解决方案是将实体部分建模为自然语言短语，并将关系建模为可以连接实体的任何概念。例如，OpenIE 方法揭示了开放文本实体和关系的属性。然而，该方法具有抽取性，难以获得文本的语义信息。

This entry is adapted from the peer-reviewed paper 10.3390/app14052044

© Text is available under the terms and conditions of the Creative Commons Attribution (CC BY) license; additional terms may apply. By using this site, you agree to the Terms and Conditions and Privacy Policy.