Welcome to Journal of University of Chinese Academy of Sciences,Today is

Journal of University of Chinese Academy of Sciences ›› 2022, Vol. 39 ›› Issue (2): 240-251.DOI: 10.7523/j.ucas.2020.0026

• Research Articles • Previous Articles     Next Articles

A new joint model for extracting overlapping relations based on deep learning

ZHAO Minjun1, ZHAO Yawei1, ZHAO Yajie2, LUO Gang2   

  1. 1 School of Engineering Science, University of Chinese Academy of Sciences, Beijing 100049, China;
    2 AI Lab of KnowLeGene Intelligent Technology Co, Ltd, Beijing 100088, China
  • Received:2020-03-23 Revised:2020-05-25 Online:2022-03-15
  • Supported by:
    Supported by the National Natural Science Foundation of China (61872331) and University of Chinese Academy of Sciences

Abstract: With the rapid developments of Internet technologies and popularization of Internet among daily activities, we are surrounded by all kinds of information every moment. Hence, to mine valuable information from massive data has always been a hotspot of research at home and abroad. In this environment, relationship extraction is an important subtask of information extraction, which purpose is to identify the relationship between entities from the text, so as to mine the structured information in the text, that is, fact triplet. In the text, entity overlapping and relationship overlapping are very common phenomena, but the existing joint extraction model cannot effectively solve such problems, so the paper proposes a new joint extraction model, which regards the relationship extraction task as consisting of entity recognition and relationship recognition of two subtasks. The two subtasks are identified using sequence labeling method and multi-classification method, respectively. In the joint extraction process, in order to fully mine the semantic information of the text, the part of speech (POS) and syntactic dependency (Deprel) features were added to the input layer of the model. Attention mechanism is also introduced in the model, which can eliminate the problem of long-distance dependence as sentence length increases. Finally, the paper conducts relationship extraction experiments on the NYT dataset and the WebNLG dataset. The experimental results show that the model proposed in the paper can effectively solve the problem of overlapping relationships and obtain the best extraction effect.

Key words: relation extraction, entity overlapped, joint extraction model, deep learning

CLC Number: