系统工程与电子技术 ›› 2022, Vol. 44 ›› Issue (7): 2241-2250.doi: 10.12305/j.issn.1001-506X.2022.07.20

• 系统工程 • 上一篇    下一篇

多模态公文的结构知识抽取与组织研究

徐瑞麟1,2, 耿伯英2,*, 刘树衎2,3   

  1. 1. 海军工程大学电子工程学院, 湖北 武汉 430033
    2. 中国人民解放军91001部队, 北京 100036
    3. 东南大学计算机科学与工程学院, 江苏 南京 211189
  • 收稿日期:2020-12-16 出版日期:2022-06-22 发布日期:2022-06-28
  • 通讯作者: 耿伯英
  • 作者简介:徐瑞麟(1997—), 男, 硕士研究生, 主要研究方向为军事知识工程、自然语言处理|耿伯英(1967—), 男, 副教授, 博士, 主要研究方向为计算机软件与应用、军事信息系统|刘树衎(1989—), 男, 讲师, 博士, 主要研究方向为军事知识工程
  • 基金资助:
    湖北省自然科学基金(2018CFC800)

Research on structural knowledge extraction and organization for multi-modal governmental documents

Ruilin XU1,2, Boying GENG2,*, Shukan LIU2,3   

  1. 1. School of Electronic Engineering, Naval University of Engineering, Wuhan 430033, China
    2. Unit 91001 of the PLA, Beijing 100036, China
    3. School of Computer Science and Engineering, Southeast University, Nanjing 211189, China
  • Received:2020-12-16 Online:2022-06-22 Published:2022-06-28
  • Contact: Boying GENG

摘要:

针对目前基于三元组知识构建的知识图谱结构逻辑性弱、难以形成知识体系的问题, 以公文应用背景为牵引, 提出多模态知识结构要素抽取模型, 构建多模态公文文档数据集GovDoc-CN, 在文本和图像两个模态对文档中包括各级标题、摘要、作者、成文时间、文档编号等在内的知识结构要素进行抽取。设计文档结构树模型对抽取的文档知识结构要素进行组织, 并构建结构化图网络实现文档的组织和管理。实验证明, 相较于单一模态的抽取模型, 多模态知识结构要素抽取模型取得了明显的效果提升, 文档结构树模型和基于文档结构树模型构建的结构化图网络为文档知识的组织与管理提供了一种新途径, 具有重要的应用价值。

关键词: 多模态, 信息抽取, 知识组织, 文档结构化, 公文自动化

Abstract:

For the fact that triplet-based knowledge in large-scale knowledge graphs lacks structural logic and is difficult to form a knowledge system, this paper presents a multi-modal governmental documents dataset called GovDoc-CN. A multi-modal knowledge structure elements extraction model is proposed and knowledge structure elements are extracted, including titles, abstracts, authors, time of completion, document number, and other knowledge structure elements in documents through both text modal and image modal. The document structure tree (DST) model is designed to organize the extracted document knowledge structure elements, and a structured graph network is constructed to realize organization and management. Experiments prove that the multi-modal knowledge structural elements extraction model has achieved a significant improvement compared with the single-modal extraction models. The DST model and the structured graph network based on the DST model can provide a new way for the organization and management of document knowledge and have significant application value.

Key words: multi-modal, information extraction, knowledge organization, document structuring, governmental documents automation

中图分类号: