系统工程与电子技术

• 软件、算法与仿真 • 上一篇    

基于改进协同训练的本体映射方法

孙煜飞1,2, 马良荔1, 吕闽晖3, 覃基伟1   

  1. 1. 海军工程大学电子工程学院, 湖北 武汉 430033; 2. 中国人民解放军91635部队,
    北京 102249; 3. 海军工程大学装备经济管理系, 湖北 武汉 430033
  • 出版日期:2017-01-20 发布日期:2010-01-03

Improved co-training based ontology matching method

SUN Yufei1,2, MA Liangli1, LV Minhui3, QIN Jiwei1   

  1. 1. Electronic Engineering College, Naval University of Engineering, Wuhan 430033, China;
    2. Unit 91635 of the PLA, Beijing 102249, China; 3. Department of Equipment Economics and
    Management, Naval University of Engineering, Wuhan 430033, China
  • Online:2017-01-20 Published:2010-01-03

摘要:

针对基于机器学习的本体映射方法存在的人工标记代价高和样本不平衡问题,将本体映射建模为二视图上的协同训练问题,分别从本体模式层和数据层提取特征集。通过对本体概念对进行预匹配,缩小样本不平衡度。分析了传统协同训练方法的局限性,结合主动学习思想,设计了一种考虑样本价值的改进协同训练算法,在每轮迭代中选择更有价值的未标记样本更新训练集。实验结果表明,该方法学习效率更高,同时能取得较好的本体映射结果。

Abstract:

Aiming at the problem of high labeling cost and imbalanced data exists in the machine learning based ontology matching, the model ontology matching is taken as an co-training problem on two views, which respectively extracts feature sets from the ontology schema level and the data level. Through pre-matching ontologies’ concept pairs, the imbalanced degree of samples is reduced. After analyzing the limitations of the traditional co-training method, in combination with the active learning theory, an improved sample value considered co-training method is proposed, it can choose more valuable unlabeled samples to update training sets in each iteration. Experimental results show that the proposed method is more efficient and achieves better competitive ontology matching results.