Journal of Systems Engineering and Electronics

• 软件、算法与仿真 • 上一篇    下一篇

改进的基于二次型距离模糊可能性C均值聚类模型

陈加顺1,2,皮德常1   

  1. 1.南京航空航天大学计算机科学技术学院,江苏南京 210016;
    2.淮海工学院计算机工程学院,江苏连云港 222003
  • 收稿日期:2012-05-17 修回日期:2013-02-16 出版日期:2013-07-22 发布日期:2013-05-15

Improved fuzzy possibilistic c-means model based on quadratic distance

Chen Jia-shun1, 2, Pi De-chang1   

  1. 1.College of Computer Science and Technology Nanjing University of Aeronautics and Astronautics,
    Nanjing 210016, China; 2..College of Computer Science and Technology Huaihai Institute of Technology,
    Lianyungang 222003, China
  • Received:2012-05-17 Revised:2013-02-16 Online:2013-07-22 Published:2013-05-15

摘要:

针对模糊聚类算法对点数据集聚类敏感性,以及区间类型数据聚类效果不明显等问题,提出了基于二次型距离改进的模糊可能性c 均值(fuzzy-possibilistic c-means,FPCM)聚类算法.首先分析了区间数据的特征,引入了区间值的数学表示方法,在此基础上提出了三种不同的基于区间数据距离度量方法以及相应权重矩阵计算方法,通过建立拉格朗日方程对目标方程优化,求得聚类中心、隶属度以及可能性迭代方程,并证明目标方程的收敛性,最后给出了算法执行步骤。在不同类型的数据集上实验,证明算法在点数据集和区间数据集上都具有较好聚类性能.

Abstract:

Aiming at the problem of most fuzzy clustering algorithms being sensitive to point data sets, and shortcoming of unobvious clustering results and so on, we propose improved fuzzy possibilistic C-means (FPCM) based on quadratic distance. We analyze the feature of interval-valued data and introduce mathematic representation method of interval-valued sample data. On the basis of these, we present three measure methods between interval-valued sample data and prototypes and corresponding computing methods of weight matrix, and then propose optimal objective function. The iterative function of centroid and membership and typicality are acquired by constructing Lagrange equation, and then iterative function is proved convergence. Finally, steps of algorithm are provided. Experiments on two types of three data sets show that algorithm has good performance not only on point prototype but also on interval-valued prototype.