JSW 2017 Vol.12(1): 62-80 ISSN: 1796-217X
doi: 10.17706/jsw.12.1.62-80
An Improved K-means Algorithm Based on Structure Features
Qiang Zhan 1,2
1 School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China.
2 College of Engineering, Forestry, and Natural Sciences, Northern Arizona University, Arizona, USA.
Abstract—In K-means clustering, we are given a set of n data points in multidimensional space, and the problem is to determine the number k of clusters. In this paper, we present three methods for determining the true number of spherical Gaussian clusters in the presence of additional noise features. Our algorithms take into account the structure of Gaussian data sets and the choice of initial centroids, and each has its own emphasis and characteristics. The first method uses the Minkowski distance as the similarity measure, which is suitable for discovering clusters of non-convex spherical shape or clusters with large differences in size. The second method uses a feature-weighted Minkowski distance, which reflects the differing importance of individual features for the clustering result. The third method combines the Minkowski distance with the best feature factors. We evaluate the algorithms with a variety of general cluster validity indices on Gaussian data sets with and without noise features. The results show that the proposed algorithms achieve higher precision than the traditional K-means algorithm.
Index Terms—K-means, feature weighting, clustering, cluster validity index.
Cite: Qiang Zhan, "An Improved K-means Algorithm Based on Structure Features," Journal of Software vol. 12, no. 1, pp. 62-80, 2017.
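The abstract outlines K-means variants that replace the usual Euclidean distance with a (feature-weighted) Minkowski distance. As a rough illustration of that idea only, the following Python sketch shows one plausible form of such a clustering loop; the function name, parameters, and the use of the component-wise mean as the cluster centre are illustrative assumptions and are not taken from the paper (for p not equal to 2, the mean only approximates the Minkowski-optimal centre).

# Minimal sketch (not the authors' implementation): K-means with a Minkowski
# distance of order p and optional per-feature weights. All names and defaults
# here are illustrative assumptions.
import numpy as np

def minkowski_kmeans(X, k, p=2.0, weights=None, n_iter=100, seed=0):
    """Cluster X (n_samples x n_features) into k clusters.

    p       : Minkowski order (p=2 recovers standard Euclidean K-means).
    weights : optional per-feature weights; a larger weight makes that
              feature count more in the distance.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.ones(d) if weights is None else np.asarray(weights, dtype=float)

    # Initialise centroids by picking k distinct data points at random.
    centroids = X[rng.choice(n, size=k, replace=False)].copy()

    for _ in range(n_iter):
        # Weighted Minkowski distance from every point to every centroid.
        diff = np.abs(X[:, None, :] - centroids[None, :, :])      # (n, k, d)
        dist = np.sum(w * diff ** p, axis=2) ** (1.0 / p)         # (n, k)
        labels = np.argmin(dist, axis=1)

        # Recompute each centroid as the mean of its assigned points
        # (a simplification; the exact Minkowski centre differs for p != 2).
        new_centroids = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids

Under these assumptions, a call such as minkowski_kmeans(X, k=3, p=1.5, weights=[1.0, 1.0, 0.1]) would down-weight a third, noisy feature, in the same spirit as the feature-weighted variant described in the abstract.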