Volume 4 Number 10 (Dec. 2009)
JSW 2009 Vol.4(10): 1119-1126 ISSN: 1796-217X
doi: 10.4304/jsw.4.10.1119-1126

An Efficient Parallel Clustering Algorithm for Large Scale Database

Jianfeng Yang1, Puliu Yan1, Yinbo Xie1, Qing Geng2, Jolly Wang3, Nick Bao3
1School of Electronic Information, Wuhan University, Wuhan, Hubei, China
2Hubei Bureau of Surveying and Mapping, Wuhan, Hubei, China
3PRC Education, Intel China Ltd., Shanghai, China

Abstract—In this paper, we propose a new parallel clustering algorithm, the Stem-Leaf-Point Plot Clustering Algorithm (SLPPCA). SLPPCA tends to produce clusters of different shapes and sizes, and in our experiments it produces clusters more efficiently than traditional methods. SLPPCA fully exploits the data parallelism of the data objects and adopts a task decomposition design step to balance the workload across the cores of a multi-core processor, achieving a high speedup. We applied SLPPCA to a large-scale database on dual-core and quad-core computers separately and analyzed its performance. The experimental results show that the clusters it produced were particularly good for data of differing density or shape. Furthermore, with the parallel pattern used in SLPPCA on a multi-core platform, the speedup was almost linear in the number of processor cores and in the number of data points. Moreover, SLPPCA determines a satisfactory number of clusters automatically during the clustering process.

Index Terms—Clustering, SLPPCA, SLPP, Parallel Processing, Performance Analysis, Parallel Pattern
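
The abstract mentions balancing the workload across cores and a near-linear speedup, but this page does not give the decomposition itself. As a rough illustration only, the sketch below shows one generic way to split data points evenly among workers and how speedup and parallel efficiency are conventionally computed. The function names (`split_balanced`, `speedup`, `efficiency`) and the timings in the usage lines are hypothetical; they are not taken from the paper and are not SLPPCA's actual task decomposition.

```python
from typing import List, Tuple


def split_balanced(n_items: int, n_workers: int) -> List[Tuple[int, int]]:
    """Return half-open (start, stop) index ranges, one per worker.

    Chunk sizes differ by at most one, so no worker receives a
    noticeably larger share of the data points than another.
    This is a generic static partition, assumed for illustration.
    """
    if n_workers <= 0:
        raise ValueError("n_workers must be positive")
    base, extra = divmod(n_items, n_workers)
    ranges = []
    start = 0
    for worker in range(n_workers):
        # The first `extra` workers take one additional item each.
        size = base + (1 if worker < extra else 0)
        ranges.append((start, start + size))
        start += size
    return ranges


def speedup(t_serial: float, t_parallel: float) -> float:
    """Conventional speedup: serial run time divided by parallel run time."""
    return t_serial / t_parallel


def efficiency(t_serial: float, t_parallel: float, n_cores: int) -> float:
    """Parallel efficiency: speedup per core (1.0 means perfectly linear)."""
    return speedup(t_serial, t_parallel) / n_cores


if __name__ == "__main__":
    # 10 data points over 4 workers: two chunks of 3, two chunks of 2.
    print(split_balanced(10, 4))
    # Made-up timings, not measurements from the paper.
    print(speedup(8.0, 2.0), efficiency(8.0, 2.0, 4))
```

Under this convention, "almost linear" speedup corresponds to an efficiency close to 1.0 as the core count grows.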


Cite: Jianfeng Yang, Puliu Yan, Yinbo Xie, Qing Geng, Jolly Wang, Nick Bao, "An Efficient Parallel Clustering Algorithm for Large Scale Database," Journal of Software, vol. 4, no. 10, pp. 1119-1126, 2009.
