Volume 4 Number 10 (Dec. 2009)
Home > Archive > 2009 > Volume 4 Number 10 (Dec. 2009) >
JSW 2009 Vol.4(10): 1119-1126 ISSN: 1796-217X
doi: 10.4304//jsw.4.10.1119-1126

An Efficient Parallel Clustering Algorithm for Large Scale Database

Jianfeng Yang1, Puliu Yan1, Yinbo Xie1, Qing Geng2,Jolly Wang3, Nick Bao 3

1School of Electronic Information, Wuhan University, Wuhan, Hubei, China
2Hubei Bureau of Surveying and Mapping, Wuhan, Hubei, China
3PRC Education, Intel China Ltd. Shanghai, China

Abstract—In this paper, we propose a new parallel clustering algorithm, named Stem-Leaf-Point Plot Clustering Algorithm (SLPPCA). SLPPCA tends to produce clusters of different shapes and sizes, and according to our experiments, it can produces clusters more efficiently than traditional methods. SLPPCA can fully exploits the data-parallelism of data objects, and adopts a task decomposition design step to balance the workloads of multi-core processors to achieve a high speedup. We implemented SLPPCA to large scale data base on duo-core processor and quad-core processor based computer separately and analyzed its performance. The experimental results show that the clusters it produced were particularly good either in different density or shapes, furthermore, with the parallel pattern used in SLPPCA on multi-core platform, the speedup was almost linear with the numbers of cores in processor and the number of data points. Moreover, SLPPCA can generate satisfactory cluster number automatically in clustering process.

Index Terms—Clustering, SLPPCA, SLPP, Parallel Processing, Performance Analysis, Parallel Pattern

[PDF]

Cite: Jianfeng Yang, Puliu Yan, Yinbo Xie,Qing Geng,Jolly Wang,Nick Bao "An Efficient Parallel Clustering Algorithm for Large Scale Database," Journal of Software vol. 4, no. 10, pp. 1119-1126, 2009.

General Information

  • ISSN: 1796-217X (Online)

  • Abbreviated Title: J. Softw.

  • Frequency:  Quarterly

  • APC: 500USD

  • DOI: 10.17706/JSW

  • Editor-in-Chief: Prof. Antanas Verikas

  • Executive Editor: Ms. Cecilia Xie

  • Abstracting/ Indexing: DBLP, EBSCO,
           CNKIGoogle Scholar, ProQuest,
           INSPEC(IET), ULRICH's Periodicals
           Directory, WorldCat, etc

  • E-mail: jsweditorialoffice@gmail.com

  • Jun 12, 2024 News!

    Vol 19, No 2 has been published with online version   [Click]

  • Jan 04, 2024 News!

    JSW will adopt Article-by-Article Work Flow

  • Apr 01, 2024 News!

    Vol 14, No 4- Vol 14, No 12 has been indexed by IET-(Inspec)     [Click]

  • Apr 01, 2024 News!

    Papers published in JSW Vol 18, No 1- Vol 18, No 6 have been indexed by DBLP   [Click]

  • Mar 01, 2024 News!

    Vol 19, No 1 has been published with online version    [Click]