Volume 9 Number 4 (Apr. 2014)
Home > Archive > 2014 > Volume 9 Number 4 (Apr. 2014) >
JSW 2014 Vol.9(4): 1028-1035 ISSN: 1796-217X
doi: 10.4304/jsw.9.4.1028-1035

The Large-scale Dynamic Data Rapid Reduction Algorithm Based on Map-Reduce

Jing-ling Yuan1, Jing Xie1, 2, Yan Yuan1, 2, Lin Li1, 2

1Computer Science and Technology school, Wuhan University of Technology, Wuhan, China
2School of Urban Design, Wuhan University, Wuhan, China


Abstract—With the advent of the era of “Big Data”, the application of the large-scale data is becoming popular. Efficiently using and analyzing the data has become an interesting research topic. Traditional knowledge reduction algorithms read small data samples once into a computer main memory for reduction, but it is not suitable for large-scale data. This paper takes large-scale sensor monitoring dynamic data as the research object and puts forward an incremental reduction algorithm based on Map-Reduce. Using a Hash fast partitioning strategy this algorithm divides the dynamic data set into multiple subdatasets to compute, which has greatly reduced the calculation time and space complexity of each node. Finally,experiments are conducted on the data from UCI Machine Learning Repository using Hadoop platform to prove that the algorithm is efficient and suitable for large-scale dynamic data. Compared to the traditional algorithms, the highest speedup of the parallel algorithm can be increased up to 1.55 times.

Index Terms—Large-scale dynamic data, increment knowledge reduction, Hash algorithm, Map-Reduce

[PDF]

Cite: Jing-ling Yuan, Jing Xie, Yan Yuan, Lin Li, "The Large-scale Dynamic Data Rapid Reduction Algorithm Based on Map-Reduce," Journal of Software vol. 9, no. 4, pp. 1028-1035, 2014.

General Information

  • ISSN: 1796-217X (Online)
  • Frequency:  Quarterly
  • Editor-in-Chief: Prof. Antanas Verikas
  • Executive Editor: Ms. Yoyo Y. Zhou
  • Abstracting/ Indexing: DBLP, EBSCO, CNKIGoogle Scholar, ProQuest, INSPEC(IET), ULRICH's Periodicals Directory, WorldCat, etc
  • E-mail: jsweditorialoffice@gmail.com
  • APC: 500USD
  • Jun 12, 2024 News!

    Vol 19, No 2 has been published with online version   [Click]

  • Jan 04, 2024 News!

    JSW will adopt Article-by-Article Work Flow

  • Apr 01, 2024 News!

    Vol 14, No 4- Vol 14, No 12 has been indexed by IET-(Inspec)     [Click]

  • Apr 01, 2024 News!

    Papers published in JSW Vol 18, No 1- Vol 18, No 6 have been indexed by DBLP   [Click]

  • Mar 01, 2024 News!

    Vol 19, No 1 has been published with online version    [Click]