Volume 8 Number 1 (Jan. 2013)
Home > Archive > 2013 > Volume 8 Number 1 (Jan. 2013) >
JSW 2013 Vol.8(1): 19-24 ISSN: 1796-217X
doi: 10.4304/jsw.8.1.19-24

Estimation of the Number of Distinct Values over Data Stream Based on Compound Sliding Window

Yingli Zhong, Jinghua Zhu, Meirui Ren, Yan Yang
School of Computer Science and Technology, Heilongjiang University, and Key Laboratory of Database and Parallel Computing, Harbin, China, 150080

Abstract—Estimating the number of distinct values in a data stream is a vital problem with many applications such as complex join query over multiple data streams. In this paper, we focus on the continuous and periodic distinct values estimation over sliding windows. We propose a compound sliding window model to compute the distinct values over basic sliding windows in an incremental way. LDV, HDV and AHDV are the three algorithms that are based on compound sliding windows. The basic idea behind the compound sliding windows is to organize the basic windows into a Hash table according to distinct values. Whenever a new data arrives at the data stream, it is inserted into a basic window. Once the basic window is full, a scan using distinct values is executed and the distinct values number is updated incrementally. Theoretical analysis and experiment results show that the distinct values estimation algorithms based on compound sliding windows have a great performance benefits.

Index Terms—Data stream, basic window, compound sliding window, distinct values estimation.

[PDF]

Cite: Yingli Zhong, Jinghua Zhu, Meirui Ren, Yan Yang, "Estimation of the Number of Distinct Values over Data Stream Based on Compound Sliding Window," Journal of Software vol. 8, no. 1, pp. 19-24, 2013.

General Information

ISSN: 1796-217X (Online)
Frequency:  Quarterly
Editor-in-Chief: Prof. Antanas Verikas
Executive Editor: Ms. Yoyo Y. Zhou
Abstracting/ Indexing: DBLP, EBSCO, CNKIGoogle Scholar, ProQuest, INSPEC(IET), ULRICH's Periodicals Directory, WorldCat, etc
E-mail: jsw@iap.org
  • Apr 26, 2021 News!

    Vol 14, No 4- Vol 14, No 12 has been indexed by IET-(Inspec)     [Click]

  • Nov 18, 2021 News!

    Papers published in JSW Vol 16, No 1- Vol 16, No 6 have been indexed by DBLP   [Click]

  • Dec 24, 2021 News!

     Vol 15, No 1- Vol 15, No 6 has been indexed by IET-(Inspec)   [Click]

  • Nov 18, 2021 News!

    [CFP] 2022 the annual meeting of JSW Editorial Board, ICCSM 2022, will be held in Rome, Italy, July 21-23, 2022   [Click]

  • Aug 01, 2023 News!

        [Click]