Volume 7 Number 1 (Jan. 2012)
Home > Archive > 2012 > Volume 7 Number 1 (Jan. 2012) >
JSW 2012 Vol.7(1): 228-236 ISSN: 1796-217X
doi: 10.4304/jsw.7.1.228-236

f-Fractional Bit Minwise Hashing

Xinpan YUAN, Jun LONG*, Zuping ZHANG, Yueyi LUO, Hao Zhang, Weihua Gui

School of Information Science and Engineering ,Central South University ,Changsha 410083 ,China

Abstract—In information retrieval, minwise hashing algorithm is often used to estimate similarities among documents. b-bit minwise hashing is capable of gaining substantial advantages in terms of computational efficiency and storage space by only storing the lowest b bits of each (minwise) hashed value (e.g., b=1 or 2). In this paper, we propose a fractional bit hashing method, which extends the existing b-bit Minwise hashing. It is shown theoretically that the fractional bit hashing has a wider range of selectivity for accuracy and storage space requirements. Theoretical analysis and experimental results demonstrate the effectiveness of this method.

Index Terms—similarity, hashing, fractional bit

[PDF]

Cite:Xinpan YUAN, Jun LONG*, Zuping ZHANG, Yueyi LUO, Hao Zhang, Weihua Gui, "f-Fractional Bit Minwise Hashing," Journal of Software vol. 7, no.1, pp. 228-236, 2012.

General Information

ISSN: 1796-217X (Online)
Frequency:  Quarterly
Editor-in-Chief: Prof. Antanas Verikas
Executive Editor: Ms. Yoyo Y. Zhou
Abstracting/ Indexing: DBLP, EBSCO, CNKIGoogle Scholar, ProQuest, INSPEC(IET), ULRICH's Periodicals Directory, WorldCat, etc
E-mail: jsweditorialoffice@gmail.com
  • Mar 01, 2024 News!

    Vol 19, No 1 has been published with online version    [Click]

  • Jan 04, 2024 News!

    JSW will adopt Article-by-Article Work Flow

  • Apr 01, 2024 News!

    Vol 14, No 4- Vol 14, No 12 has been indexed by IET-(Inspec)     [Click]

  • Apr 01, 2024 News!

    Papers published in JSW Vol 18, No 1- Vol 18, No 6 have been indexed by DBLP   [Click]

  • Nov 02, 2023 News!

    Vol 18, No 4 has been published with online version   [Click]