Volume 4 Number 5 (Jul. 2009)
JSW 2009 Vol.4(5): 460-468 ISSN: 1796-217X
doi: 10.4304/jsw.4.5.460-468

Research on Web Session Clustering

Li Chaofeng
College of Management, South-Central University for Nationalities, Wuhan, P.R. China

Abstract—Web session clustering groups web sessions by similarity, maximizing intra-group similarity while minimizing inter-group similarity. Its results can be used in personalization, system improvement, site modification, business intelligence, usage characterization, and so forth. This paper first proposes a framework for Web session clustering. It then presents several data preparation techniques that can improve the performance of data preprocessing. Next, it introduces a new method for measuring the similarity between web pages that takes into account not only the URL but also the viewing time of each visited page, and describes in detail a new method for measuring the similarity of web sessions using sequence alignment and the similarity of web page accesses. Finally, it proposes a Web session clustering algorithm. The algorithm sets the number of clusters according to knowledge of the application field, uses ROCK to choose the initial data points of each cluster, and determines the criterion function according to the contribution to the overall increase in similarity made by dividing Web sessions into different clusters. This not only overcomes the shortcoming of traditional clustering algorithms, which focus merely on partial similarities, but also reduces time and space complexity.

Index Terms—Web session clustering; data preprocessing; sequence alignment; similarity measurement
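
The abstract names two ingredients, a page similarity built from URL and viewing time, and a session similarity built on sequence alignment, without giving their formulas here. The sketch below is one possible reading rather than the paper's own definitions. Everything specific in it is an assumption made for illustration: the shared-path-prefix URL measure, the min/max viewing-time ratio, the weight `w_url`, the gap penalty `gap`, the use of Needleman-Wunsch global alignment, and all function names.

```python
from urllib.parse import urlparse


def url_similarity(a: str, b: str) -> float:
    """Assumed measure: shared leading path segments / longer path length."""
    pa = [s for s in urlparse(a).path.split("/") if s]
    pb = [s for s in urlparse(b).path.split("/") if s]
    longest = max(len(pa), len(pb))
    if longest == 0:
        return 1.0  # both URLs refer to the site root
    common = 0
    for x, y in zip(pa, pb):
        if x != y:
            break
        common += 1
    return common / longest


def time_similarity(t1: float, t2: float) -> float:
    """Assumed measure: ratio of shorter to longer viewing time (non-negative seconds)."""
    longer = max(t1, t2)
    if longer <= 0:
        return 1.0  # neither page was viewed for any measurable time
    return min(t1, t2) / longer


def page_similarity(p, q, w_url: float = 0.7) -> float:
    """Weighted mix of URL and viewing-time similarity; w_url is a hypothetical weight."""
    (url_p, t_p), (url_q, t_q) = p, q
    return w_url * url_similarity(url_p, url_q) + (1 - w_url) * time_similarity(t_p, t_q)


def session_similarity(s1, s2, gap: float = -0.5) -> float:
    """Global alignment score of two sessions, normalized to [0, 1].

    Each session is a list of (url, viewing_time) pairs. Aligned pages score
    their page_similarity; each skipped page costs the (assumed) gap penalty.
    """
    n, m = len(s1), len(s2)
    if n == 0 or m == 0:
        return 0.0
    # score[i][j] = best alignment score of s1[:i] against s2[:j]
    score = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            aligned = score[i - 1][j - 1] + page_similarity(s1[i - 1], s2[j - 1])
            score[i][j] = max(aligned, score[i - 1][j] + gap, score[i][j - 1] + gap)
    # Negative totals are clamped to 0; dividing by the longer length keeps the result <= 1.
    return max(0.0, score[n][m]) / max(n, m)
```

A pairwise matrix of `session_similarity` values could then be handed to a clustering step. The ROCK-based initialization and the criterion function mentioned in the abstract are not sketched here, since the abstract does not specify them.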


Cite: Li Chaofeng, "Research on Web Session Clustering," Journal of Software, vol. 4, no. 5, pp. 460-468, 2009.
