Volume 5 Number 10 (Oct. 2010)
Home > Archive > 2010 > Volume 5 Number 10 (Oct. 2010) >
JSW 2010 Vol.5(10): 1137-1143 ISSN: 1796-217X
doi: 10.4304/jsw.5.10.1137-1143

The Chinese Text Categorization System with Category Priorities

Huan-Chao Keh, Ding-An Chiang, Chih-Cheng Hsu, Hui-Hua Huang

Department of Information Engineering, Tamkang University, Taipei, Taiwan

Abstract—The process of text categorization involves some understanding of the content of the documents and/or some previous knowledge of the categories. For the content of the documents, we use a filtering measure for feature selection in our Chinese text categorization system. We modify the formula of Term Frequency-Inverse Document Frequency (TF-IDF) to strengthen important keywords’ weights and weaken unimportant keywords’ weights. For the knowledge of the categories, we use category priority to represent the relationship between two different categories. Consequently, the experimental results show that our method can effectively not only decrease noise text but also increase the accuracy rate and recall rate of text categorization.

Index Terms—text categorization, feature selection, filtering measure, text mining.

[PDF]

Cite: Huan-Chao Keh, Ding-An Chiang, Chih-Cheng Hsu, Hui-Hua Huang, "The Chinese Text Categorization System with Category Priorities," Journal of Software vol. 5, no. 10, pp. 1137-1143, 2010.

General Information

ISSN: 1796-217X (Online)
Frequency:  Quarterly
Editor-in-Chief: Prof. Antanas Verikas
Executive Editor: Ms. Yoyo Y. Zhou
Abstracting/ Indexing: DBLP, EBSCO, CNKIGoogle Scholar, ProQuest, INSPEC(IET), ULRICH's Periodicals Directory, WorldCat, etc
E-mail: jsweditorialoffice@gmail.com
  • Mar 01, 2024 News!

    Vol 19, No 1 has been published with online version    [Click]

  • Apr 26, 2021 News!

    Vol 14, No 4- Vol 14, No 12 has been indexed by IET-(Inspec)     [Click]

  • Nov 18, 2021 News!

    Papers published in JSW Vol 16, No 1- Vol 16, No 6 have been indexed by DBLP   [Click]

  • Jan 04, 2024 News!

    JSW will adopt Article-by-Article Work Flow

  • Nov 02, 2023 News!

    Vol 18, No 4 has been published with online version   [Click]