Volume 7 Number 9 (Sep. 2012)
Home > Archive > 2012 > Volume 7 Number 9 (Sep. 2012) >
JSW 2012 Vol.7(9): 2119-2124 ISSN: 1796-217X
doi: 10.4304/jsw.7.9.2119-2124

A Grouped Structure-based Regularized Regression Model for Text Categorization

Wenbin Zheng1, 2, Yuntao Qian1, and Minchao Ye1

1College of Computer Science and Technology, Zhejiang University, Hangzhou, China
2College of Information Engineering, China Jiliang University, Hangzhou, China

Abstract—The lasso regularization has successfully been used in regression models for feature selection; however, lasso considers all variable to be independent and noncorrelative, which will yield an excessively sparse solution (i.e., some important discriminating features might be discarded) if the features are highly correlated. This paper proposes a novel approach in which a sparse model was developed for text categorization. We firstly constructed a grouped structure according the correlation of text features, and then embedded the structure into a regression model via a between- and within- group sparse manner. The goal of such manner is that the groups containing many discriminating features can be selected even the features in these groups are highly correlated, and the noise within the selected groups could be discarded simultaneously, which is beneficial for classification. The experimental results show that the proposed method achieves a good tradeoff between performance and sparsity on three benchmark data sets.

Index Terms—Text Categorization, Regularization, Sparse, Lasso, Grouped Structure

[PDF]

Cite: Wenbin Zheng, Yuntao Qian, and Minchao Ye, "Grouped Structure-based Regularized Regression Model for Text Categorization," Journal of Software vol. 7, no. 9, pp. 2119-2124, 2012.

General Information

ISSN: 1796-217X (Online)
Frequency:  Quarterly
Editor-in-Chief: Prof. Antanas Verikas
Executive Editor: Ms. Yoyo Y. Zhou
Abstracting/ Indexing: DBLP, EBSCO, CNKIGoogle Scholar, ProQuest, INSPEC(IET), ULRICH's Periodicals Directory, WorldCat, etc
E-mail: jsweditorialoffice@gmail.com
  • Mar 01, 2024 News!

    Vol 19, No 1 has been published with online version    [Click]

  • Jan 04, 2024 News!

    JSW will adopt Article-by-Article Work Flow

  • Apr 01, 2024 News!

    Vol 14, No 4- Vol 14, No 12 has been indexed by IET-(Inspec)     [Click]

  • Apr 01, 2024 News!

    Papers published in JSW Vol 18, No 1- Vol 18, No 6 have been indexed by DBLP   [Click]

  • Nov 02, 2023 News!

    Vol 18, No 4 has been published with online version   [Click]