doi: 10.4304/jsw.7.9.2119-2124
A Grouped Structure-based Regularized Regression Model for Text Categorization
2College of Information Engineering, China Jiliang University, Hangzhou, China
Abstract—The lasso regularization has successfully been used in regression models for feature selection; however, lasso considers all variable to be independent and noncorrelative, which will yield an excessively sparse solution (i.e., some important discriminating features might be discarded) if the features are highly correlated. This paper proposes a novel approach in which a sparse model was developed for text categorization. We firstly constructed a grouped structure according the correlation of text features, and then embedded the structure into a regression model via a between- and within- group sparse manner. The goal of such manner is that the groups containing many discriminating features can be selected even the features in these groups are highly correlated, and the noise within the selected groups could be discarded simultaneously, which is beneficial for classification. The experimental results show that the proposed method achieves a good tradeoff between performance and sparsity on three benchmark data sets.
Index Terms—Text Categorization, Regularization, Sparse, Lasso, Grouped Structure
Cite: Wenbin Zheng, Yuntao Qian, and Minchao Ye, "Grouped Structure-based Regularized Regression Model for Text Categorization," Journal of Software vol. 7, no. 9, pp. 2119-2124, 2012.
General Information
ISSN: 1796-217X (Online)
Abbreviated Title: J. Softw.
Frequency: Quarterly
APC: 500USD
DOI: 10.17706/JSW
Editor-in-Chief: Prof. Antanas Verikas
Executive Editor: Ms. Cecilia Xie
Abstracting/ Indexing: DBLP, EBSCO,
CNKI, Google Scholar, ProQuest,
INSPEC(IET), ULRICH's Periodicals
Directory, WorldCat, etcE-mail: jsweditorialoffice@gmail.com
-
Jun 12, 2024 News!
Vol 19, No 2 has been published with online version [Click]
-
Jan 04, 2024 News!
JSW will adopt Article-by-Article Work Flow
-
Apr 01, 2024 News!
Vol 14, No 4- Vol 14, No 12 has been indexed by IET-(Inspec) [Click]
-
Apr 01, 2024 News!
Papers published in JSW Vol 18, No 1- Vol 18, No 6 have been indexed by DBLP [Click]
-
Mar 01, 2024 News!
Vol 19, No 1 has been published with online version [Click]