doi: 10.17706/jsw.11.2.148-161
Handling Sparse Data Sets by Applying Contrast Set Mining in Feature Selection
Abstract—A data set is sparse if the number of samples in a data set is not sufficient to model the data accurately. Recent research emphasized interest in applying data mining and feature selection techniques to real world problems, many of which are characterized as sparse data sets. The purpose of this research is to define new techniques for feature selection in order to improve classification accuracy and reduce the time required for feature selection on sparse data sets. The extensive comparison with benchmarking feature selection techniques on 64 sparse data sets was conducted. Results have shown superiority of contrast set mining techniques in more than 80% of the analysis on sparse data sets. This paper provides a study on the new methodologies and detected superiority in handling data sparsity.
Index Terms—Classification, contrast set mining, data characteristics, data sparsity, feature selection.
Cite: Dijana Oreški, Mario Konecki, "Handling Sparse Data Sets by Applying Contrast Set Mining in Feature Selection," Journal of Software vol. 11, no. 2, pp. 148-161, 2016.
General Information
ISSN: 1796-217X (Online)
Abbreviated Title: J. Softw.
Frequency: Quarterly
APC: 500USD
DOI: 10.17706/JSW
Editor-in-Chief: Prof. Antanas Verikas
Executive Editor: Ms. Yoyo Y. Zhou
Abstracting/ Indexing: DBLP, EBSCO,
CNKI, Google Scholar, ProQuest,
INSPEC(IET), ULRICH's Periodicals
Directory, WorldCat, etcE-mail: jsweditorialoffice@gmail.com
-
Jun 12, 2024 News!
Vol 19, No 2 has been published with online version [Click]
-
Jan 04, 2024 News!
JSW will adopt Article-by-Article Work Flow
-
Apr 01, 2024 News!
Vol 14, No 4- Vol 14, No 12 has been indexed by IET-(Inspec) [Click]
-
Apr 01, 2024 News!
Papers published in JSW Vol 18, No 1- Vol 18, No 6 have been indexed by DBLP [Click]
-
Mar 01, 2024 News!
Vol 19, No 1 has been published with online version [Click]