doi: 10.17706/jsw.11.2.148-161
Handling Sparse Data Sets by Applying Contrast Set Mining in Feature Selection
Abstract—A data set is sparse if the number of samples in a data set is not sufficient to model the data accurately. Recent research emphasized interest in applying data mining and feature selection techniques to real world problems, many of which are characterized as sparse data sets. The purpose of this research is to define new techniques for feature selection in order to improve classification accuracy and reduce the time required for feature selection on sparse data sets. The extensive comparison with benchmarking feature selection techniques on 64 sparse data sets was conducted. Results have shown superiority of contrast set mining techniques in more than 80% of the analysis on sparse data sets. This paper provides a study on the new methodologies and detected superiority in handling data sparsity.
Index Terms—Classification, contrast set mining, data characteristics, data sparsity, feature selection.
Cite: Dijana Oreški, Mario Konecki, "Handling Sparse Data Sets by Applying Contrast Set Mining in Feature Selection," Journal of Software vol. 11, no. 2, pp. 148-161, 2016.
General Information
ISSN: 1796-217X (Online)
Abbreviated Title: J. Softw.
Frequency: Biannually
APC: 500USD
DOI: 10.17706/JSW
Editor-in-Chief: Prof. Antanas Verikas
Executive Editor: Ms. Cecilia Xie
Google Scholar, ProQuest,
INSPEC(IET), ULRICH's Periodicals
Directory, WorldCat, etcE-mail: jsweditorialoffice@gmail.com
-
Mar 07, 2025 News!
Vol 19, No 4 has been published with online version [Click]
-
Mar 07, 2025 News!
JSW had implemented online submission system [Click]
-
Apr 01, 2024 News!
Vol 14, No 4- Vol 14, No 12 has been indexed by IET-(Inspec) [Click]
-
Apr 01, 2024 News!
Papers published in JSW Vol 18, No 1- Vol 18, No 6 have been indexed by DBLP [Click]
-
Oct 22, 2024 News!
Vol 19, No 3 has been published with online version [Click]