Feature Selection Methods for an Improved SVM Classifier

Daniel Morariu; Lucian N. Vintan; Volker Tresp

doi:10.5281/zenodo.1332490

Published February 29, 2008 | Version 5384

Journal article Open

Feature Selection Methods for an Improved SVM Classifier

Text categorization is the problem of classifying text documents into a set of predefined classes. After a preprocessing step, the documents are typically represented as large sparse vectors. When training classifiers on large collections of documents, both the time and memory restrictions can be quite prohibitive. This justifies the application of feature selection methods to reduce the dimensionality of the document-representation vector. In this paper, three feature selection methods are evaluated: Random Selection, Information Gain (IG) and Support Vector Machine feature selection (called SVM_FS). We show that the best results were obtained with SVM_FS method for a relatively small dimension of the feature vector. Also we present a novel method to better correlate SVM kernel-s parameters (Polynomial or Gaussian kernel).

Files

5384.pdf

Files (7.0 MB)

Name	Size	Download all
5384.pdf md5:3369fe7869a82df50a38440021e8541a	7.0 MB	Preview Download

133

Views

118

Downloads

Show more details

	All versions	This version
Views	133	133
Downloads	118	118
Data volume	848.0 MB	848.0 MB

More info on how stats are collected....

DOI

Resource type

Journal article

Publisher

Zenodo

Published in

International Journal of Information, Control and Computer Sciences, 1.0(2), 2008.

Languages

English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: August 7, 2018
Modified: August 2, 2024

Feature Selection Methods for an Improved SVM Classifier

Authors/Creators

Description

Files

5384.pdf

Files (7.0 MB)