Published February 29, 2008
| Version 5384
Journal article
Open
Feature Selection Methods for an Improved SVM Classifier
Authors/Creators
Description
Text categorization is the problem of classifying text
documents into a set of predefined classes. After a preprocessing
step, the documents are typically represented as large sparse vectors.
When training classifiers on large collections of documents, both the
time and memory restrictions can be quite prohibitive. This justifies
the application of feature selection methods to reduce the
dimensionality of the document-representation vector. In this paper,
three feature selection methods are evaluated: Random Selection,
Information Gain (IG) and Support Vector Machine feature selection
(called SVM_FS). We show that the best results were obtained with
SVM_FS method for a relatively small dimension of the feature
vector. Also we present a novel method to better correlate SVM
kernel-s parameters (Polynomial or Gaussian kernel).
Files
5384.pdf
Files
(7.0 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:3369fe7869a82df50a38440021e8541a
|
7.0 MB | Preview Download |