Text Summarization versus CHI for Feature Selection
Creators
- 1. Department of Computer Science, University of Jordan, Jordan.
Description
Text Classification is an important technique for handling the huge and increasing amount of text documents on the web. An important problem of text classification is features selection. Many feature selection techniques were used in order to solve this problem, such as chi-square (CHI). Rather than using these techniques, this paper proposes a method for feature selection based on text summarization. We demonstrate this method on Arabic text documents and use text summarization for feature selection. Support Vector Machine (SVM) is then used to classify the summarized documents and the ones processed by CHI. The classification indicators (precision, recall, and accuracy) achieved by text summarization are higher than the ones achieved by CHI. However, text summarization has negligible higher execution time.
Files
Jabri2242017BJMCS33615.pdf
Files
(205.2 kB)
Name | Size | Download all |
---|---|---|
md5:3fdf6e5b8df8adfab13f94d348cdddc9
|
205.2 kB | Preview Download |