Published June 6, 2017 | Version v1
Journal article Open

Text Summarization versus CHI for Feature Selection

  • 1. Department of Computer Science, University of Jordan, Jordan.

Description

Text Classification is an important technique for handling the huge and increasing amount of text documents on the web. An important problem of text classification is features selection. Many feature selection techniques were used in order to solve this problem, such as chi-square (CHI). Rather than using these techniques, this paper proposes a method for feature selection based on text summarization. We demonstrate this method on Arabic text documents and use text summarization for feature selection. Support Vector Machine (SVM) is then used to classify the summarized documents and the ones processed by CHI. The classification indicators (precision, recall, and accuracy) achieved by text summarization are higher than the ones achieved by CHI. However, text summarization has negligible higher execution time.

Files

Jabri2242017BJMCS33615.pdf

Files (205.2 kB)

Name Size Download all
md5:3fdf6e5b8df8adfab13f94d348cdddc9
205.2 kB Preview Download