A SURVEY ON MACHINE LEARNING TECHNIQUES FOR TEXT CLASSIFICATION

Amey K. Shet Tilve*, Surabhi N. Jain

doi:10.5281/zenodo.322477

Published February 25, 2017 | Version v1

Journal article Open

A SURVEY ON MACHINE LEARNING TECHNIQUES FOR TEXT CLASSIFICATION

Amey K. Shet Tilve*, Surabhi N. Jain

This research focuses on Text Classification. Text classification is the task of automatically sorting a set of documents into categories from a predefined set. The domain of this research is the combination of information retrieval (IR) technology, Data mining and machine learning (ML) technology. This research will outline the fundamental traits of the technologies involved. This research uses three text classification algorithms (Naive Bayes, VSM for text classification and the new technique -Use of Stanford Tagger for text classification) to classify documents into different categories, which is trained on two different datasets (20 Newsgroups and New news dataset for five categories).In regards to the above classification strategies, Naïve Bayes is potentially good at serving as a text classification model due to its simplicity.

Files

Files (140.8 kB)

Name	Size	Download all
Amey Tilve.docx md5:59e01c7bb68545fdfc97cd3002645d05	140.8 kB	Download

190

Views

421

Downloads

Show more details

	All versions	This version
Views	190	190
Downloads	421	421
Data volume	61.1 MB	61.1 MB

More info on how stats are collected....

DOI

Resource type

Journal article

Publisher

Zenodo

Published in

INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY, 6(2), 513-520, 2017.

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: February 25, 2017
Modified: January 20, 2020

A SURVEY ON MACHINE LEARNING TECHNIQUES FOR TEXT CLASSIFICATION

Authors/Creators

Description

Files

Files (140.8 kB)