Published October 30, 2020 | Version v1
Journal article Open

SVM and Cross-Validation using R Studio

  • 1. Assistant Professor, Department of Information Technology, Jagan Institute of Management Studies - JIMS Rohini, Delhi, India.
  • 2. Professor, Jagan Institute of Management Studies - JIMS Rohini, Delhi, India.
  • 3. Computer Science Engineering Graduate, Jagannath University Bahadurgarh, Haryana, India.
  • 1. Publisher

Description

Each passing day data is getting multiplied. It is difficult to extract useful information from such big data. Data Mining is used to extract useful information. Data mining is used in majorly all fields like healthcare, marketing, social media platforms and so on. In this paper, data is loaded and preprocessed by dealing with some missing values. The dataset used is of Airbnb, the platform used for lodging and tourism industry. Analyzing the data by plotting correlation using spearman method. Further, applying PCA and Support Vector Machine classification technique on the dataset. There are various applications of SVM, it is used in face-detection, text and hypertext categorization, classification of images, bioinformatics and so on. SVM has high dimensional input space, sparse document vectors and regularization parameters therefore it is appropriate to use SVM. Cross-validation gives more accurate result. The dataset is divided into folds. The end product is the test set which is similar to full dataset. Confusion matrix is evaluated, grid approach is followed for building the matrix at various seeds and kernels (RBF, Polynomial). The aim of this research is to see which is the best kernel for the dataset.

Files

A16731010120.pdf

Files (2.1 MB)

Name Size Download all
md5:6779049817dccdce614344b1617d5007
2.1 MB Preview Download

Additional details

Related works

Is cited by
Journal article: 2249-8958 (ISSN)

Subjects

ISSN
2249-8958
Retrieval Number
100.1/ijeat.A16731010120