Published July 31, 2019 | Version version - 2
Dataset Open

E commerce text dataset

Creators

Description

This is the classification based E-commerce text dataset for 4 categories - "Electronics", "Household", "Books" and "Clothing & Accessories", which almost cover 80% of any E-commerce website. 

The dataset is in ".csv" format with two columns - the first column is the class name and the second one is the datapoint of that class. The data point is the product and description from the e-commerce website.

The dataset has the following features :

Data Set Characteristics:  Multivariate

Number of Instances: 50425

Number of classes:  4

Area: Computer science 

Attribute Characteristics: Real

Number of Attributes: 1

Associated Tasks: Classification

Missing Values? No

The dataset has been scraped from Indian e-commerce platform.

Files

ecommerceDataset.csv

Files (36.9 MB)

Name Size Download all
md5:64e4e22ae665399049414cb8745d79df
331 Bytes Download
md5:6f2cfdb475096244916aeed6f776c1a6
36.9 MB Preview Download