Published October 13, 2023 | Version v2
Dataset Open

A didactical dataset to learn supervised classification with candy

Description

A didactical dataset to learn supervised classification

It was obtained from university level students measuring candy that was mixed and distributed in bowls to them. The goal of this dataset creation was to expose the students to the data taking process. Further, the dataset is meant for classification.

Dataset Structure

The dataset consists of 6 csv files:

  • peanuts.csv represents the entire dataset (a concatenation of all group?.csv files) omitting the sample column
  • peanuts_all.csv represents the entire dataset (a concatenation of all group?.csv files)
  • files matching group[1-5].csv represent the measurements of each group

Data Representation

Each file contains 5 columns.  

  • color, int values, 0: white, 1: black, 2: brown, 3: other
  • shape, int values, 0: irregular, 1 round, 2: lens-like
  • height, float values, in millimeter
  • width, float values, in millimeter
  • label, category, peanut/nopeanut

For more information on the didactical background, see the original publication that presented the concept for this activity.

Files

peanuts.csv

Files (6.2 kB)

Name Size Download all
md5:cea2b1c0216fd1a34c8177488fb223d2
790 Bytes Preview Download
md5:314144a24906573929c49cfebefa6d56
493 Bytes Preview Download
md5:72e4a883f87a7e24affb51f2c1c82d8d
463 Bytes Preview Download
md5:158e51752ac169959a696813bf3a6866
238 Bytes Preview Download
md5:ed63530595671918dd79269ffb8ec672
279 Bytes Preview Download
md5:9aca09e0361617655bec3bda78b93271
1.8 kB Preview Download
md5:99c85de5b555705609911f5597ab930a
2.1 kB Preview Download

Additional details

Dates

Collected
2023-10-13
date of lecture