Dataset Open Access

Regression analysis in Galaxy with car purchase price prediction dataset

Kaivan Kamali

Source/Credit: Michael Grogan
https://github.com/MGCodesandStats
https://github.com/MGCodesandStats/datasets/blob/master/cars.csv

Sample dataset for regression analysis. Given five attributes (age, gender, miles driven per day, debt, and income), the task is to predict how much a person will spend on purchasing a car. All five input attributes have been scaled to the range 0 to 1. The training set has 723 examples and the test set has 242 examples.

This dataset will be used in an upcoming Galaxy Training Network tutorial (https://training.galaxyproject.org/training-material/topics/statistics/) on the use of feedforward neural networks for regression analysis.

Files (52.1 kB)

Name          Size      Checksum
X_test.tsv    10.9 kB   md5:598d59873f8684891e9f1c048f1e01bb
X_train.tsv   32.5 kB   md5:36422148ce5c683a093a0dd72edb4092
y_test.tsv    2.2 kB    md5:fc28be1560ea94240201d3ad413259ac
y_train.tsv   6.5 kB    md5:4adff57b6357119fc06cb7ced5b13f0d
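
The MD5 checksums listed above can be used to confirm that a download is intact. A minimal sketch, using only the Python standard library, is shown below. The helper names `md5_of` and `verify` are hypothetical, and the files are assumed to sit in the current working directory under the names listed on this page.

```python
import hashlib

# Checksums copied from the file list on this page.
EXPECTED_MD5 = {
    "X_test.tsv": "598d59873f8684891e9f1c048f1e01bb",
    "X_train.tsv": "36422148ce5c683a093a0dd72edb4092",
    "y_test.tsv": "fc28be1560ea94240201d3ad413259ac",
    "y_train.tsv": "4adff57b6357119fc06cb7ced5b13f0d",
}


def md5_of(path, chunk_size=65536):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of(path) == expected.lower()


# Example usage (assumes the four files are in the current directory):
# for name, expected in EXPECTED_MD5.items():
#     print(name, "OK" if verify(name, expected) else "MISMATCH")
```

MD5 is adequate here for detecting accidental corruption during transfer; it is not a safeguard against deliberate tampering.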
                   All versions   This version
Views              115            115
Downloads          186            186
Data volume        2.5 MB         2.5 MB
Unique views       104            104
Unique downloads   67             67
