Dataset Open Access
Kaivan Kamali
Credit: Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. "Gradient-based learning applied to document recognition." Proceedings of the IEEE, 86(11):2278-2324, November 1998
This is a subset of MNIST handwritten digits dataset (http://yann.lecun.com/exdb/mnist/). Training data of composed of 12,000 images of digits 0 to 9. Test data is composed of 6,000 images of digits 0 to 9 (Original dataset has 60,000 training and 10,000 testing images. We are using a subset for a Galaxy tutorial, so the training is not too computationally intensive). Images are grayscale and 28 by 28 pixels. Each pixel has a value between 0 and 255 (0 for color black, 255 for color white, and all other values for different shades of gray).
Name | Size | |
---|---|---|
X_test.tsv
md5:12f731f0ac1b3197363ae04c555ee704 |
43.8 MB | Download |
X_train.tsv
md5:946db18725bfdeee7f079706635b5595 |
87.8 MB | Download |
y_test.tsv
md5:ffb3f9bb2640bd078691c51a03416ed2 |
12.0 kB | Download |
y_train.tsv
md5:52e85ce2d279b0dec03f4168959fcb3f |
24.0 kB | Download |
All versions | This version | |
---|---|---|
Views | 334 | 334 |
Downloads | 603 | 603 |
Data volume | 21.5 GB | 21.5 GB |
Unique views | 300 | 300 |
Unique downloads | 198 | 198 |