Zenodo.org will be unavailable for 2 hours on September 29th from 06:00-08:00 UTC. See announcement.

Dataset Open Access

Image classification in Galaxy with MNIST handwritten digits dataset

Kaivan Kamali

Credit: Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. "Gradient-based learning applied to document recognition." Proceedings of the IEEE, 86(11):2278-2324, November 1998

This is a subset of MNIST handwritten digits dataset (http://yann.lecun.com/exdb/mnist/). Training data of composed of 12,000 images of digits 0 to 9. Test data is composed of 6,000 images of digits 0 to 9 (Original dataset has 60,000 training and 10,000 testing images. We are using a subset for a Galaxy tutorial, so the training is not too computationally intensive). Images are grayscale and 28 by 28 pixels. Each pixel has a value between 0 and 255 (0 for color black, 255 for color white, and all other values for different shades of gray).   

Credit: Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. "Gradient-based learning applied to document recognition." Proceedings of the IEEE, 86(11):2278-2324, November 1998 http://yann.lecun.com/exdb/mnist/
Files (131.6 MB)
Name Size
X_test.tsv
md5:12f731f0ac1b3197363ae04c555ee704
43.8 MB Download
X_train.tsv
md5:946db18725bfdeee7f079706635b5595
87.8 MB Download
y_test.tsv
md5:ffb3f9bb2640bd078691c51a03416ed2
12.0 kB Download
y_train.tsv
md5:52e85ce2d279b0dec03f4168959fcb3f
24.0 kB Download
334
603
views
downloads
All versions This version
Views 334334
Downloads 603603
Data volume 21.5 GB21.5 GB
Unique views 300300
Unique downloads 198198

Share

Cite as