Published February 3, 2023 | Version v1
Dataset Open

Wildlife MNIST

Description

The Wildlife MNIST dataset contains MNIST digits with colored backgrounds and foregrounds with annotations, suitable for benchmarking disentangling or factor identification. Originally used for the project https://github.com/vitskvara/sgad. There are two versions - non-mixed and mixed. In the non-mixed version (data.npy and label.npy), the background and foreground textures are the same for all digits of a single MNIST class, therefore only a single label describes each sample. In the mixed version (data_test.npy and labels_test.npy), each sample image has a random digit, background and foreground (out of 10 classes for each factor of variation). Then, the label is a tuple of three numbers, describing the individual (digit,background,foreground) labels. Note that the data is scaled to the interval [-1,1], so rescaling them by computing "x*0.5 + 0.5" is necessary for some applications that require them to be in the interval [0,1]. Example images from both versions of the dataset are included. Note that the dataset was originally used in "Sauer, Axel, and Andreas Geiger. Counterfactual generative networks. 2021."

Notes

https://github.com/vitskvara/sgad

Files

test_background.png

Files (1.5 GB)

Name Size Download all
md5:13051cffd22f5758783e870f0c17ac1e
737.3 MB Download
md5:675cb23dfcfe8b45231eda1a6d2d1753
737.3 MB Download
md5:a594865790cca5d332423eed56c45a6a
480.1 kB Download
md5:b15b978ced86bb09d66f9732ca90ce5b
1.4 MB Download
md5:bc545770c1fb50c7256c9b5b4286f35e
323.2 kB Preview Download
md5:c9948968f1e81d35c265e0e44d1a00cf
336.3 kB Preview Download
md5:71c4123fa0304d7ad0a7747282a7245f
337.7 kB Preview Download
md5:83971d5b613dfab274691b51c56f1818
323.4 kB Preview Download