Published March 21, 2018 | Version 1.0
Dataset Open

PlantVillage Disease Classification Challenge - Color Images

Creators

  • 1. Beijing Forestry University

Contributors

  • 1. Beijing Forestry University

Description


This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 United States License.

# Data origins
The dataset is originally hosted at PlantVillage Disease Classification Challenge.
We use the modified version in this github repository to do controlled experiments.
We only use the raw color images dataset and delete the unconventional characters in the classes directory name and `.csv` filenames.

# Directory explanation
The `80-20` direcotry has multiple `.txt` files which contain the training (~80%), validation(~10%) and testing (~10%) datasets instances filenames and the corresponding label indexes. The validation dataset quantity is `5430` in all data separation. In our experiment code (not included in this archive), the validation and testing dataset are merged together.

# Data usage
## Replicate our experiments
We have used this dataset in writing our paper. The reference information can be seen at https://gitlab.com/huix/leaf-disease-plant-village.

### Steps
1. `cd` to the direcotry (e.g. `/home/usrname/plantvillage_deeplearning_paper_dataset`) that contains the `color` directory.
2. run `python change_filename_prefix.py --prefix /home/usrname/plantvillage_deeplearning_paper_dataset` to modify the prefix path (which is `/home/h/plantvillage_deeplearning_paper_dataset` in our former generated datasets).
3. Fin. You can use our opens ource codes repository to do the later experiments.

## Generate your own training/validation/testing datasets
This data separation generating code isn't included in the dataset archive, it is in our open source code. Please see our open source code repository for the detailed information.
If you have any questions, you can contact the author through email.
The email address is a QR code in the archive.

Notes

https://gitlab.com/huix/leaf-disease-plant-village

Files

Files (824.2 MB)

Name Size Download all
md5:973b7c7b498f432d11e50b5620851b1e
824.2 MB Download

Additional details

References

  • Hughes, D.P., Salathe, M.: An open access repository of images on plant health to enable the development of mobile disease diagnostics. ArXiv e-prints (2015). 1511.08060
  • Mohanty, S.P., Hughes, D.P., Salathé, M.: Using deep learning for image-based plant disease detection. Frontiers in Plant Science 7, 1419 (2016). doi:10.3389/fpls.2016.01419