Published January 22, 2023 | Version v2
Dataset Open

Leeds Butterfly Dataset

  • 1. Imperial College London

Description

This dataset contains images and textual descriptions for ten categories (species) of butterflies. More specifically, it contains:

  • Images for ten butterfly categories
  • Segmentation masks for each image
  • Textual descriptions for each butterfly category

The image dataset comprises 832 images in total, with the distribution ranging from 55 to 100 images per category. Images were collected from Google Images by querying with the scientific (Latin) name of the species, for example "Danaus plexippus", and manually filtered for those depicting the butterfly of interest. 

The textual descriptions for each butterfly category were obtained from the eNature online nature guide back in 2008 (website no longer available).

Please refer to our paper for a more detailed description of the dataset:

Josiah Wang, Katja Markert, Mark Everingham (2009). Learning Models for Object Recognition from Natural Language Descriptions. In Proceedings of the 20th British Machine Vision Conference (BMVC2009), September 2009. Also see the video recording of the oral presentation.

Files

leedsbutterfly_dataset_v1.1.zip

Files (476.2 MB)

Name Size Download all
md5:f2db2046c6ef4fb5bf83cabb9a7144ba
476.2 MB Preview Download

Additional details

Related works

Is cited by
Conference paper: 10.5244/C.23.2 (DOI)