Collection of categorized Google Vision API image tags for museums related pictures

doi:10.5281/zenodo.3706462

Published March 11, 2020 | Version v1

Resource type: Other | Access: Open

1. Chung-Ang University
2. University of Milano-Bicocca

This is a list of labels assigned by the Google Vision API to a set of 10,000 images retrieved from Instagram using museum hashtags.

First, we applied computer vision techniques through the Google Cloud Platform Vision API. Its object recognition algorithm returned at most ten tags for each image, for instance ‘Pyramid’, ‘Illustration’, and ‘Person’.

Second, we trained a word2vec model on all image tags to compute their semantic similarity. From the output we identified clusters of similar words, which correspond to similar content in the images: body, food, clothes, music, nature, interior, architecture, museum, animals, and sport.

Third, we edited these data-driven categories and combined them with top-down art categories relevant for museum research, creating the following final list of image types: art exhibition (e.g. performances, events, and graphics), artifact (e.g. sculptures, paintings, and pottery), architecture (e.g. buildings or parts of them, and indoor spaces), selfie (e.g. faces), food, human body (e.g. non-face body parts and people), and landscape (e.g. outdoor spaces and nature).

Fourth, to maximize the number of tags retrieved for each category, we trained word2vec models separately on the tag subsets of 8 different museums and retrieved the 50 most similar tags for each category. We then manually checked all lists to make sure that they include only tags relevant to the respective categories, to resolve overlaps between categories, and to delete ambiguous tags.
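The retrieval of the most similar tags for a category can be illustrated with a minimal sketch. It does not train word2vec; it assumes tag vectors are already available (in practice they would come from a trained model) and ranks tags by cosine similarity to a seed tag. The function names `cosine` and `most_similar`, the three-dimensional `toy_vectors`, and the example tags are all hypothetical and chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def most_similar(seed, vectors, topn=50):
    """Return up to `topn` (tag, similarity) pairs closest to `seed`."""
    seed_vec = vectors[seed]
    scored = [
        (tag, cosine(seed_vec, vec))
        for tag, vec in vectors.items()
        if tag != seed  # the seed itself is not a candidate
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:topn]


# Made-up vectors standing in for embeddings learned from image tags.
toy_vectors = {
    "food": [0.9, 0.1, 0.0],
    "dish": [0.8, 0.2, 0.1],
    "cuisine": [0.85, 0.15, 0.05],
    "building": [0.1, 0.9, 0.2],
    "facade": [0.05, 0.95, 0.1],
}

candidates = most_similar("food", toy_vectors, topn=2)
for tag, score in candidates:
    print(f"{tag}\t{score:.3f}")
```

A ranked list like this is only a set of candidates. As described above, the published lists were checked manually afterwards to remove irrelevant, overlapping, and ambiguous tags.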

This list can be used in combination with the Google Vision API to categorize images: the labels returned for an image are matched against the tags listed for each category.
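One way such a categorization could work is sketched below. The layout of `museum_categories.csv` is not documented here, so the sketch assumes a header row of category names with each column listing that category's tags. The helper names `load_categories` and `categorize`, the majority-vote rule, and the sample rows are hypothetical and are not taken from the published file.

```python
import csv
import io
from collections import Counter


def load_categories(csv_text):
    """Build a tag -> category lookup.

    ASSUMED layout: the header row holds category names and each
    column lists the tags belonging to that category.
    """
    tag_to_category = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        for category, tag in row.items():
            if category is None:  # extra cells beyond the header
                continue
            if tag and tag.strip():
                tag_to_category[tag.strip().lower()] = category.strip()
    return tag_to_category


def categorize(labels, tag_to_category):
    """Return the category matched by most labels, or None if none match."""
    votes = Counter(
        tag_to_category[label.lower()]
        for label in labels
        if label.lower() in tag_to_category
    )
    return votes.most_common(1)[0][0] if votes else None


# Made-up rows in the assumed layout; the real file may differ.
sample_csv = "food,architecture\nDish,Building\nCuisine,Facade\n"
mapping = load_categories(sample_csv)

# Labels as they might be returned for a single image.
category = categorize(["Dish", "Cuisine", "Building"], mapping)
print(category)
```

Majority voting is only one possible rule. An image whose labels match several categories equally often is assigned whichever of them was counted first, so a real pipeline may prefer to keep all matching categories or to weight labels by the confidence score the API returns.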

Files (34.9 kB)

Name	MD5	Size
museum_categories.csv	5675cb1afaf76210e73643f6f9ae1584	11.0 kB
museum_categories.xlsx	e16fa3211795568b7b0aba8a9d098053	23.9 kB

	All versions	This version
Views	104	101
Downloads	61	59
Data volume	960.5 kB	938.4 kB
