Converting Image Labels to Meaningful and Information-rich Embeddings
Creators
- 1. Research Centre on Interactive Media, Smart Systems and Emerging Technologies (RISE), Nicosia, Cyprus
Description
A challenge for the computer vision community is understanding the semantics of an image, which would allow for higher-quality image generation based on existing high-level features and better analysis of (semi-)labeled datasets. Categorical labels compress a huge amount of information into a binary value, which conceals valuable high-level concepts from machine learning models. To address this challenge, this paper introduces a method called Occlusion-based Latent Representations (OLR) for converting image labels into meaningful representations that capture a significant amount of data semantics. Besides being information-rich, these representations form a disentangled, low-dimensional latent space in which each image label is encoded as a separate vector. We evaluate the quality of these representations in a series of experiments, whose results suggest that the proposed model can capture data concepts and discover data interrelations.
Files
Image_Labels_Embeddings.pdf (907.7 kB)
md5:c96383c9ae09ac77dacf8b95ea8d4f49