Published October 28, 2020 | Version v1
Conference paper Open

A Cross-Modal Variational Framework For Food Image Analysis

  • 1. Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece

Description

Food analysis resides at the core of modern nutrition recommender systems, providing the foundation for a high-level understanding of users’ eating habits. This paper focuses on the sub-task of ingredient recognition from food images using a variational framework. The framework consists of two variational encoder-decoder branches, aimed at processing information from different modalities (images and text), as well as a variational mapper branch, which accomplishes the task of aligning the distributions of the individual branches. Experimental results on the Yummly-28K data-set showcase that the proposed framework performs better than similar variational frameworks, while it surpasses current state-of-the-art approaches on the large-scale Recipe1M data-set.

Files

A Cross-Modal Variational Framework For Food Image Analysis.pdf

Files (410.7 kB)

Additional details

Funding

European Commission
PROTEIN – PeRsOnalized nutriTion for hEalthy livINg 817732