A conceptual framework of computations in mid-level vision

doi:10.5281/zenodo.11063

Published July 31, 2014 | Version v1

Poster Open

A conceptual framework of computations in mid-level vision

1. KU Leuven

In this poster, I describe a conceptual framework of mid-level vision that relies on three key ideas. First, I argue that the visual system relies on intermediate representations of inputs for performing multiple tasks. Such representations are based on a largely pre-semantic (prior to categorization) image segmentation into surfaces, which may be partially overlapping to compensate for occlusions and are ordered in depth, and are refined iteratively across the visual hierarchy. Second, I propose that intermediate representations could be formed by computing similarity between features in local image patches, followed by pooling of highly-similar units, and repeated several times across the visual hierarchy. Finally, I suggest to use datasets composed of realistically rendered artificial objects and surfaces in order to better understand model’s behavior and obtain informative feedback. To support this approach, I also present results from several experiments in our lab.

Files

kubilius_viihm2014_small.pdf

Files (2.5 MB)

Name	Size	Download all
kubilius_viihm2014_small.pdf md5:f62477b13501ff52bed9663ef9604726	2.5 MB	Preview Download

	All versions	This version
Views	57	57
Downloads	69	69
Data volume	171.7 MB	171.7 MB

A conceptual framework of computations in mid-level vision

Creators

Description

Files

kubilius_viihm2014_small.pdf

Files (2.5 MB)