Conference paper Open Access

Attention-enhanced Sensorimotor Object Recognition

Thermos, S; Papadopoulos, GT; Daras, P; Potamianos, G


JSON-LD (schema.org) Export

{
  "inLanguage": {
    "alternateName": "eng", 
    "@type": "Language", 
    "name": "English"
  }, 
  "description": "<p>Sensorimotor learning, namely the process of understanding the physical world by combining visual and motor information, has been recently investigated, achieving promising results for the task of 2D/3D object recognition. Following the recent trend in computer vision, powerful deep neural networks (NNs) have been used to model the &ldquo;sensory&rdquo; and &ldquo;motor&rdquo; information, namely the object appearance and affordance. However, the existing implementations cannot efficiently address the spatio-temporal nature of the humanobject interaction. Inspired by recent work on attention-based learning, this paper introduces an attention-enhanced NN-based model that learns to selectively focus on parts of the physical interaction where the object appearance is corrupted by occlusions and deformations. The model&rsquo;s attention mechanism relies on the confidence of classifying an object based solely on its appearance. Three metrics are used to measure the latter, namely the prediction entropy, the average N-best likelihood difference, and the N-best likelihood dispersion. Evaluation of the attention-enhanced model on the SOR3D dataset reports 33% and 26% relative improvement over the appearance-only and the spatio-temporal fusion baseline models, respectively.</p>", 
  "license": "https://creativecommons.org/licenses/by/4.0/legalcode", 
  "creator": [
    {
      "@type": "Person", 
      "name": "Thermos, S"
    }, 
    {
      "@type": "Person", 
      "name": "Papadopoulos, GT"
    }, 
    {
      "@type": "Person", 
      "name": "Daras, P"
    }, 
    {
      "@type": "Person", 
      "name": "Potamianos, G"
    }
  ], 
  "headline": "Attention-enhanced Sensorimotor Object Recognition", 
  "image": "https://zenodo.org/static/img/logos/zenodo-gradient-round.svg", 
  "datePublished": "2018-10-10", 
  "url": "https://zenodo.org/record/3727849", 
  "version": "pre-print", 
  "@type": "ScholarlyArticle", 
  "keywords": [
    "Sensorimotor object recognition, attention mechanism, stream fusion, deep neural networks"
  ], 
  "@context": "https://schema.org/", 
  "identifier": "https://doi.org/10.1109/ICIP.2018.8451158", 
  "@id": "https://doi.org/10.1109/ICIP.2018.8451158", 
  "workFeatured": {
    "alternateName": "IEEE ICIP 2018", 
    "@type": "Event"
  }, 
  "name": "Attention-enhanced Sensorimotor Object Recognition"
}
24
96
views
downloads
Views 24
Downloads 96
Data volume 74.7 MB
Unique views 21
Unique downloads 95

Share

Cite as