Published July 15, 2016 | Version v1
Conference paper Open

3D Object Recognition Based on Volumetric Representation Using Convolutional Neural Networks

  • 1. Movidius Ltd., 1st Floor, O’Connell Br House, D’Olier St, Dublin, Ireland
  • 2. Trinity College Dublin, College Green, Dublin, Ireland

Description

Following the success of Convolutional Neural Networks on object recognition and image classification using 2D images; in this work the framework has been extended to process 3D data. However, many current systems require huge amount of computation cost for dealing with large amount of data. In this work, we introduce an efficient 3D volumetric representation for training and testing CNNs and we also build several datasets based on the volumetric representation of 3D digits, different rotations along the x, y and z axis are also taken into account. Unlike the normal volumetric representation, our datasets are much less memory usage. Finally, we introduce a model based on the combination of CNN models, the structure of the model is based on the classical LeNet. The accuracy result achieved is beyond the state of art and it can classify a 3D digit in around 9 ms.

Notes

This research was founded by EC H2020-ICT-2014-1 GA: 643924.

Files

ID_12_3DObj_AMDO2016_Springer_1.pdf

Files (11.4 MB)

Name Size Download all
md5:4e20aa0651dd7463b4bb91d143e58294
11.4 MB Preview Download

Additional details

Funding

European Commission
EoT - Eyes of Things 643924