Planned intervention: On Thursday 19/09 between 05:30-06:30 (UTC), Zenodo will be unavailable because of a scheduled upgrade in our storage cluster.
Published October 1, 2019 | Version v1
Conference paper Open

Bayesian Gaussian mixture model for robotic policy imitation

  • 1. Idiap Research Institute, EPFL

Description

A common approach to learn robotic skills is to imitate a demonstrated policy. Due to the compounding of small errors and perturbations, this approach may let the robot leave the states in which the demonstrations were provided. This requires the consideration of additional strategies to guarantee that the robot will behave appropriately when facing unknown states. We propose to use a Bayesian method to quantify the action uncertainty at each state. The proposed Bayesian method is simple to set up, computationally efficient, and can adapt to a wide range of problems. Our approach exploits the estimated uncertainty to fuse the imitation policy with additional policies. It is validated on a Panda robot with the imitation of three manipulation tasks in the continuous domain using different control input/state pairs.

Files

bgmm.pdf

Files (1.6 MB)

Name Size Download all
md5:63430a540d6c42e91bb255ef4ef7cb57
1.6 MB Preview Download

Additional details

Funding

MEMMO – Memory of Motion 780684
European Commission
CoLLaboratE – Co-production CeLL performing Human-Robot Collaborative AssEmbly 820767
European Commission