Published February 27, 2019 | Version 1.0.0
Dataset Open

YCB-M: A Multi-Camera RGB-D Dataset for Object Recognition and 6DoF Pose Estimation

Description

While a great variety of 3D cameras have been introduced in recent years, most publicly available datasets for object recognition and pose estimation focus on one single camera.  This dataset consists of 32 scenes that have been captured by 7 different 3D cameras, totaling 49,294 frames. This allows evaluating the sensitivity of pose estimation algorithms to the specifics of the used camera and the development of more robust algorithms that are more independent of the camera model. Vice versa, our dataset enables researchers to perform a quantitative comparison of the data from several different cameras and depth sensing technologies and evaluate their algorithms before selecting a camera for their specific task. The scenes in our dataset contain 20 different objects from the common benchmark YCB object and model set. We provide full ground truth 6DoF poses for each object, per-pixel segmentation, 2D and 3D bounding boxes and a measure of the amount of occlusion of each object.

If you use this dataset in your research, please cite the following publication:

T. Grenzdörffer, M. Günther, and J. Hertzberg, “YCB-M: A Multi-Camera RGB-D Dataset for Object Recognition and 6DoF Pose Estimation,” in 2020 IEEE International Conference on Robotics and Automation, ICRA 2020, Paris, France, May 31-June 4, 2020. IEEE, 2020.

@InProceedings{Grenzdoerffer2020ycbm,
  title = {{YCB-M}: A Multi-Camera {RGB-D} Dataset for Object Recognition and {6DoF} Pose Estimation},
  author = {Grenzd{\"{o}}rffer, Till and G{\"{u}}nther, Martin and Hertzberg, Joachim},
  booktitle = {2020 {IEEE} International Conference on Robotics and Automation, {ICRA} 2020, Paris, France, May 31-June 4, 2020},
  year = {2020},
  publisher = {{IEEE}}
}

This paper is also available on arXiv: https://arxiv.org/abs/2004.11657

 

To visualize the dataset, follow these instructions (tested on Ubuntu Xenial 16.04):

# IMPORTANT: the ROS setup.bash must NOT be sourced, otherwise the following error occurs:
# ImportError: /opt/ros/kinetic/lib/python2.7/dist-packages/cv2.so: undefined symbol: PyCObject_Type

# nvdu requires Python 3.5 or 3.6
sudo add-apt-repository -y ppa:deadsnakes/ppa   # to get python3.6 on Ubuntu Xenial
sudo apt-get update
sudo apt-get install -y python3.6 libsm6 libxext6 libxrender1 python-virtualenv python-pip

# create a new virtual environment
virtualenv -p python3.6 venv_nvdu
cd venv_nvdu/
source bin/activate

# clone our fork of NVIDIA's Dataset Utilities that incorporates some essential fixes
pip install -e 'git+https://github.com/mintar/Dataset_Utilities.git#egg=nvdu'

# download and transform the meshes
# (alternatively, unzip the meshes contained in the dataset
# to <path to venv_nvdu>/lib/python3.6/site-packages/nvdu/data/ycb/aligned_cm)
nvdu_ycb -s

# run nvdu_viz to visualize the dataset
cd <a subdirectory of the YCB-M dataset with some frames>
nvdu_viz --name_filters '*.jpg'

For further details, see README.md.

Files

README.md

Files (15.8 GB)

Name Size Download all
md5:24d920d2764b5ddecb7da25723f3d2f6
1.5 GB Download
md5:4a379704d79fb4816cbe1f8de6cb0690
49.3 MB Download
md5:191bee1c047450bb1ee21d539f90d39d
1.5 GB Download
md5:cf2ffe1784f8b3c778cf214ea9e2b91f
60.8 MB Download
md5:fa051a087d5ffab56a7d7d5663a65465
5.2 GB Download
md5:7bd06c2d7d84b87fd01d9432329a1465
45.7 MB Download
md5:b0e048e1e711693290e96f3796bce527
3.1 GB Download
md5:bb30676e953f09f83cbab8cf0fb5dc56
53.8 MB Download
md5:888fd9ed7fde091a78ee3f210cc97c43
933.7 MB Download
md5:bb2080ffa4360802c6ce264f87c484ca
36.3 MB Download
md5:bcce826fc2521e6940a46bdd4acbab51
13.2 kB Preview Download
md5:d5bb0f71965857948405ba906ca224cd
1.7 GB Download
md5:e82f1168d0226eb683f93a4b5cfecf1a
39.6 MB Download
md5:46fa09b31f5116661993941c5d5f3962
1.4 GB Download
md5:669150aa1fc2bc5fbcbba64ca34f31fc
50.0 MB Download
md5:c060d71c274bcc0f8d1f6ed95e3145a3
152.1 MB Download
md5:86249eed973a94c1947c4693f002671f
9.5 MB Preview Download