YCB-M: A Multi-Camera RGB-D Dataset for Object Recognition and 6DoF Pose Estimation

Grenzdörffer, Till; Günther, Martin; Hertzberg, Joachim

doi:10.5281/zenodo.2579173

Published February 27, 2019 | Version 1.0.0

Dataset Open

1. DFKI

While a great variety of 3D cameras have been introduced in recent years, most publicly available datasets for object recognition and pose estimation focus on a single camera. This dataset consists of 32 scenes captured by 7 different 3D cameras, totaling 49,294 frames. This makes it possible to evaluate how sensitive pose estimation algorithms are to the specifics of the camera used, and to develop more robust algorithms that depend less on the camera model. Conversely, the dataset enables researchers to quantitatively compare the data from several different cameras and depth-sensing technologies and to evaluate their algorithms before selecting a camera for a specific task. The scenes contain 20 different objects from the widely used YCB Object and Model Set benchmark. We provide full ground-truth 6DoF poses for each object, per-pixel segmentation, 2D and 3D bounding boxes, and a measure of the amount of occlusion of each object.

If you use this dataset in your research, please cite the following publication:

T. Grenzdörffer, M. Günther, and J. Hertzberg, “YCB-M: A Multi-Camera RGB-D Dataset for Object Recognition and 6DoF Pose Estimation,” in 2020 IEEE International Conference on Robotics and Automation, ICRA 2020, Paris, France, May 31-June 4, 2020. IEEE, 2020.

@InProceedings{Grenzdoerffer2020ycbm,
  title = {{YCB-M}: A Multi-Camera {RGB-D} Dataset for Object Recognition and {6DoF} Pose Estimation},
  author = {Grenzd{\"{o}}rffer, Till and G{\"{u}}nther, Martin and Hertzberg, Joachim},
  booktitle = {2020 {IEEE} International Conference on Robotics and Automation, {ICRA} 2020, Paris, France, May 31-June 4, 2020},
  year = {2020},
  publisher = {{IEEE}}
}

This paper is also available on arXiv: https://arxiv.org/abs/2004.11657

To visualize the dataset, follow these instructions (tested on Ubuntu Xenial 16.04):

# IMPORTANT: the ROS setup.bash must NOT be sourced, otherwise the following error occurs:
# ImportError: /opt/ros/kinetic/lib/python2.7/dist-packages/cv2.so: undefined symbol: PyCObject_Type

# nvdu requires Python 3.5 or 3.6
sudo add-apt-repository -y ppa:deadsnakes/ppa   # to get python3.6 on Ubuntu Xenial
sudo apt-get update
sudo apt-get install -y python3.6 libsm6 libxext6 libxrender1 python-virtualenv python-pip

# create a new virtual environment
virtualenv -p python3.6 venv_nvdu
cd venv_nvdu/
source bin/activate

# clone our fork of NVIDIA's Dataset Utilities that incorporates some essential fixes
pip install -e 'git+https://github.com/mintar/Dataset_Utilities.git#egg=nvdu'

# download and transform the meshes
# (alternatively, unzip the meshes contained in the dataset
# to <path to venv_nvdu>/lib/python3.6/site-packages/nvdu/data/ycb/aligned_cm)
nvdu_ycb -s

# run nvdu_viz to visualize the dataset
cd <a subdirectory of the YCB-M dataset with some frames>
nvdu_viz --name_filters '*.jpg'
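
Since the instructions above warn that a sourced ROS setup.bash breaks the cv2 import, a quick check before creating the virtual environment may save some debugging. The following is a minimal sketch, not part of nvdu: the function name `ros_env_is_sourced` is hypothetical, and it assumes that a sourced ROS environment sets `ROS_DISTRO` and/or adds a path under `/opt/ros/` to `PYTHONPATH`.

```shell
#!/usr/bin/env bash
# Sketch: detect whether a ROS setup.bash appears to be sourced in this shell.
# Assumption: sourcing ROS sets ROS_DISTRO and/or puts /opt/ros/... on PYTHONPATH.
ros_env_is_sourced() {
  # ROS_DISTRO is non-empty (e.g. "kinetic") in a sourced ROS shell
  if [ -n "${ROS_DISTRO:-}" ]; then
    return 0
  fi
  # Look for an /opt/ros/ entry anywhere in the colon-separated PYTHONPATH
  case ":${PYTHONPATH:-}:" in
    *:/opt/ros/*) return 0 ;;
  esac
  return 1
}

if ros_env_is_sourced; then
  echo "WARNING: a ROS environment seems to be sourced; open a fresh shell without it before running nvdu_viz." >&2
fi
```

If the warning appears, starting a new terminal in which setup.bash is not sourced is probably the simplest fix.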

For further details, see README.md.

Files (15.8 GB)

Name	MD5	Size
astra.tar.gz	24d920d2764b5ddecb7da25723f3d2f6	1.5 GB
astra_annotations.tar.gz	4a379704d79fb4816cbe1f8de6cb0690	49.3 MB
basler_tof.tar.gz	191bee1c047450bb1ee21d539f90d39d	1.5 GB
basler_tof_annotations.tar.gz	cf2ffe1784f8b3c778cf214ea9e2b91f	60.8 MB
ensenso.tar.gz	fa051a087d5ffab56a7d7d5663a65465	5.2 GB
ensenso_annotations.tar.gz	7bd06c2d7d84b87fd01d9432329a1465	45.7 MB
kinect2.tar.gz	b0e048e1e711693290e96f3796bce527	3.1 GB
kinect2_annotations.tar.gz	bb30676e953f09f83cbab8cf0fb5dc56	53.8 MB
pico_flexx.tar.gz	888fd9ed7fde091a78ee3f210cc97c43	933.7 MB
pico_flexx_annotations.tar.gz	bb2080ffa4360802c6ce264f87c484ca	36.3 MB
README.md	bcce826fc2521e6940a46bdd4acbab51	13.2 kB
realsense_r200.tar.gz	d5bb0f71965857948405ba906ca224cd	1.7 GB
realsense_r200_annotations.tar.gz	e82f1168d0226eb683f93a4b5cfecf1a	39.6 MB
xtion.tar.gz	46fa09b31f5116661993941c5d5f3962	1.4 GB
xtion_annotations.tar.gz	669150aa1fc2bc5fbcbba64ca34f31fc	50.0 MB
ycb_models_aligned_cm.tar.gz	c060d71c274bcc0f8d1f6ed95e3145a3	152.1 MB
ycbm_multicam_dataset_video.mp4	86249eed973a94c1947c4693f002671f	9.5 MB
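
Because the archives are large, it can be worth checking each download against the MD5 checksums listed above before unpacking. The following is a minimal sketch rather than an official tool: the helper name `verify_md5` is hypothetical, it assumes GNU `md5sum` is available (as on Ubuntu), and it assumes the files were downloaded into the current directory. Only two of the listed files are shown; further name/checksum pairs from the listing can be added in the same way.

```shell
#!/usr/bin/env bash
# verify_md5 FILE EXPECTED_MD5
# Returns 0 on match, 1 on mismatch, 2 if the file does not exist.
verify_md5() {
  file="$1"
  expected="$2"
  if [ ! -f "$file" ]; then
    echo "missing: $file" >&2
    return 2
  fi
  # md5sum prints "<hash>  <filename>"; keep only the hash
  actual="$(md5sum "$file" | awk '{print $1}')"
  if [ "$actual" = "$expected" ]; then
    echo "OK: $file"
  else
    echo "MISMATCH: $file (got $actual, expected $expected)" >&2
    return 1
  fi
}

# Check whichever of the listed files are present in the current directory.
while read -r name md5; do
  if [ -f "$name" ]; then
    verify_md5 "$name" "$md5" || true
  fi
done <<'EOF'
README.md bcce826fc2521e6940a46bdd4acbab51
ycb_models_aligned_cm.tar.gz c060d71c274bcc0f8d1f6ed95e3145a3
EOF
```

A file reported as MISMATCH was most likely truncated or corrupted in transit and should be downloaded again.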
