YCB-M: A Multi-Camera RGB-D Dataset for Object Recognition and 6DoF Pose Estimation
Description
While a great variety of 3D cameras have been introduced in recent years, most publicly available datasets for object recognition and pose estimation focus on one single camera. This dataset consists of 32 scenes that have been captured by 7 different 3D cameras, totaling 49,294 frames. This allows evaluating the sensitivity of pose estimation algorithms to the specifics of the used camera and the development of more robust algorithms that are more independent of the camera model. Vice versa, our dataset enables researchers to perform a quantitative comparison of the data from several different cameras and depth sensing technologies and evaluate their algorithms before selecting a camera for their specific task. The scenes in our dataset contain 20 different objects from the common benchmark YCB object and model set. We provide full ground truth 6DoF poses for each object, per-pixel segmentation, 2D and 3D bounding boxes and a measure of the amount of occlusion of each object.
If you use this dataset in your research, please cite the following publication:
T. Grenzdörffer, M. Günther, and J. Hertzberg, “YCB-M: A Multi-Camera RGB-D Dataset for Object Recognition and 6DoF Pose Estimation,” in 2020 IEEE International Conference on Robotics and Automation, ICRA 2020, Paris, France, May 31-June 4, 2020. IEEE, 2020.
@InProceedings{Grenzdoerffer2020ycbm,
title = {{YCB-M}: A Multi-Camera {RGB-D} Dataset for Object Recognition and {6DoF} Pose Estimation},
author = {Grenzd{\"{o}}rffer, Till and G{\"{u}}nther, Martin and Hertzberg, Joachim},
booktitle = {2020 {IEEE} International Conference on Robotics and Automation, {ICRA} 2020, Paris, France, May 31-June 4, 2020},
year = {2020},
publisher = {{IEEE}}
}
This paper is also available on arXiv: https://arxiv.org/abs/2004.11657
To visualize the dataset, follow these instructions (tested on Ubuntu Xenial 16.04):
# IMPORTANT: the ROS setup.bash must NOT be sourced, otherwise the following error occurs:
# ImportError: /opt/ros/kinetic/lib/python2.7/dist-packages/cv2.so: undefined symbol: PyCObject_Type
# nvdu requires Python 3.5 or 3.6
sudo add-apt-repository -y ppa:deadsnakes/ppa # to get python3.6 on Ubuntu Xenial
sudo apt-get update
sudo apt-get install -y python3.6 libsm6 libxext6 libxrender1 python-virtualenv python-pip
# create a new virtual environment
virtualenv -p python3.6 venv_nvdu
cd venv_nvdu/
source bin/activate
# clone our fork of NVIDIA's Dataset Utilities that incorporates some essential fixes
pip install -e 'git+https://github.com/mintar/Dataset_Utilities.git#egg=nvdu'
# download and transform the meshes
# (alternatively, unzip the meshes contained in the dataset
# to <path to venv_nvdu>/lib/python3.6/site-packages/nvdu/data/ycb/aligned_cm)
nvdu_ycb -s
# run nvdu_viz to visualize the dataset
cd <a subdirectory of the YCB-M dataset with some frames>
nvdu_viz --name_filters '*.jpg'
For further details, see README.md.
Files
README.md
Files
(15.8 GB)
Name | Size | Download all |
---|---|---|
md5:24d920d2764b5ddecb7da25723f3d2f6
|
1.5 GB | Download |
md5:4a379704d79fb4816cbe1f8de6cb0690
|
49.3 MB | Download |
md5:191bee1c047450bb1ee21d539f90d39d
|
1.5 GB | Download |
md5:cf2ffe1784f8b3c778cf214ea9e2b91f
|
60.8 MB | Download |
md5:fa051a087d5ffab56a7d7d5663a65465
|
5.2 GB | Download |
md5:7bd06c2d7d84b87fd01d9432329a1465
|
45.7 MB | Download |
md5:b0e048e1e711693290e96f3796bce527
|
3.1 GB | Download |
md5:bb30676e953f09f83cbab8cf0fb5dc56
|
53.8 MB | Download |
md5:888fd9ed7fde091a78ee3f210cc97c43
|
933.7 MB | Download |
md5:bb2080ffa4360802c6ce264f87c484ca
|
36.3 MB | Download |
md5:bcce826fc2521e6940a46bdd4acbab51
|
13.2 kB | Preview Download |
md5:d5bb0f71965857948405ba906ca224cd
|
1.7 GB | Download |
md5:e82f1168d0226eb683f93a4b5cfecf1a
|
39.6 MB | Download |
md5:46fa09b31f5116661993941c5d5f3962
|
1.4 GB | Download |
md5:669150aa1fc2bc5fbcbba64ca34f31fc
|
50.0 MB | Download |
md5:c060d71c274bcc0f8d1f6ed95e3145a3
|
152.1 MB | Download |
md5:86249eed973a94c1947c4693f002671f
|
9.5 MB | Preview Download |