Published September 26, 2023 | Version 1.0.0
Dataset Open

Domain-adaptive Data Synthesis for Large-scale Supermarket Product Recognition

  • 1. TU Wien, Computer Vision Lab

Description

Domain-Adaptive Data Synthesis for Large-Scale Supermarket Product Recognition

This repository contains the data synthesis pipeline and synthetic product recognition datasets proposed in [1].

Data Synthesis Pipeline:

We provide the Blender 3.1 project files and Python source code of our data synthesis pipeline pipeline.zip, accompanied by the FastCUT models used for synthetic-to-real domain translation models.zip. For the synthesis of new shelf images, a product assortment list and product images must be provided in the corresponding directories products/assortment/ and products/img/. The pipeline expects product images to follow the naming convention c.png, with c corresponding to a GTIN or generic class label (e.g., 9120050882171.png). The assortment list, assortment.csv, is expected to use the sample format [c, w, d, h], with c being the class label and w, d, and h being the packaging dimensions of the given product in mm (e.g., [4004218143128, 140, 70, 160]). The assortment list to use and the number of images to generate can be specified in generateImages.py (see comments). The rendering process is initiated by either executing load.py from within Blender or within a command-line terminal as a background process. 

Datasets:

  • SG3k - Synthetic GroZi-3.2k (SG3k) dataset, consisting of 10,000 synthetic shelf images with 851,801 instances of 3,234 GroZi-3.2k products. Instance-level bounding boxes and generic class labels are provided for all product instances.
  • SG3kt - Domain-translated version of SGI3k, utilizing GroZi-3.2k as the target domain. Instance-level bounding boxes and generic class labels are provided for all product instances.
  • SGI3k - Synthetic GroZi-3.2k (SG3k) dataset, consisting of 10,000 synthetic shelf images with 838,696 instances of 1,063 GroZi-3.2k products. Instance-level bounding boxes and generic class labels are provided for all product instances.
  • SGI3kt - Domain-translated version of SGI3k, utilizing GroZi-3.2k as the target domain. Instance-level bounding boxes and generic class labels are provided for all product instances.
  • SPS8k - Synthetic Product Shelves 8k (SPS8k) dataset, comprised of 16,224 synthetic shelf images with 1,981,967 instances of 8,112 supermarket products. Instance-level bounding boxes and GTIN class labels are provided for all product instances.
  • SPS8kt - Domain-translated version of SPS8k, utilizing SKU110k as the target domain. Instance-level bounding boxes and GTIN class labels for all product instances.

Table 1: Dataset characteristics. 

Dataset #images #products #instances   labels     translation
SG3k 10,000 3,234 851,801 bounding box & generic class¹ none
SG3kt 10,000 3,234 851,801 bounding box & generic class¹ GroZi-3.2k
SGI3k 10,000 1,063 838,696 bounding box & generic class² none
SGI3kt 10,000 1,063 838,696 bounding box & generic class² GroZi-3.2k
SPS8k 16,224 8,112 1,981,967 bounding box & GTIN none
SPS8kt 16,224 8,112 1,981,967 bounding box & GTIN SKU110k

 

Sample Format

A sample consists of an RGB image (i.png) and an accompanying label file (i.txt), which contains the labels for all product instances present in the image. Labels use the YOLO format [c, x, y, w, h].

¹SG3k and SG3kt use generic pseudo-GTIN class labels, created by combining the GroZi-3.2k food product category number i (1-27) with the product image index j (j.jpg), following the convention i0000j (e.g., 13000097).

²SGI3k and SGI3kt use the generic GroZi-3.2k class labels from https://arxiv.org/abs/2003.06800.

Download and Use
This data may be used for non-commercial research purposes only. If you publish material based on this data, we request that you include a reference to our paper [1].

[1] Strohmayer, Julian, and Martin Kampel. "Domain-Adaptive Data Synthesis for Large-Scale Supermarket Product Recognition." International Conference on Computer Analysis of Images and Patterns. Cham: Springer Nature Switzerland, 2023.

BibTeX citation:

@inproceedings{strohmayer2023domain,
  title={Domain-Adaptive Data Synthesis for Large-Scale Supermarket Product Recognition},
  author={Strohmayer, Julian and Kampel, Martin},
  booktitle={International Conference on Computer Analysis of Images and Patterns},
  pages={239--250},
  year={2023},
  organization={Springer}
}

Files

models.zip

Files (54.5 GB)

Name Size Download all
md5:1a2f6e30146a39c2ca92d7395640a676
163.8 MB Preview Download
md5:5390a74f4bf0ca43964aa99396cf68d2
65.2 MB Preview Download
md5:70b17e95473df180446c96a1bb0861c9
8.4 GB Preview Download
md5:33412605a1791f36ac43310261f4fdf7
7.9 GB Preview Download
md5:c631f840953678b6ff8acf3e42b0e2d9
7.5 GB Preview Download
md5:ead0caa57bdc3df28c12f792033c1636
7.0 GB Preview Download
md5:53f9bca3e69dd425a89267a2a4237450
11.9 GB Preview Download
md5:f6a8504861b861ee7b03ec9342d69412
11.6 GB Preview Download

Additional details