Domain-adaptive Data Synthesis for Large-scale Supermarket Product Recognition

Strohmayer, Julian; Kampel, Martin

doi:10.5281/zenodo.7750242

Published September 26, 2023 | Version 1.0.0

Dataset Open

Domain-adaptive Data Synthesis for Large-scale Supermarket Product Recognition

1. TU Wien, Computer Vision Lab

Domain-Adaptive Data Synthesis for Large-Scale Supermarket Product Recognition

This repository contains the data synthesis pipeline and synthetic product recognition datasets proposed in [1].

Data Synthesis Pipeline:

We provide the Blender 3.1 project files and Python source code of our data synthesis pipeline pipeline.zip, accompanied by the FastCUT models used for synthetic-to-real domain translation models.zip. For the synthesis of new shelf images, a product assortment list and product images must be provided in the corresponding directories products/assortment/ and products/img/. The pipeline expects product images to follow the naming convention c.png, with c corresponding to a GTIN or generic class label (e.g., 9120050882171.png). The assortment list, assortment.csv, is expected to use the sample format [c, w, d, h], with c being the class label and w, d, and h being the packaging dimensions of the given product in mm (e.g., [4004218143128, 140, 70, 160]). The assortment list to use and the number of images to generate can be specified in generateImages.py (see comments). The rendering process is initiated by either executing load.py from within Blender or within a command-line terminal as a background process.

Datasets:

SG3k - Synthetic GroZi-3.2k (SG3k) dataset, consisting of 10,000 synthetic shelf images with 851,801 instances of 3,234 GroZi-3.2k products. Instance-level bounding boxes and generic class labels are provided for all product instances.
SG3kt - Domain-translated version of SGI3k, utilizing GroZi-3.2k as the target domain. Instance-level bounding boxes and generic class labels are provided for all product instances.
SGI3k - Synthetic GroZi-3.2k (SG3k) dataset, consisting of 10,000 synthetic shelf images with 838,696 instances of 1,063 GroZi-3.2k products. Instance-level bounding boxes and generic class labels are provided for all product instances.
SGI3kt - Domain-translated version of SGI3k, utilizing GroZi-3.2k as the target domain. Instance-level bounding boxes and generic class labels are provided for all product instances.
SPS8k - Synthetic Product Shelves 8k (SPS8k) dataset, comprised of 16,224 synthetic shelf images with 1,981,967 instances of 8,112 supermarket products. Instance-level bounding boxes and GTIN class labels are provided for all product instances.
SPS8kt - Domain-translated version of SPS8k, utilizing SKU110k as the target domain. Instance-level bounding boxes and GTIN class labels for all product instances.

Table 1: Dataset characteristics.

Dataset	#images	#products	#instances	labels	translation
SG3k	10,000	3,234	851,801	bounding box & generic class¹	none
SG3kt	10,000	3,234	851,801	bounding box & generic class¹	GroZi-3.2k
SGI3k	10,000	1,063	838,696	bounding box & generic class²	none
SGI3kt	10,000	1,063	838,696	bounding box & generic class²	GroZi-3.2k
SPS8k	16,224	8,112	1,981,967	bounding box & GTIN	none
SPS8kt	16,224	8,112	1,981,967	bounding box & GTIN	SKU110k

Sample Format

A sample consists of an RGB image (i.png) and an accompanying label file (i.txt), which contains the labels for all product instances present in the image. Labels use the YOLO format [c, x, y, w, h].

¹SG3k and SG3kt use generic pseudo-GTIN class labels, created by combining the GroZi-3.2k food product category number i (1-27) with the product image index j (j.jpg), following the convention i0000j (e.g., 13000097).

²SGI3k and SGI3kt use the generic GroZi-3.2k class labels from https://arxiv.org/abs/2003.06800.

Download and Use
This data may be used for non-commercial research purposes only. If you publish material based on this data, we request that you include a reference to our paper [1].

[1] Strohmayer, Julian, and Martin Kampel. "Domain-Adaptive Data Synthesis for Large-Scale Supermarket Product Recognition." International Conference on Computer Analysis of Images and Patterns. Cham: Springer Nature Switzerland, 2023.

BibTeX citation:

@inproceedings{strohmayer2023domain,
  title={Domain-Adaptive Data Synthesis for Large-Scale Supermarket Product Recognition},
  author={Strohmayer, Julian and Kampel, Martin},
  booktitle={International Conference on Computer Analysis of Images and Patterns},
  pages={239--250},
  year={2023},
  organization={Springer}
}

Files

models.zip

Files (54.5 GB)

Name	Size	Download all
models.zip md5:1a2f6e30146a39c2ca92d7395640a676	163.8 MB	Preview Download
pipeline.zip md5:5390a74f4bf0ca43964aa99396cf68d2	65.2 MB	Preview Download
SG3k.zip md5:70b17e95473df180446c96a1bb0861c9	8.4 GB	Preview Download
SG3k_t.zip md5:33412605a1791f36ac43310261f4fdf7	7.9 GB	Preview Download
SGI3k.zip md5:c631f840953678b6ff8acf3e42b0e2d9	7.5 GB	Preview Download
SGI3k_t.zip md5:ead0caa57bdc3df28c12f792033c1636	7.0 GB	Preview Download
SPS8k.zip md5:53f9bca3e69dd425a89267a2a4237450	11.9 GB	Preview Download
SPS8k_t.zip md5:f6a8504861b861ee7b03ec9342d69412	11.6 GB	Preview Download

Additional details

Is cited by: Conference paper: 10.1007/978-3-031-44237-7_23 (DOI); Conference paper: https://www.researchgate.net/publication/374047329_Domain-Adaptive_Data_Synthesis_for_Large-Scale_Supermarket_Product_Recognition (URL)

	All versions	This version
Views	489	466
Downloads	282	280
Data volume	2.4 TB	2.4 TB

Domain-adaptive Data Synthesis for Large-scale Supermarket Product Recognition

Creators

Description

Files

models.zip

Files (54.5 GB)

Additional details

Related works