Domain-adaptive Data Synthesis for Large-scale Supermarket Product Recognition
Description
Domain-Adaptive Data Synthesis for Large-Scale Supermarket Product Recognition
This repository contains the data synthesis pipeline and synthetic product recognition datasets proposed in [1].
Data Synthesis Pipeline:
We provide the Blender 3.1 project files and Python source code of our data synthesis pipeline pipeline.zip, accompanied by the FastCUT models used for synthetic-to-real domain translation models.zip. For the synthesis of new shelf images, a product assortment list and product images must be provided in the corresponding directories products/assortment/ and products/img/. The pipeline expects product images to follow the naming convention c.png, with c corresponding to a GTIN or generic class label (e.g., 9120050882171.png). The assortment list, assortment.csv, is expected to use the sample format [c, w, d, h], with c being the class label and w, d, and h being the packaging dimensions of the given product in mm (e.g., [4004218143128, 140, 70, 160]). The assortment list to use and the number of images to generate can be specified in generateImages.py (see comments). The rendering process is initiated by either executing load.py from within Blender or within a command-line terminal as a background process.
Datasets:
- SG3k - Synthetic GroZi-3.2k (SG3k) dataset, consisting of 10,000 synthetic shelf images with 851,801 instances of 3,234 GroZi-3.2k products. Instance-level bounding boxes and generic class labels are provided for all product instances.
- SG3kt - Domain-translated version of SGI3k, utilizing GroZi-3.2k as the target domain. Instance-level bounding boxes and generic class labels are provided for all product instances.
- SGI3k - Synthetic GroZi-3.2k (SG3k) dataset, consisting of 10,000 synthetic shelf images with 838,696 instances of 1,063 GroZi-3.2k products. Instance-level bounding boxes and generic class labels are provided for all product instances.
- SGI3kt - Domain-translated version of SGI3k, utilizing GroZi-3.2k as the target domain. Instance-level bounding boxes and generic class labels are provided for all product instances.
- SPS8k - Synthetic Product Shelves 8k (SPS8k) dataset, comprised of 16,224 synthetic shelf images with 1,981,967 instances of 8,112 supermarket products. Instance-level bounding boxes and GTIN class labels are provided for all product instances.
- SPS8kt - Domain-translated version of SPS8k, utilizing SKU110k as the target domain. Instance-level bounding boxes and GTIN class labels for all product instances.
Table 1: Dataset characteristics.
Dataset | #images | #products | #instances | labels | translation |
SG3k | 10,000 | 3,234 | 851,801 | bounding box & generic class¹ | none |
SG3kt | 10,000 | 3,234 | 851,801 | bounding box & generic class¹ | GroZi-3.2k |
SGI3k | 10,000 | 1,063 | 838,696 | bounding box & generic class² | none |
SGI3kt | 10,000 | 1,063 | 838,696 | bounding box & generic class² | GroZi-3.2k |
SPS8k | 16,224 | 8,112 | 1,981,967 | bounding box & GTIN | none |
SPS8kt | 16,224 | 8,112 | 1,981,967 | bounding box & GTIN | SKU110k |
Sample Format
A sample consists of an RGB image (i.png) and an accompanying label file (i.txt), which contains the labels for all product instances present in the image. Labels use the YOLO format [c, x, y, w, h].
¹SG3k and SG3kt use generic pseudo-GTIN class labels, created by combining the GroZi-3.2k food product category number i (1-27) with the product image index j (j.jpg), following the convention i0000j (e.g., 13000097).
²SGI3k and SGI3kt use the generic GroZi-3.2k class labels from https://arxiv.org/abs/2003.06800.
Download and Use
This data may be used for non-commercial research purposes only. If you publish material based on this data, we request that you include a reference to our paper [1].
[1] Strohmayer, Julian, and Martin Kampel. "Domain-Adaptive Data Synthesis for Large-Scale Supermarket Product Recognition." International Conference on Computer Analysis of Images and Patterns. Cham: Springer Nature Switzerland, 2023.
BibTeX citation:
@inproceedings{strohmayer2023domain, title={Domain-Adaptive Data Synthesis for Large-Scale Supermarket Product Recognition}, author={Strohmayer, Julian and Kampel, Martin}, booktitle={International Conference on Computer Analysis of Images and Patterns}, pages={239--250}, year={2023}, organization={Springer} }
Files
models.zip
Files
(54.5 GB)
Name | Size | Download all |
---|---|---|
md5:1a2f6e30146a39c2ca92d7395640a676
|
163.8 MB | Preview Download |
md5:5390a74f4bf0ca43964aa99396cf68d2
|
65.2 MB | Preview Download |
md5:70b17e95473df180446c96a1bb0861c9
|
8.4 GB | Preview Download |
md5:33412605a1791f36ac43310261f4fdf7
|
7.9 GB | Preview Download |
md5:c631f840953678b6ff8acf3e42b0e2d9
|
7.5 GB | Preview Download |
md5:ead0caa57bdc3df28c12f792033c1636
|
7.0 GB | Preview Download |
md5:53f9bca3e69dd425a89267a2a4237450
|
11.9 GB | Preview Download |
md5:f6a8504861b861ee7b03ec9342d69412
|
11.6 GB | Preview Download |
Additional details
Related works
- Is cited by
- Conference paper: 10.1007/978-3-031-44237-7_23 (DOI)
- Conference paper: https://www.researchgate.net/publication/374047329_Domain-Adaptive_Data_Synthesis_for_Large-Scale_Supermarket_Product_Recognition (URL)