A real scanner data set on sold rice, sugar and milk products
Contributors
Data collector:
Description
This is a collection of scanner data on sales of rice, sugar and milk products in a Polish supermarket over the period from December 2024 to January 2026.
This monthly aggregated data frame has 9 columns and 8090 rows. The variables are as follows:
time - Dates of transactions (Year-Month-Day);
prices - Prices (unit values) of sold products [PLN];
quantities - Quantities of sold products [items];
retID - Unique codes identifying outlets/retailer sale points (the data set contains 4 different retIDs);
description - Descriptions (labels) of sold products (the data set contains 152 different descriptions in Polish);
retailer_code - Retailer codes for product definition (134 retailer codes);
EAN_code - EAN codes (bar codes) for product definition (138 EAN codes);
category - Product categories at the 6-digit COICOP level (4 categories in English);
subcategory - Product subcategories at the 7-digit COICOP level (11 subcategories in English).
This data set is also available in the PriceIndices R package under the name dataRSM.
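As a rough illustration of how the listed columns fit together, the sketch below sums turnover (prices × quantities) per month and category using only the Python standard library. It assumes a comma-delimited file with a header row using the column names above, dates written as YYYY-MM-DD, and a dot as the decimal separator; none of this is confirmed by the description, so adjust `delimiter` and the parsing if the actual file differs. The function name `monthly_turnover` and the sample rows are hypothetical and are not taken from dataRSM.csv.

```python
import csv
import io
from collections import defaultdict


def monthly_turnover(csv_file, delimiter=","):
    """Sum prices * quantities per (month, category).

    csv_file is an open text file or file-like object whose header row
    contains the columns 'time', 'prices', 'quantities' and 'category'.
    The layout is an assumption, not a documented property of the file.
    """
    totals = defaultdict(float)
    for row in csv.DictReader(csv_file, delimiter=delimiter):
        month = row["time"][:7]  # 'YYYY-MM-DD' -> 'YYYY-MM' (assumed format)
        value = float(row["prices"]) * float(row["quantities"])
        totals[(month, row["category"])] += value
    return dict(totals)


# Made-up sample rows in the assumed layout (not real data from dataRSM.csv).
sample = io.StringIO(
    "time,prices,quantities,category\n"
    "2024-12-01,2.50,4,rice\n"
    "2024-12-01,3.00,2,rice\n"
    "2025-01-01,4.00,1.5,sugar\n"
)
result = monthly_turnover(sample)
```

With the real file, the same function could be called as `monthly_turnover(open("dataRSM.csv", newline="", encoding="utf-8"))`, where the encoding is again an assumption.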
Files
| Name | Size | md5 |
|---|---|---|
| dataRSM.csv | 912.7 kB | f4c4dcca0e28f4ad624baa98a30c8914 |
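The md5 value listed for dataRSM.csv can be used to check that a downloaded copy is intact. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `md5_of_file` and `matches_listed_checksum` are hypothetical, and the local file path is whatever the file was saved as.

```python
import hashlib

# Checksum as listed on this page for dataRSM.csv.
EXPECTED_MD5 = "f4c4dcca0e28f4ad624baa98a30c8914"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex md5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path):
    """True if the file at path has the md5 listed for dataRSM.csv."""
    return md5_of_file(path) == EXPECTED_MD5
```

For example, `matches_listed_checksum("dataRSM.csv")` should return `True` for an unmodified download saved under that name in the working directory.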