There is a newer version of this record available.

Dataset Restricted Access

GreenDB: A Product-by-Product Sustainability Database

Jäger, Sebastian; Bießmann, Felix

The publicly available open GreenDB is a product-by-product sustainability database. It contains product attributes (e.g., name, description, color) as well as information about the products' sustainability. Moreover, the sustainability information is transparently evaluated, which makes it possible to rank products by their sustainability.

Source Code: https://github.com/calgo-lab/green-db 

Further Notes

For usability, we export the data in two file formats:

  1. CSV
  2. Parquet (written with the pyarrow Python package)

Be careful with the columns' data types when using the CSV files! Use, for example:

from ast import literal_eval
from datetime import datetime

import pandas as pd

# CSV stores every value as text, so parse timestamps and Python-literal columns explicitly.
products = pd.read_csv(
    "products.csv",
    converters={"timestamp": datetime.fromisoformat, "image_urls": literal_eval, "sustainability_labels": literal_eval},
).convert_dtypes()
sustainability_labels = pd.read_csv(
    "sustainability_labels.csv", converters={"timestamp": datetime.fromisoformat}
).convert_dtypes()


Supported by the Federal Ministry for the Environment, Nature Conservation and Nuclear Safety based on a decision of the German Bundestag. Funding code (Förderkennzeichen): 67KI2022B
Restricted Access

You may request access to the files in this upload, provided that you fulfil the conditions below. The decision whether to grant/deny access is solely under the responsibility of the record owner.

For research purposes only.

                    All versions    This version
Views               1,997           499
Downloads           542             101
Data volume         244.5 GB        1.7 GB
Unique views        1,276           386
Unique downloads    270             70

