Planned intervention: On Wednesday June 26th 05:30 UTC Zenodo will be unavailable for 10-20 minutes to perform a storage cluster upgrade.

There is a newer version of the record available.

Published October 19, 2022 | Version 0.2.2
Dataset Restricted

GreenDB: A Product-by-Product Sustainability Database


The publicly available open GreenDB is a product-by-product sustainability database. It contains product attributes, e.g., namedescriptioncolor, etc. and, on the other hand, information about the products' sustainability. Moreover, the sustainability information is transparently evaluated so that it is possible to rank products depending on their sustainability.

Source Code: 

Further Notes

For usability, we export two datatypes:

  1. CSV
  2. Parquet (using pyarrow python package)

Be careful with the columns' data types when using the CSV files! Use for example:

from ast import literal_eval
from datetime import datetime

import pandas as pd

products = pd.read_csv("products.csv", converters={"timestamp": datetime.fromisoformat, "image_urls": literal_eval, "sustainability_labels": literal_eval}).convert_dtypes()
sustainability_labels = pd.read_csv("sustainability_labels.csv", converters={"timestamp": datetime.fromisoformat}).convert_dtypes()



Supported by the Federal Ministry for the Environment, Nature Conservation and Nuclear Safety based on a decision of the German Bundestag. Förderkennzeichen: 67KI2022B



The record is publicly accessible, but files are restricted to users with access.

Request access

If you would like to request access to these files, please fill out the form below.

You need to satisfy these conditions in order for this request to be accepted:

For research purposes only.

You are currently not logged in. Do you have an account? Log in here

Additional details


  • Jäger, S., Greene, J., Jakob, M., Korenke, R., Santarius, T., & Biessmann, F. (2022). GreenDB: Toward a Product-by-Product Sustainability Database. ArXiv, abs/2205.02908.