Dataset Restricted Access

GreenDB: A Product-by-Product Sustainability Database

Jäger, Sebastian; Bießmann, Felix; Flick, Alexander; Sanchez Garcia, Jessica Adriana; von den Driesch, Kaspar; Brendel, Karl

GreenDB is a publicly available, product-by-product sustainability database. It contains product attributes (e.g., name, description, color) as well as information about the products' sustainability. Moreover, the sustainability information is transparently evaluated, which makes it possible to rank products by their sustainability.

Source Code: https://github.com/calgo-lab/green-db 

Further Notes

For usability, we export the data in two file formats:

  1. CSV
  2. Parquet (written with the pyarrow Python package)

Be careful with the columns' data types when using the CSV files, since CSV stores every value as text. For example:

from ast import literal_eval
from datetime import datetime

import pandas as pd

products = pd.read_csv("products.csv", converters={"timestamp": datetime.fromisoformat, "image_urls": literal_eval, "sustainability_labels": literal_eval}).convert_dtypes()
sustainability_labels = pd.read_csv("sustainability_labels.csv", converters={"timestamp": datetime.fromisoformat}).convert_dtypes()
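To illustrate why the converters matter, the following minimal sketch round-trips a tiny, made-up table through CSV in memory. The frame and its values are hypothetical stand-ins, not actual GreenDB data; only the column name image_urls is taken from the snippet above.

```python
from ast import literal_eval
from io import StringIO

import pandas as pd

# Hypothetical miniature stand-in for products.csv; the values are made up.
original = pd.DataFrame(
    {
        "name": ["shirt", "jeans"],
        "image_urls": [["https://example.com/a.jpg"], []],
    }
)

# Serialize to CSV text in memory, the way a CSV export would.
csv_text = original.to_csv(index=False)

# Without converters, the list column comes back as plain strings.
naive = pd.read_csv(StringIO(csv_text))

# With literal_eval, each cell is parsed back into a Python list.
parsed = pd.read_csv(StringIO(csv_text), converters={"image_urls": literal_eval})

print(type(naive.loc[0, "image_urls"]).__name__)   # str
print(type(parsed.loc[0, "image_urls"]).__name__)  # list
```

Parquet stores column types in the file itself, so this kind of manual conversion should generally be unnecessary when loading the Parquet export.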

 

Supported by the Federal Ministry for the Environment, Nature Conservation and Nuclear Safety based on a decision of the German Bundestag. Funding code (Förderkennzeichen): 67KI2022B

Restricted Access

You may request access to the files in this upload, provided that you fulfil the conditions below. The decision whether to grant/deny access is solely under the responsibility of the record owner.


For research purposes only.


  • Jäger, S., Greene, J., Jakob, M., Korenke, R., Santarius, T., & Biessmann, F. (2022). GreenDB: Toward a Product-by-Product Sustainability Database. ArXiv, abs/2205.02908.
