GreenDB: A Product-by-Product Sustainability Database
Creators
- 1. Berliner Hochschule für Technik
Description
The publicly available open GreenDB is a product-by-product sustainability database. It contains product attributes, e.g., name, description, color, etc. and, on the other hand, information about the products' sustainability. Moreover, the sustainability information is transparently evaluated so that it is possible to rank products depending on their sustainability.
Source Code: https://github.com/calgo-lab/green-db
Further Notes
For usability, we export two datatypes:
- CSV
- Parquet (using pyarrowpython package)
Be careful with the columns' data types when using the CSV files! Use for example:
from ast import literal_eval
from datetime import datetime
import pandas as pd
products = pd.read_csv("products.csv", converters={"timestamp": datetime.fromisoformat, "image_urls": literal_eval, "sustainability_labels": literal_eval}).convert_dtypes()
sustainability_labels = pd.read_csv("sustainability_labels.csv", converters={"timestamp": datetime.fromisoformat}).convert_dtypes()
Notes
            
              Files
              
            
          
          Additional details
References
- Jäger, S., Greene, J., Jakob, M., Korenke, R., Santarius, T., & Biessmann, F. (2022). GreenDB: Toward a Product-by-Product Sustainability Database. ArXiv, abs/2205.02908.