GreenDB: A Product-by-Product Sustainability Database
Description
The publicly available open GreenDB is a product-by-product sustainability database. It contains product attributes, e.g., name
, description
, color
, etc. and, on the other hand, information about the products' sustainability. Moreover, the sustainability information is transparently evaluated so that it is possible to rank products depending on their sustainability.
Source Code: https://github.com/calgo-lab/green-db
Further Notes
For usability, we export two datatypes:
- CSV
- Parquet (using
pyarrow
python package)
Be careful with the columns' data types when using the CSV files! Use for example:
from ast import literal_eval
from datetime import datetime
import pandas as pd
products = pd.read_csv("products.csv", converters={"timestamp": datetime.fromisoformat, "image_urls": literal_eval, "sustainability_labels": literal_eval}).convert_dtypes()
sustainability_labels = pd.read_csv("sustainability_labels.csv", converters={"timestamp": datetime.fromisoformat}).convert_dtypes()