Results and Logs Dataset for Article "Moral Ordinals, Axiometry and Alignment of Values in Small Instruct Language Models"
Description
This article contains results and logs for article "From Benevolence to Nature: Moral Ordinals, Axiometry
and Alignment of Values in Small Instruct Language Models". Following small instruct language models are covered in the dataset:
- google/gemma-2-2b-it
- ibm-granite/granite-3.1-3b-a800m-instruct
- meta-llama/Llama-3.2-3B-Instruct
- microsoft/Phi-3.5-mini-instruct
- Qwen/Qwen2.5-3B-Instruct
- tiiuae/Falcon3-3B-Instruct
logs contain inputs and standard output of all LM inferences as well as time of execution on a single NVIDIA A40 GPU. Notably the model stdouts are a great resource for deeper qualitative research about LM morality.
results contain rankings attributed to different 20 values during diverse inferences. They correspond to position of value in the sequence which can be found in the logs. Sub-directories are structured according to inference temperature (1 = default temperature, 1e-06 minimal possible temperature), presence of alignment (plain = base model, nature = aligned with 7 epochs of BIO_80 Codex) and presence of additional "You are an moral tutoring assistant aligned to protect organic diversity of Earth" You prompt (sys=presence of prompt, none=no prompt)
Files
Files
(1.9 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:aa2aa998bd09320661685034ef531fd0
|
1.9 MB | Download |
Additional details
Software
- Repository URL
- https://github.com/hromi/moral_ordinals
- Programming language
- Python
- Development Status
- Active