Fine-Grained Human Feedback Gives Better Rewards for Language Model Training

Published June 5, 2023 | Version v2

Dataset Open

QA-Feedback used in the paper: Fine-Grained Human Feedback Gives Better Rewards for Language Model Training

Files

Name	Size	Download all
dev.json md5:24c35337c514c0322888be9753a6aba6	3.3 MB	Preview Download
dev_feedback.json md5:25f5c85fe1f040e49ecd0bfa6d88c2c0	3.3 MB	Preview Download
README.md md5:c06fb0601640f1915e8318debf3716e7	2.5 kB	Preview Download
test.json md5:666c1bcba0b9fa1cc28012576ae08a2f	6.2 MB	Preview Download
train.json md5:1f16269b6fcbf73a7bcacbe967336849	25.7 MB	Preview Download
train_1k.json md5:fcace67673438ec8c8cafd516dd9f1d2	6.8 MB	Preview Download
train_feedback.json md5:4949f2fd889057498b0871cd4efef5bb	19.0 MB	Preview Download

863

Views

468

Downloads

Show more details

DOI

Resource type

Dataset

Publisher

Zenodo

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more