Published July 24, 2023 | Version v1
Presentation · Open access

Reinforcement Learning from Human Feedback: A Tutorial at ICML 2023

  • 1. Hugging Face
  • 2. Toloka
  • 1. Toloka
  • 2. Yandex
  • 3. Hugging Face

Description

Reinforcement learning from human feedback (RLHF) has dramatically improved the real-world performance and user experience of large machine learning models. Still, this approach has primarily been applied at a scale of compute and data curation that limits academic availability. In this tutorial, we will describe the general framework of RLHF and explain the technical procedures required to apply this framework. The tutorial begins with a detailed conceptual overview and continues with an explanation of human-in-the-loop data collection procedures used when scaling state-of-the-art systems.
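The RLHF framework described above typically begins by training a reward model on pairs of responses ranked by human annotators. As an illustration only (this sketch is not taken from the tutorial slides), the standard pairwise Bradley–Terry objective scores the human-preferred response higher than the rejected one:

```python
import math

def preference_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Pairwise Bradley-Terry loss commonly used for RLHF reward models:
    -log(sigmoid(r_chosen - r_rejected)). The loss is small when the
    reward model assigns a higher score to the human-preferred response."""
    margin = reward_chosen - reward_rejected
    # Numerically stable form of -log(sigmoid(margin)): log(1 + exp(-margin))
    return math.log1p(math.exp(-margin))

# A reward model that ranks the preferred response higher incurs low loss:
low = preference_loss(reward_chosen=2.0, reward_rejected=0.0)
# Ranking the rejected response higher incurs high loss:
high = preference_loss(reward_chosen=0.0, reward_rejected=2.0)
```

The trained reward model is then used as the optimization target for a reinforcement learning algorithm (commonly PPO) that fine-tunes the language model policy.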

Files

ICML2023-RLHF-Tutorial.pdf (21.4 MB)
md5:5be2cee9234f88f4b80ea03ce08412cd
