Unit 2: Training safer models

RLHF and its limits

Resources (15 mins)

Exercises