RLHF: Reinforcement Learning from Human Feedback
Chip Huyen
Incorporating reinforcement learning from human feedback into NLP at massive scale.
Added 8 months ago