https://arxiv.org/abs/2302.08215
Aligning Language Models with Preferences through f-divergence Minimization (Dongyoung Go, Tomasz Korbak, Germán Kruszewski, Jos Rozen, Nahyeon Ryu, Marc Dymetman)
https://arxiv.org/abs/2302.08215
Aligning Language Models with Preferences through f-divergence Minimization (Dongyoung Go, Tomasz Korbak, Germán Kruszewski, Jos Rozen, Nahyeon Ryu, Marc Dymetman)