Releases: lucidrains/PaLM-rlhf-pytorch
0.5.2
Full Changelog: 0.5.1...0.5.2
0.5.1
Full Changelog: 0.4.3...0.5.1
0.4.3
Full Changelog: 0.4.1...0.4.3
0.4.1
Full Changelog: 0.3.7...0.4.1
0.3.9
start wiring up dense rewarding with implicit PRM
0.3.7
get rid of einx for now
0.3.4
take care of variable-length responses for implicit PRM
0.3.3
oops
0.3.2
export
0.3.0
add what may be a tiny breakthrough, which happened earlier last mont…