You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, thanks for uploading the code for pair_pm! Since in the blog, it seems that you are using SLiC for pair_pm models. In the directory of pair_pm, I can't find the code for using slic methods.
The text was updated successfully, but these errors were encountered:
We mention Slic paper because the pair-wise model training was first proposed in this paper. We do not do RLHF in this project. If you are interested in the subsequent RLHF stage, you may check this project https://github.com/RLHFlow/Online-RLHF
Hi, thanks for uploading the code for pair_pm! Since in the blog, it seems that you are using SLiC for pair_pm models. In the directory of pair_pm, I can't find the code for using slic methods.
The text was updated successfully, but these errors were encountered: