BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks Jun 18, 2024 โข 43
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework Paper โข 2405.11143 โข Published May 20, 2024 โข 36
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper โข 2501.03262 โข Published 8 days ago โข 71