arxiv:2501.03262
Jian Hu
chuyi777
AI & ML interests
Reinforcement Learning
Recent Activity
upvoted
a
paper
3 days ago
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language
Models
commented
a paper
3 days ago
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language
Models
liked
a dataset
8 days ago
AI-MO/NuminaMath-CoT