Tom Long
tom1ong
AI & ML interests
None yet
Recent Activity
reacted to lewtun's post with 🧠
4 days ago
I was initially pretty sceptical about Meta's Coconut paper [1] because the largest perf gains were reported on toy linguistic problems. However, these results on machine translation are pretty impressive! https://x.com/casper_hansen_/status/1875872309996855343
Together with the recent PRIME method [2] for scaling RL, reasoning for open models is looking pretty exciting for 2025!
[1] https://huggingface.co/papers/2412.06769
[2] https://huggingface.co/blog/ganqu/prime
reacted to lewtun's post with 🔥
4 days ago
Organizations
None yet
models
None public yet
datasets
None public yet