Tom Long's picture

1

Tom Long

tom1ong

AI & ML interests

None yet

Recent Activity

reacted to lewtun's post with 🧠 4 days ago

I was initially pretty sceptical about Meta's Coconut paper [1] because the largest perf gains were reported on toy linguistic problems. However, these results on machine translation are pretty impressive! https://x.com/casper_hansen_/status/1875872309996855343 Together with the recent PRIME method [2] for scaling RL, reasoning for open models is looking pretty exciting for 2025! [1] https://huggingface.co/papers/2412.06769 [2] https://huggingface.co/blog/ganqu/prime

reacted to lewtun's post with 🔥 4 days ago

I was initially pretty sceptical about Meta's Coconut paper [1] because the largest perf gains were reported on toy linguistic problems. However, these results on machine translation are pretty impressive! https://x.com/casper_hansen_/status/1875872309996855343 Together with the recent PRIME method [2] for scaling RL, reasoning for open models is looking pretty exciting for 2025! [1] https://huggingface.co/papers/2412.06769 [2] https://huggingface.co/blog/ganqu/prime

View all activity

Organizations

None yet

models

None public yet

datasets

None public yet