5 18 1

Haitham Bou Ammar

hba123

AI & ML interests

LLMs, VLMs, Robotics, Reinforcement Learning, Bayesian Optimisation

Recent Activity

reacted to their post with 🚀 3 days ago

I have some New Year presents for you, #MachineLearning and #AI community! We just opened our code for new state-of-the-art results that beat EAGLE-2 and Medusa #LLM inference. We also shared the model check pt on @huggingface! @MatthieuZ Check the blog out: https://huggingface.co/blog/hba123/sotaspeculativedecoding

upvoted an article 4 days ago

Accelerating Language Model Inference with Mixture of Attentions

posted an update 4 days ago

View all activity

Articles

Accelerating Language Model Inference with Mixture of Attentions

4 days ago

• 24

Deriving DPO's Loss

18 days ago

• 26

Organizations

None yet

hba123's activity

reacted to their post with 🚀 3 days ago

Post

1775

I have some New Year presents for you, #MachineLearning and #AI community! We just opened our code for new state-of-the-art results that beat EAGLE-2 and Medusa #LLM inference.

We also shared the model check pt on @huggingface ! @MatthieuZ

Check the blog out: https://huggingface.co/blog/hba123/sotaspeculativedecoding

upvoted an article 4 days ago

Article

Accelerating Language Model Inference with Mixture of Attentions

•

4 days ago

• 24

posted an update 4 days ago

Post

1775

I have some New Year presents for you, #MachineLearning and #AI community! We just opened our code for new state-of-the-art results that beat EAGLE-2 and Medusa #LLM inference.

We also shared the model check pt on @huggingface ! @MatthieuZ

Check the blog out: https://huggingface.co/blog/hba123/sotaspeculativedecoding

published an article 4 days ago

Article

Accelerating Language Model Inference with Mixture of Attentions

•

4 days ago

• 24

liked a model 4 days ago

huawei-noah/MOASpec-Llama-3-8B-Instruct

Updated 4 days ago • 34 • 4

authored 2 papers 12 days ago

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 65

SparsePO: Controlling Preference Alignment of LLMs via Sparse Token Masks

Paper • 2410.05102 • Published Oct 7, 2024

reacted to their post with 🚀 15 days ago

Post

1791

Blindly applying algorithms without understanding the math behind them is not a good idea frmpv. So, I am on a quest to fix this!

I wrote my first hugging face article on how you would derive closed-form solutions for KL-regularised reinforcement learning problems - what is used for DPO.

Check it out: https://huggingface.co/blog/hba123/derivingdpo