Supercharge your LLM apps with Langfuse on Hugging Face Spaces!
Langfuse brings end-to-end observability and tooling to accelerate your dev workflow from experiments through production
Now available as a Docker Space directly on the HF Hub! 🤗
- Trace everything: monitor LLM calls, retrieval, and agent actions with popular frameworks
- One-click deployment: deploy on Spaces with persistent storage and integrated OAuth
- Simple prompt management: version, edit, and update prompts without redeployment
- Intuitive evals: collect user feedback, run model/prompt evaluations, and improve quality
- Dataset creation: build datasets directly from production data to enhance future performance
Kudos to the Langfuse team for this collab and the awesome, open-first product they're building! @marcklingen @Clemo @MJannik
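Once the Space is deployed, pointing the Langfuse Python SDK at it is mostly a matter of environment variables. A minimal sketch, assuming the v2 SDK (`pip install langfuse`); the Space URL and API keys are placeholders you'd copy from your own Langfuse project settings, and the stubbed `answer` function stands in for a real LLM call:

```python
# Minimal tracing sketch, assuming a deployed Langfuse Space and the v2 Python SDK.
import os

os.environ["LANGFUSE_HOST"] = "https://your-username-langfuse.hf.space"  # placeholder Space URL
os.environ["LANGFUSE_PUBLIC_KEY"] = "pk-lf-..."  # placeholder
os.environ["LANGFUSE_SECRET_KEY"] = "sk-lf-..."  # placeholder

from langfuse.decorators import observe

@observe()  # records each call to this function as a trace in your Langfuse project
def answer(question: str) -> str:
    # Stand-in for a real LLM call, to keep the sketch self-contained.
    return f"Echo: {question}"

print(answer("What does Langfuse trace?"))
```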
Cosmos is a family of pre-trained models purpose-built for generating physics-aware videos and world states to advance physical AI development. The release includes the Cosmos Tokenizer collection: nvidia/cosmos-tokenizer-672b93023add81b66a8ff8e6
Releasing a new zeroshot-classifier based on ModernBERT! Some key takeaways:
- Speed & efficiency: it's multiple times faster and uses significantly less memory than DeBERTa-v3. You can use larger batch sizes, and enabling bf16 (instead of fp16) gave me a ~2x speed boost as well.
- Performance tradeoff: it performs slightly worse than DeBERTa-v3 on average across my zeroshot classification task collection.
- Use cases: I recommend it for scenarios requiring speed and a larger context window (8k); see the usage sketch after this list.
- What's next? I'm preparing a newer version trained on better and longer synthetic data to fully leverage the 8k context window and improve upon the training mix of my older zeroshot-v2.0 models. I also hope that there will be a multilingual variant in the future.
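As a quick usage sketch: standard zero-shot classification via the transformers pipeline, with bf16 enabled as mentioned above. The model ID below is a placeholder; substitute the actual ModernBERT-based zeroshot checkpoint.

```python
# Zero-shot classification sketch with transformers; the model ID is a placeholder.
import torch
from transformers import pipeline

classifier = pipeline(
    "zero-shot-classification",
    model="MoritzLaurer/ModernBERT-large-zeroshot",  # placeholder ID
    torch_dtype=torch.bfloat16,  # bf16 gave roughly a 2x speed boost over fp16
)

result = classifier(
    "The new GPU doubles training throughput at the same power draw.",
    candidate_labels=["hardware", "software", "finance"],
)
print(result["labels"][0], result["scores"][0])  # top predicted label and its score
```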
After some heated discussion 🔥, we clarify our intent regarding storage limits on the Hub
TL;DR:
- Public storage is free and, barring blatant abuse, unlimited. We do ask that you consider upgrading to PRO and/or Enterprise Hub if possible.
- Private storage is paid above a significant free tier (1TB if you have a paid account, 100GB otherwise).
We continuously optimize our infrastructure to scale our storage for the coming years of growth in machine learning, to the benefit of the community 🔥
Six predictions for AI in 2025 (and a review of how my 2024 predictions turned out):
- There will be the first major public protest related to AI.
- A big company will see its market cap divided by two or more because of AI.
- At least 100,000 personal AI robots will be pre-ordered.
- China will start to lead the AI race (as a consequence of leading the open-source AI race).
- There will be big breakthroughs in AI for biology and chemistry.
- We will begin to see the economic and employment growth potential of AI, with 15M AI builders on Hugging Face.
How my predictions for 2024 turned out:
- A hyped AI company will go bankrupt or get acquired for a ridiculously low price ✅ (Inflection, Adept, ...)
- Open-source LLMs will reach the level of the best closed-source LLMs ✅ with QwQ and dozens of others
- Big breakthroughs in AI for video, time-series, biology and chemistry ✅ for video, 🔴 for time-series, biology and chemistry
- We will talk much more about the cost (monetary and environmental) of AI ✅ Monetary, 🔴 Environmental
- A popular medium will be mostly AI-generated ✅ with NotebookLM by Google
- 10 million AI builders on Hugging Face, leading to no increase in unemployment (currently 7M AI builders on Hugging Face)
Let's go! We are releasing SmolVLM, a smol 2B VLM built for on-device inference that outperforms all models at similar GPU RAM usage and token throughput.
- SmolVLM generates tokens 7.5 to 16 times faster than Qwen2-VL! 🤯
- Other models at this size crash a laptop, but SmolVLM comfortably generates 17 tokens/sec on a MacBook!
- SmolVLM can be fine-tuned on a Google Colab! Or process millions of documents with a consumer GPU!
- SmolVLM even outperforms larger models in video benchmarks, despite not even being trained on videos!
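For a sense of how simple inference is, here is a short sketch roughly following the usual transformers pattern for this model family (check the SmolVLM model card for the canonical snippet; the image path is a placeholder):

```python
# SmolVLM inference sketch with transformers (Idefics3-style VLM API).
import torch
from PIL import Image
from transformers import AutoProcessor, AutoModelForVision2Seq

model_id = "HuggingFaceTB/SmolVLM-Instruct"
processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForVision2Seq.from_pretrained(model_id, torch_dtype=torch.bfloat16)

image = Image.open("photo.jpg")  # placeholder: any local image
messages = [{
    "role": "user",
    "content": [
        {"type": "image"},
        {"type": "text", "text": "Describe this image briefly."},
    ],
}]

# Build the chat prompt, run generation, and decode the answer.
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(text=prompt, images=[image], return_tensors="pt")
generated_ids = model.generate(**inputs, max_new_tokens=100)
print(processor.batch_decode(generated_ids, skip_special_tokens=True)[0])
```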
This is no Woodstock AI, but it will be fun nonetheless haha. I'll be hosting a live workshop with team members next week about the Enterprise Hugging Face Hub.
1,000 spots available, first come, first served, with some surprises during the stream!
Maybe, like me, you have always wanted a super easy way to compare llama3.2-1B vs. llama3.2-3B? Or the same model with different temperatures?
Trying and comparing warm Inference API models has never been easier! Just go to https://hf.co/playground, set your token, and you're ready to go. We'll keep improving, feedback welcome!
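If you prefer scripting the same comparison, here is a sketch with huggingface_hub's InferenceClient; the model IDs, prompt, and temperatures are just examples, and it assumes you are logged in or have HF_TOKEN set in your environment:

```python
# Illustrative sketch: compare two warm Inference API models at two temperatures.
from huggingface_hub import InferenceClient

messages = [{"role": "user", "content": "Explain overfitting in one sentence."}]

for model_id in ("meta-llama/Llama-3.2-1B-Instruct", "meta-llama/Llama-3.2-3B-Instruct"):
    client = InferenceClient(model_id)  # picks up your saved HF token / HF_TOKEN env var
    for temperature in (0.2, 0.8):
        out = client.chat_completion(messages, max_tokens=60, temperature=temperature)
        print(f"{model_id} @ T={temperature}:\n{out.choices[0].message.content}\n")
```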