Add note on local inference alternative to HfApiModel #327

HKanoje · 2025-03-16T14:00:46Z

This PR adds a note in the docs explaining how users can run models locally using Ollama + LiteLLM as an alternative to HfApiModel() in case they hit Hugging Face API credit limits.

It includes a sample code snippet using:

LiteLLMModel with model_id="ollama_chat/qwen2.5:7b"

It closes huggingface/smolagents#967 Issues

davidberenstein1957 · 2025-03-17T06:52:50Z

Hi @HKanoje, this could be a nice addition if we could scope this a bit more uniform.

Could you create a uniform section on this within the onboarding? I think we can just mention local serving and ideally use similar API classed (the HF ones) but explain how to redirect the base URL that relies on OpenAIAPI spec for something like ollama.

WDYT?

HKanoje · 2025-03-17T22:11:15Z

@davidberenstein1957 Yes I think you're right. It makes more sense to put in as a different section because HfApiModel() is been used everywhere what do you suggest where should I make this section? During the onboarding steps?

HKanoje · 2025-03-19T19:31:23Z

Hey, @davidberenstein1957 I’ve added the local inference section to the onboarding as you suggested. Let me know if there’s anything you'd like me to adjust or improve! Check here #327

HKanoje mentioned this pull request Mar 16, 2025

List of Free Local LLM Models - Up and Running huggingface/smolagents#967

Closed

Added local inference section in onboarding

2139435

HKanoje force-pushed the add-local-inference-note branch from ae82b7b to 2139435 Compare March 19, 2025 19:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add note on local inference alternative to HfApiModel #327

Add note on local inference alternative to HfApiModel #327

HKanoje commented Mar 16, 2025 •

edited

Loading

davidberenstein1957 commented Mar 17, 2025

HKanoje commented Mar 17, 2025 •

edited

Loading

HKanoje commented Mar 19, 2025 •

edited

Loading

Add note on local inference alternative to HfApiModel #327

Are you sure you want to change the base?

Add note on local inference alternative to HfApiModel #327

Conversation

HKanoje commented Mar 16, 2025 • edited Loading

davidberenstein1957 commented Mar 17, 2025

HKanoje commented Mar 17, 2025 • edited Loading

HKanoje commented Mar 19, 2025 • edited Loading

HKanoje commented Mar 16, 2025 •

edited

Loading

HKanoje commented Mar 17, 2025 •

edited

Loading

HKanoje commented Mar 19, 2025 •

edited

Loading