Skip to content

Feature Request: Support for Gemini, DeepSeek, and OpenAI (Multi-LLM Fallback) #7

@kayacihan

Description

@kayacihan

First of all, thanks for this awesome project. It's a lifesaver for podcast listeners.

Currently, the server is hardcoded to use Claude (Anthropic). While Claude is great, it’s not always the most accessible option for everyone due to pricing or regional availability.

I’d like to suggest adding support for other LLMs, specifically Gemini 1.5 Flash (which has a very generous free tier) and DeepSeek (which is extremely cheap).

Proposed Idea: It would be amazing if we could implement a fallback system in the ad detection logic:

Primary: Use Gemini (via Google AI Studio or OpenAI-compatible API).

Fallback 1: DeepSeek V3 (very cost-effective).

Fallback 2: GPT-4o-mini.

Final Fallback: Claude.

Since most of these providers now support the OpenAI SDK format, we could probably achieve this by just making the base_url, model_name, and api_key configurable in the .env file.

Why this helps:

Lower Cost: Users can run this almost for free using Gemini's free tier.

Reliability: If one API is down or hits a rate limit, the server can still process the podcast.

Flexibility: Users can choose their preferred "brain" for ad detection.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions