Skip to content

Configuration options for direct to provider (Cerebras) #19

@Winston-Hsiao

Description

@Winston-Hsiao

We currently get LLM completions via OpenRouter; however, given the ties to Cerebras provided inference for speed, we want to also just provide the direct connection so users don't need to have an OpenRouter key, and BYOK through OpenRouter.

  1. We currently have RAYPASTE_API_KEY which actually takes the user's OpenRouter key and as a part of our LLM package we make a req to OpenRouter.
  2. We should make separate OPENROUTER_API_KEY and CEREBRAS_API_KEY options for config.
  3. Maintain the RAYPASTE_API_KEY but that will be used in the future, paired with our internal backend/web app (in-progress). We will provide an option for direct to provider CLI <--> inference (so users don't have to pass their API key through our backend/servers), OR if users authenticate/want to use our backend (we provide a Raypaste API key and manage usage/billing, routing requests, and provide extra services/features possible with stored memories/vectors.

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request
No fields configured for Feature.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions