I'd like to be able to utilize OCR, Embeddings, and Reranking models via API with LiteLLM Client SDK. This should improve accuracy when used with SotA models such as Qwen3-Embedding-8B and Qwen3-Rerank
I'd like to be able to utilize
OCR, Embeddings, and Reranking models via API with LiteLLM Client SDK.
This should improve accuracy when used with SotA models such as Qwen3-Embedding-8B and Qwen3-Rerank