v0.9.3 — Expanded Model Catalog

1bcMax released this 25 Mar 04:22

· 97 commits to main since this release

92e889b

What's New

50+ Models, 8 Providers

Massively expanded the model catalog with complete pricing and shortcuts.

New Model Shortcuts:

codex → GPT-5.3 Codex | nano → GPT-5 Nano | o3 / o4 → OpenAI reasoning
flash → Gemini 2.5 Flash | gemini-3 → Gemini 3.1 Pro
grok-4 → Grok 4 | grok-fast → Grok 4.1 Fast Reasoning
r1 → DeepSeek Reasoner | minimax → Minimax M2.7
free / nemotron → Nemotron Ultra 253B | devstral / qwen-coder / maverick → NVIDIA free models

Updated Smart Routing

Eco tier now uses Nemotron Ultra 253B (free, 253B params) as primary
Better fallback chains with Minimax, Qwen3 Coder, Mistral Large
Default free model upgraded from GPT-OSS 120B → Nemotron Ultra 253B

Complete Pricing

All 50+ models now have accurate input/output token pricing
Organized by provider: NVIDIA (free), Anthropic, OpenAI, Google, xAI, DeepSeek, Minimax, others

Full Changelog: v0.9.2...v0.9.3

Assets 2