Skip to content

v0.9.3 — Expanded Model Catalog

Choose a tag to compare

@1bcMax 1bcMax released this 25 Mar 04:22
· 97 commits to main since this release
92e889b

What's New

50+ Models, 8 Providers

Massively expanded the model catalog with complete pricing and shortcuts.

New Model Shortcuts:

  • codex → GPT-5.3 Codex | nano → GPT-5 Nano | o3 / o4 → OpenAI reasoning
  • flash → Gemini 2.5 Flash | gemini-3 → Gemini 3.1 Pro
  • grok-4 → Grok 4 | grok-fast → Grok 4.1 Fast Reasoning
  • r1 → DeepSeek Reasoner | minimax → Minimax M2.7
  • free / nemotron → Nemotron Ultra 253B | devstral / qwen-coder / maverick → NVIDIA free models

Updated Smart Routing

  • Eco tier now uses Nemotron Ultra 253B (free, 253B params) as primary
  • Better fallback chains with Minimax, Qwen3 Coder, Mistral Large
  • Default free model upgraded from GPT-OSS 120B → Nemotron Ultra 253B

Complete Pricing

  • All 50+ models now have accurate input/output token pricing
  • Organized by provider: NVIDIA (free), Anthropic, OpenAI, Google, xAI, DeepSeek, Minimax, others

Full Changelog: v0.9.2...v0.9.3