GPROXY

A high-performance LLM proxy server written in Rust. Multi-provider, multi-tenant, with an embedded React console — all in a single static binary.

📘 Documentation: https://gproxy.leenhawk.com
📦 Downloads: https://gproxy.leenhawk.com/downloads/
🦀 Crate: gproxy-sdk
🪪 License: AGPL-3.0-or-later
🌐 Languages: English · 简体中文

What it does

GPROXY exposes a unified, OpenAI / Anthropic / Gemini compatible HTTP surface on top of many upstream LLM providers, and adds the primitives you need to run it as a shared service:

Multi-provider routing — OpenAI, Anthropic, Vertex / Gemini, DeepSeek, Groq, OpenRouter, NVIDIA, Claude Code, Codex, Antigravity, and any OpenAI-compatible custom endpoint.
Two routing modes — aggregated /v1/... (provider encoded in the model name) and scoped /{provider}/v1/... (provider in the URL).
Same-protocol passthrough — minimal-parsing fast path when the client and upstream speak the same dialect.
Cross-protocol translation — an OpenAI client can route to a Claude upstream (and vice versa) through the protocol transform layer.
Multi-tenant auth — users, API keys, glob model permissions, RPM / RPD / token rate limits, and USD-denominated quotas.
Claude prompt caching — server-side cache_breakpoint rules and magic-string triggers for anthropic / claudecode channels.
Request & message rewrite rules — JSON-field manipulation on the request body, plus regex text substitution on message content.
Embedded React console — built into the binary, mounted at /console. No separate frontend to deploy.
Pluggable storage — SQLite, PostgreSQL, MySQL via SeaORM / SQLx, with optional XChaCha20-Poly1305 at-rest encryption.
Rust SDK — gproxy-sdk re-exports the protocol, routing, and provider crates so you can embed the engine into your own service.

Quick start

# 1. Build
git clone https://github.com/LeenHawk/gproxy.git
cd gproxy
cargo build -p gproxy --release

# 2. Run with a minimal config
GPROXY_CONFIG=./gproxy.toml ./target/release/gproxy

A minimal gproxy.toml seed that creates an admin user with wildcard permissions:

[global]
host = "127.0.0.1"
port = 8787
dsn = "sqlite://./data/gproxy.db?mode=rwc"
data_dir = "./data"

[[providers]]
name = "openai-main"
channel = "openai"
settings = { base_url = "https://api.openai.com/v1" }
credentials = [ { api_key = "sk-your-upstream-key" } ]

[[models]]
provider_name = "openai-main"
model_id = "gpt-4.1-mini"
enabled = true

[[users]]
name = "admin"
password = "change-me"
is_admin = true
enabled = true

[[users.keys]]
api_key = "sk-admin-1"
label = "default"
enabled = true

[[permissions]]
user_name = "admin"
model_pattern = "*"

Then open http://127.0.0.1:8787/console and log in as admin.

Full walkthrough: Quick Start.

Sending your first request

# Aggregated endpoint — provider/model prefix in the body
curl http://127.0.0.1:8787/v1/chat/completions \
  -H "Authorization: Bearer sk-admin-1" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openai-main/gpt-4.1-mini",
    "messages": [ { "role": "user", "content": "Hello" } ]
  }'

# Scoped endpoint — provider in the URL, raw upstream model id in the body
curl http://127.0.0.1:8787/openai-main/v1/chat/completions \
  -H "Authorization: Bearer sk-admin-1" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4.1-mini",
    "messages": [ { "role": "user", "content": "Hello" } ]
  }'

See First Request for Anthropic and Gemini examples.

Repository layout

apps/                  # Runnable binaries
  gproxy/              # Main binary (HTTP server + embedded console)
  gproxy-recorder/     # Upstream traffic recorder (dev/debugging)
crates/                # Server-side crates composed by the binary
  gproxy-core/         # Config, identity, policy, quota, routing types
  gproxy-storage/      # SeaORM storage + at-rest encryption + schema sync
  gproxy-api/          # Admin + user HTTP API, auth, login, CORS
  gproxy-server/       # The Axum server wiring it all together
sdk/                   # Framework-agnostic libraries (no DB/HTTP dependencies)
  gproxy-protocol/     # OpenAI/Claude/Gemini wire types + transforms
  gproxy-routing/      # Route classification, permission & rate-limit matching
  gproxy-provider/     # Channel trait, ProviderStore, GproxyEngine
  gproxy-sdk/          # Umbrella crate re-exporting the three above
frontend/console/      # React console, embedded into the binary at build time
docs/                  # Starlight documentation site (source for gproxy.leenhawk.com)

Documentation

The full documentation lives at https://gproxy.leenhawk.com. Some entry points:

To run the docs locally:

cd docs
pnpm install
pnpm dev

License

Released under the AGPL-3.0-or-later license.

Author: LeenHawk

Name		Name	Last commit message	Last commit date
Latest commit History 567 Commits
.github/workflows		.github/workflows
apps		apps
crates		crates
docs		docs
frontend/console		frontend/console
sdk		sdk
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Dockerfile.action		Dockerfile.action
README.md		README.md
README.zh_CN.md		README.zh_CN.md
RELEASE_NOTE.md		RELEASE_NOTE.md
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
release.sh		release.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GPROXY

What it does

Quick start

Sending your first request

Repository layout

Documentation

License

About

Uh oh!

Releases 85

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

GPROXY

What it does

Quick start

Sending your first request

Repository layout

Documentation

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 85

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages