Name	Name	Last commit message	Last commit date
parent directory ..
.env.example	.env.example
.env.sample	.env.sample
.gitignore	.gitignore
README.md	README.md
bench_100k_docs.json	bench_100k_docs.json
bench_chroma.py	bench_chroma.py
bench_moss.py	bench_moss.py
bench_pinecone.py	bench_pinecone.py
bench_qdrant.py	bench_qdrant.py
corpus.py	corpus.py
embedding.py	embedding.py
requirements.txt	requirements.txt
run_all.py	run_all.py
stats.py	stats.py

Name

Last commit message

Last commit date

.env.example

Benchmark: Moss vs Pinecone vs Qdrant vs ChromaDB

Reproducible end-to-end latency benchmarks for semantic search. All measurements include embedding generation time — the full cost a developer actually pays per query.

What's being measured

Each benchmark times the complete query cycle:

System	What happens per query
Moss	`client.query(index_name, "text")` — embedding + search in one call
Pinecone	Call embedding API → send vector to Pinecone cloud → get results
Qdrant	Call embedding API → send vector to Qdrant Cloud → get results
ChromaDB	Call embedding API → search local Chroma collection → get results

Moss bundles a built-in embedding model. Competitors require an external embedding service (OpenAI, self-hosted, etc.).

Setup

# 1. Install dependencies
pip install -r requirements.txt

# 2. Copy and fill in credentials
cp .env.example .env
# Edit .env with your API keys

# 3. Run all benchmarks
python run_all.py

# Or run individually
python run_all.py moss
python run_all.py qdrant chroma

Required credentials

Benchmark	What you need
Moss	`MOSS_PROJECT_ID` + `MOSS_PROJECT_KEY`
Pinecone	`PINECONE_API_KEY`
Qdrant	`QDRANT_URL` + `QDRANT_API_KEY`
ChromaDB	Nothing (runs locally in-memory)
Embedding	`OPENAI_API_KEY` or a custom endpoint URL

Embedding provider

Competitors need an embedding service. Two options:

Option A: OpenAI (default) Set OPENAI_API_KEY in .env. Uses text-embedding-3-small (1536 dims). This is the most common production setup.

Option B: Self-hosted on Modal Deploy the embedding server:

pip install modal
modal deploy embedding_server/modal_app.py

Then set in .env:

EMBEDDING_PROVIDER=custom
EMBEDDING_ENDPOINT=https://your-app--embedding-server-model-embed.modal.run
EMBEDDING_DIMENSION=768

Test data

bench_100k_docs.json contains the 100,000 FAQ-style documents used across all benchmarks. Use this file to reproduce results against the exact same corpus.

Benchmark parameters

Documents: 100,000 FAQ-style documents across 8 categories
Queries: 15 diverse search queries
Warmup: 3 rounds (excluded from measurements)
Measured: 50 rounds x 15 queries = 750 measurements per system
top_k: 5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Benchmark: Moss vs Pinecone vs Qdrant vs ChromaDB

What's being measured

Setup

Required credentials

Embedding provider

Test data

Benchmark parameters

FilesExpand file tree

benchmarks

Directory actions

More options

Directory actions

More options

Latest commit

History

benchmarks

Folders and files

parent directory

README.md

Benchmark: Moss vs Pinecone vs Qdrant vs ChromaDB

What's being measured

Setup

Required credentials

Embedding provider

Test data

Benchmark parameters