TTS Evaluation Pipeline

A unified evaluation pipeline for comparing multiple Text-to-Speech (TTS) models.

Supported Models

Model	Type	Features
GLM-TTS	`glm_tts`	Zero-shot voice cloning
GLM-TTS RL	`glm_tts_rl`	+ RL fine-tuning
Qwen3-TTS	`qwen_tts`	CustomVoice (preset speakers)
Qwen3-TTS VC	`qwen_tts_vc`	Voice cloning (Base model)
Qwen3-TTS vLLM	`qwen_tts_vllm`	vLLM-accelerated (Linux only)
CosyVoice	`cosyvoice`	Zero-shot voice cloning
CosyVoice RL	`cosyvoice_rl`	+ RL fine-tuning

Quick Start

# Install all environments
python setup_models.py --all

# Run evaluation with all enabled models
python main.py

# Run specific models
python main.py -m glm_tts qwen_tts cosyvoice

Output Structure

outputs/
├── latest/           # Current run results
│   ├── glm_tts/      # Model audio outputs
│   ├── qwen_tts/
│   ├── cosyvoice/
│   ├── metrics.json
│   ├── detailed_results.json
│   └── report.md
└── history/          # Archived runs
    └── 20260206_120000/

Key Metrics

RTF (Real-Time Factor): < 1.0 means faster than real-time
First Token Latency: Time to first audio chunk (streaming models)
GPU Memory: Peak memory usage during synthesis

Configuration

Edit config.yaml to customize models and settings. See CLAUDE.md for detailed documentation.

Requirements

CUDA-capable GPU
Conda (Miniconda or Anaconda)
50GB+ disk space for models

Deployment

See DEPLOYMENT.md for detailed setup instructions.

Claude Code Deployment Prompt

Please help me deploy this TTS evaluation project. Follow the steps in DEPLOYMENT.md:

1. Check system environment (GPU, disk space, conda)
2. Install system dependencies
3. If conda is not installed, install Miniconda
4. Clone model repositories (GLM-TTS, CosyVoice)
5. Create conda environments: python setup_models.py --all
6. Download model weights (GLM-TTS and CosyVoice)
7. Verify each environment works correctly
8. Run test: python main.py -m glm_tts

If the GPU is RTX 5080/5090, install PyTorch nightly (refer to the "RTX 50 Series" section in DEPLOYMENT.md).

Report results after each step. If errors occur, try to resolve them using the Troubleshooting section in DEPLOYMENT.md first.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
cosyvoice_vllm_plugin		cosyvoice_vllm_plugin
envs		envs
metrics		metrics
models		models
processed_audio		processed_audio
scripts		scripts
utils		utils
vllm-omni		vllm-omni
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
CLAUDE.md		CLAUDE.md
DEPLOYMENT.md		DEPLOYMENT.md
README.md		README.md
benchmark_cosyvoice_api.py		benchmark_cosyvoice_api.py
config.example.yaml		config.example.yaml
config.yaml		config.yaml
cosyvoice_server.py		cosyvoice_server.py
cosyvoice_server_vllm.py		cosyvoice_server_vllm.py
debug_comparison_test.py		debug_comparison_test.py
debug_cosyvoice_all.py		debug_cosyvoice_all.py
debug_env_compare.py		debug_env_compare.py
debug_exact_feb4.py		debug_exact_feb4.py
debug_llm_tokens.py		debug_llm_tokens.py
debug_onnx_providers.py		debug_onnx_providers.py
debug_server_audio.py		debug_server_audio.py
debug_single_model.py		debug_single_model.py
debug_whisper_compare.py		debug_whisper_compare.py
main.py		main.py
model_runner.py		model_runner.py
model_runner_api.py		model_runner_api.py
model_runner_cosyvoice_api.py		model_runner_cosyvoice_api.py
model_runner_vllm.py		model_runner_vllm.py
requirements.txt		requirements.txt
run_evaluation.py		run_evaluation.py
run_single_model.py		run_single_model.py
setup_models.py		setup_models.py
voice_config.json		voice_config.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TTS Evaluation Pipeline

Supported Models

Quick Start

Output Structure

Key Metrics

Configuration

Requirements

Deployment

Claude Code Deployment Prompt

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TTS Evaluation Pipeline

Supported Models

Quick Start

Output Structure

Key Metrics

Configuration

Requirements

Deployment

Claude Code Deployment Prompt

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages