Usage
First, install the outetts
library from NPM with:
npm i outetts
You can then generate speech as follows:
import { HFModelConfig_v1, InterfaceHF } from "outetts";
// Configure the model
const model_config = new HFModelConfig_v1({
model_path: "onnx-community/OuteTTS-0.2-500M",
language: "en", // Supported languages in v0.2: en, zh, ja, ko
dtype: "fp32", // Supported dtypes: "fp32", "fp16", "q8", "q4", "q4f16"
// device: "wasm", // Supported devices: "webgpu", "wasm" (browser) or "cpu", "cuda", "dml" (Node.js, OS-specific)
});
// Initialize the interface
const tts_interface = await InterfaceHF({ model_version: "0.2", cfg: model_config });
// Print available default speakers
tts_interface.print_default_speakers();
// Load a default speaker
const speaker = tts_interface.load_default_speaker("male_1");
// Generate speech
const output = await tts_interface.generate({
text: "Speech synthesis is the artificial production of human speech.",
temperature: 0.1, // Lower temperature values may result in a more stable tone
repetition_penalty: 1.1,
max_length: 4096,
// Optional: Use a speaker profile for consistent voice characteristics
// Without a speaker profile, the model will generate a voice with random characteristics
speaker,
});
// Save the synthesized speech to a file
output.save("output.wav");
- Downloads last month
- 762
Inference API (serverless) does not yet support transformers.js models for this pipeline type.
Model tree for onnx-community/OuteTTS-0.2-500M
Base model
OuteAI/OuteTTS-0.2-500M