ElevenLabs GUI Studio v3.0

A modern, user-friendly desktop application for interacting with the ElevenLabs text-to-speech API.

Features

Core Features

Playground: Convert text to speech using ElevenLabs' advanced AI voices
Voice Cloning (NEW v2.0): Create custom voice clones by uploading audio samples
Voice Management: Browse, manage, and organize your voice library
Voice Library (NEW v2.0): Access and search through available voices
API Key Management: Save and manage multiple named API keys securely
Voice Parameter Controls: Fine-tune voice output with visual sliders:
- Speed (0.5-2.0)
- Stability (0-1)
- Similarity Boost (0-1)
- Style Exaggeration (0-1)
Preset Management (NEW v2.0): Save and load custom voice parameter combinations
Test Settings: Preview voice settings with sample text
Break Tags: Insert SSML break tags for natural pauses
History Tracking: Review and replay previous generations
Tips and Tricks: Best practices guide for optimal results

Requirements

An ElevenLabs API key (get one at elevenlabs.io)
Node.js and npm installed on your system

Installation

Option 1: Download the Release

Go to the Releases page
Download the latest version for your operating system
Install and run the application

Option 2: Build from Source

Clone this repository:

git clone https://github.com/SannidhyaSah/ElevenLabs-GUI-Studio-.git

Navigate to the project directory:
```
cd ElevenLabs-GUI-Studio-
```
Install dependencies:
```
npm install
```
Start the application in development mode:
```
npm start
```

Building

To build installers for distribution:

npm run build

This will create distributable packages in the dist directory.

For detailed build instructions, troubleshooting, and platform-specific requirements, see BUILD.md.

Note for Windows users: You may need to enable Developer Mode or run as Administrator. See BUILD.md for details.

Usage

Setting Up API Keys

When you first start the application, you'll be directed to the Settings tab to add an API key
Enter your ElevenLabs API key in the input field
Give your API key a name (e.g., "Personal", "Work", "Testing")
Click "Save API Key"
You can add multiple API keys and switch between them using the dropdown
API keys are stored securely in the data folder

Generating Speech

Navigate to the Playground tab (formerly Text to Speech)
Select a voice and model
Adjust voice parameters using the sliders
Enter the text you want to convert to speech
Use the "Add Break" button to insert pause tags if needed
Click "Generate Speech" to create the audio
Use the player controls to listen to the generated speech
Click "Save Audio" to save the audio file to your computer
Click the "New" button to start a fresh generation

Using Presets (New in v2.0)

Select a preset from the dropdown to instantly apply voice settings:
- Balanced: Default settings for general use
- Expressive: Lower stability for emotional range
- Stable: High stability for consistent narration
- Fast Speech: 1.5x speed for quick delivery
- Slow & Clear: 0.8x speed for clarity
To save current settings as a preset:
- Adjust parameters to your liking
- Enter a name in "Save as..." field
- Click the save icon
To delete a preset:
- Select it from the dropdown
- Click the delete icon

Testing Voice Settings

Adjust the voice parameters (Stability, Similarity, Style, Speed) using the sliders
Click the "Test Settings" button to generate a sample audio with current settings
Listen to the audio to hear how your settings affect the voice
Click "Reset Settings" to return to default values if needed

Voice Cloning (New in v2.0)

Navigate to the Voice Management tab
Click on "Clone Voice" sub-tab
Enter a name for your voice clone
Add an optional description
Upload audio samples by:
- Clicking the upload area to browse files
- Dragging and dropping audio files
Add optional labels (e.g., "accent:british, age:middle")
Click "Create Voice Clone" to generate your custom voice
Your cloned voice will appear in the voice selection dropdown

Tips for Voice Cloning:

Use clear, high-quality audio samples
Provide multiple samples for better results
Ensure minimal background noise
Samples should be between 30 seconds to 3 minutes

Managing History

Navigate to the History tab to view your previous generations
Click "Play" on any history item to hear it again
Click "Use Text" to load the text from a previous generation
Click "Delete" to remove a specific history item
Click "Clear History" to remove all history items

Voice Parameters

Stability (0-1): Controls how stable/consistent the voice is. Lower values (0.0-0.3) allow for more emotional range and variability, while higher values (0.7-1.0) make the voice more monotonous but consistent.
Similarity Boost (0-1): Controls how closely the AI adheres to the original voice. Higher values (0.7-1.0) make it sound more like the original speaker, while lower values (0.0-0.3) allow for more creativity but may sound less like the original voice.
Style (0-1): Controls style exaggeration of the voice. Higher values (0.7-1.0) amplify the style of the original speaker, making the voice more distinctive and characterized. Default is 0.0 (no style exaggeration).
Speed (0.5-2.0): Controls the speed of the generated speech. Lower values create slower speech (0.5 is half speed), while higher values create faster speech (2.0 is double speed). Default is 1.0 (normal pace).

You can test different combinations of these parameters using the "Test Settings" button to find the perfect voice for your needs.

Tips for Best Results

Use proper punctuation to guide the pacing and intonation of the speech
Use the "Add Break" button to insert pauses of specific duration with <break time="Xs" /> tags
Break long texts into smaller paragraphs for better results
Different voices work better with different models - experiment to find the best combination
Use the "Test Settings" button to quickly hear how different parameter combinations sound
For emotional speech, use lower stability values (0.1-0.3)
For narration or audiobooks, use medium stability (0.4-0.6) and high similarity (0.7-0.9)
For consistent voice assistants, use high stability (0.7-0.9)

SSML Support

The application supports SSML (Speech Synthesis Markup Language) tags for more control over the speech:

<break time="Xs" /> - Add a pause of X seconds (use the "Add Break" button)
<emphasis>text</emphasis> - Emphasize text
<prosody rate="slow/medium/fast">text</prosody> - Control speech rate
<prosody pitch="low/medium/high">text</prosody> - Control pitch

Attribution

Created by @SannidhyaSah

Disclaimer

This is an unofficial application and is not affiliated with ElevenLabs. You must have a valid ElevenLabs API key to use this application. All API usage is subject to ElevenLabs' terms of service.

Changelog

For detailed version history and release notes, see CHANGELOG.md.

Latest Version: 3.0.0 - Professional installer configuration, optimized layouts, enhanced UI/UX

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
assets		assets
js		js
utils		utils
.gitignore		.gitignore
BUILD.md		BUILD.md
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
index.html		index.html
main.js		main.js
package-lock.json		package-lock.json
package.json		package.json
preload.js		preload.js
renderer.js		renderer.js
styles-optimized.css		styles-optimized.css

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ElevenLabs GUI Studio v3.0

Features

Core Features

Requirements

Installation

Option 1: Download the Release

Option 2: Build from Source

Building

Usage

Setting Up API Keys

Generating Speech

Using Presets (New in v2.0)

Testing Voice Settings

Voice Cloning (New in v2.0)

Managing History

Voice Parameters

Tips for Best Results

SSML Support

Attribution

Disclaimer

Changelog

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

SannidhyaSah/ElevenLabs-GUI-Studio-

Folders and files

Latest commit

History

Repository files navigation

ElevenLabs GUI Studio v3.0

Features

Core Features

Requirements

Installation

Option 1: Download the Release

Option 2: Build from Source

Building

Usage

Setting Up API Keys

Generating Speech

Using Presets (New in v2.0)

Testing Voice Settings

Voice Cloning (New in v2.0)

Managing History

Voice Parameters

Tips for Best Results

SSML Support

Attribution

Disclaimer

Changelog

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages