This guide will help you get started with RVC-MacOS (Retrieval-based Voice Conversion) quickly.
RVC-MacOS is a voice conversion application that allows you to:
- Convert voice recordings from one voice to another
- Train AI models on custom voices
- Process audio with high quality using Apple Silicon acceleration (MPS)
- Use a web-based interface for easy operation
Important: RVC requires ~1.5-2GB of AI model files to function. These will either be pre-bundled in the app or downloaded on first launch.
-
Download the App
- Go to Releases
- Download
RVC-MacOS-Installer.dmg(~500MB)
-
Install the App
- Open the downloaded DMG file
- Drag
RVC-MacOS.appto your Applications folder - Close the DMG window
-
First Launch (Important!)
- Open Applications folder
- Double-click
RVC-MacOS.app - If you see "cannot be opened because it is from an unidentified developer":
- Right-click the app and select "Open"
- Click "Open" in the dialog
- Or go to System Preferences → Security & Privacy → General and click "Open Anyway"
-
First-Time Model Download
⚠️ IMPORTANT- A console/terminal window will appear with download progress
- The app will download ~1.5GB of AI model files
- This takes 5-10 minutes depending on your internet speed
- Keep the console window open! Do not close it during download
- You'll see clear progress messages:
FIRST-TIME SETUP: Downloading Required AI Models • HuBERT base model (~189MB) • RMVPE pitch detection models (~110MB) • Pretrained RVC models v1 (~600MB) • Pretrained RVC models v2 (~600MB) Estimated time: 5-10 minutes - Internet connection required for first launch
- This only happens once - models are saved for future use
-
App Starts Automatically
- After models download, the web interface starts
- A browser window opens at
http://localhost:7860 - If it doesn't open, manually visit
http://localhost:7860 - Keep the console window open while using the app
- To stop the app, close the console window or press Ctrl+C
- Open RVC-MacOS.app (models already present)
- App starts in 10-15 seconds
- Browser opens automatically
- Ready to use immediately!
-
Initial Setup (First Time)
- Wait for all models to download (if not pre-bundled)
- The web interface will open at
http://localhost:7860 - You should see the RVC web interface with tabs for Model, Inference, Train, etc.
-
Voice Conversion Workflow
- Step 1: Upload an audio file (WAV, MP3, etc.)
- Step 2: Select a voice model (use pretrained or train your own)
- Step 3: Adjust pitch (f0) and other parameters
- Step 4: Click "Convert" to process
- Step 5: Download the converted audio
-
Training Your Own Voice Model
- Prepare audio files:
- 10+ minutes of clean audio (WAV format recommended)
- Single speaker, clear voice
- Minimal background noise
- 16kHz or higher sample rate
- Use the "Train" tab in the web interface
- Name your experiment (e.g., "my_voice")
- Upload or select audio files
- Start training (takes 30-60 minutes on Apple Silicon)
- Use the trained model for conversion
- Prepare audio files:
-
Understanding the Interface
- Model Tab: Manage and download voice models
- Inference Tab: Convert voice in audio files
- Train Tab: Train new voice models
- UVR5 Tab: Separate vocals from instrumentals
- TTS Tab: Text-to-speech features
# Install Homebrew (if not already installed)
/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
# Install Python 3.10
brew install python@3.10
# Install system dependencies
brew install portaudio# Clone the repository
git clone https://github.com/audiohacking/RVC-MacOS.git
cd RVC-MacOS
# Build the app (this creates .venv, installs deps, downloads models, and builds)
./build_app.sh
# Create DMG installer
./create_dmg.shThe built app will be at dist/RVC-MacOS.app and the installer at dist/RVC-MacOS-Installer.dmg.
If you want to run in development mode without building an app bundle:
# Create virtual environment
python3 -m venv .venv
source .venv/bin/activate
# Install dependencies
pip install -r requirements/gui.txt
# Download models
python download_models.py
# Run the app
python web.py
# Open browser to http://localhost:7860- "App is damaged": Run
xattr -cr /Applications/RVC-MacOS.appin Terminal - Permission denied: Check System Preferences → Security & Privacy
- "Python not found": This shouldn't happen with bundled app; rebuild if it does
- Check internet connection: Models are downloaded from Hugging Face
- Firewall blocking: Check if firewall is blocking the app
- Disk space: Ensure you have at least 3GB free space
- Manual download: See README.md for manual model download instructions
- Retry: Close app, delete
assets/folders with incomplete models, restart
- Verify checksums: The app checks SHA256 hashes of model files
- Corrupt downloads: Delete the
assetsfolder and restart to re-download - Check logs: Look at the terminal/console window for error messages
- File permissions: Ensure the app has read/write access to its directories
- This means the app bundle is incomplete
- Rebuild the app:
rm -rf build dist && ./build_app.sh - Ensure all dependencies are installed before building
- Another instance is running or another app is using port 7860
- Kill the process:
lsof -ti:7860 | xargs kill -9 - Or specify a different port by editing
web.pybefore building
- RAM: Ensure you're on a Mac with at least 8GB RAM (16GB recommended)
- CPU: Apple Silicon Macs (M1/M2/M3) perform significantly better than Intel
- Background apps: Close other resource-intensive applications
- Training: Voice model training can take 30-60 minutes on Apple Silicon
- First conversion: First conversion may be slower as models load into memory
- Use high-quality source: 16kHz or higher sample rate
- Clean audio: Remove background noise before conversion
- Adjust f0: Pitch (f0) parameter needs tuning per voice
- Try different models: Experiment with v1 vs v2 pretrained models
- Check RMVPE: Use RMVPE pitch extraction for best results
- Python version: Must be Python 3.8-3.10:
python3 --version- RVC requires this due to fairseq dependency
- Install correct version:
brew install python@3.10
- Missing dependencies:
pip install -r requirements/gui.txt - Clean build:
rm -rf build dist .venv - System dependencies:
brew install portaudio - Try again:
./build_app.sh
- Check Console.app: Look for crash logs and error messages
- Test in Terminal: Run
./dist/RVC-MacOS.app/Contents/MacOS/launcher - Missing models: Ensure models downloaded correctly
- Memory: Close other apps if you have low RAM
- Rebuild: Try
rm -rf build dist && ./build_app.sh
- Issues: https://github.com/audiohacking/RVC-MacOS/issues
- Documentation: See PACKAGING_MACOS_PYTHON_APPS.md
- Original Project: https://github.com/fumiama/Retrieval-based-Voice-Conversion-WebUI
- Better Results: Use high-quality, clean audio recordings for training
- Training Time: Expect 30-60 minutes for training on Apple Silicon
- Disk Space: Keep at least 10GB free for models and training data
- Updates: Check the Releases page regularly for new versions
- Backups: Save your trained models regularly (found in
logs/folder)
- Read the full README.md for more features
- Check out the PACKAGING_MACOS_PYTHON_APPS.md for build details
- Explore training options in the web interface
- Try different pitch extraction algorithms for best results