Spaces for Audio / Voices
- Running on Zero360🚀
- Running on Zero11👅🎙️🥰
SBV2 Chupa Demo
- Running2😊🎙️📖
VisualNovel_sbv_demo
- Running on CPU Upgrade611😊🎙️
Moe TTS
- Running6🏺
Bert-VITS2 AI Abe&Suga&Kishida
- Running36🚀
AICoverGen
- Build error13:🎤
rvc-Blue-archives-hoyogames
- Running39▶️🎤
VTuber RVC Models
- Running342👀
RVC Inference HF
- Running on Zero222🏃
Audio🔹Separator
Vocal and background audio separator
- Running43📉
BlueArchiveTTS
- Running141😆🌖😀
Multi Voice TTS(English/Chinese/Japanese)
[中文/English/日本語]multilingual text-to-speech
- Running on Zero380🔥
Stable Audio Open Zero
- Running142🍏
Applio
A simple, high-quality voice conversion tool
- Running on Zero1.61k🗣️
Voice Clone
- Running on Zero151⚡
RVC⚡ZERO
Voice conversion framework based on VITS
- Running6🎙🐴
Multilingual Anime TTS
- Runtime error1🎶
DiffSinger🎶 Diffusion for Singing Voice Synthesis
- Running129🎵
Ultimate Vocal Remover WebUI
- Running237🍏😺
Aesthetic RVC Inference HF
- Running64⚡
Advanced RVC Inference
- Running776🏃
Vits Models
- Running499🎙🐴
Multilingual Anime TTS
- Running32⚡
LoveLive-ShojoKageki VITS
- Running362🐨
vits-uma-genshin-honkai
- Running3🏺
おしゃべり晋さんメーカー(Style-Bert-VITS2)
- Running11😊▶️
Hololive Style-Bert-VITS2
- Running on Zero469🎼🎶
Midi Music Generator
- Running22🎼
Japanese Lyric Generator
- Running on L4350🎙
VALL E X
- Running2🔥
AI晋さんメーカー
- Running7📉
BangDream-ShojoKageki Bert VITS2
- Running3📈
lovelive-ShojoKageki VITS JPZH
- Running17🌖
Lovelive-nijigasaki-MB-iSTFT-VITS-ZH&JP
- Running on T42.11k🐶
Bark
- Running1k🤗
OpenVoice
- Running273🤗
OpenVoiceV2
- Runtime error59🐠
ChatTTS OpenVoice
- Running on T4179🌍🦜
MassivelyMultilingualTTS
- Running on T42.24k🐸
XTTS
- Running on A10G4.69k🎵
MusicGen
- Runtime error515📞
Seamless M4T v2
- Sleeping59📉
Mars5 Space
- Running on Zero9🎙️💾🔄🗣️
FAcodecV2
- Running on A10G231👋
TTS x Hallo Talking Portrait
Generate Talking avatars from Text-to-Speech
- Running on CPU Upgrade389🎤
RVC Genshin Impact
- Running on Zero91📚
FoleyCrafter
- Running201🏃
Voice Clone Multilingual
Languages ru,en,zh-cn,ja,de,fr,it,pt,pl,tr,ko,nl,cs,ar,es,hu
- Running on Zero14🐨
Talkalkai Cover
- Running on Zero460🎺
Image to Music v2
Get a music sample inspired by the mood of an image
- Running190🕒
Whisper Timestamped
In-browser speech recognition w/ word-level timestamps
- Running on CPU Upgrade565🏆
TTS Arena
Vote on the latest TTS models!
- Running19🥇
TTSDS Benchmark and Leaderboard
Text-To-Speech (TTS) Evaluation using objective metrics.
- Runtime error6🐨
LAKH MIDI Dataset Search
Search and explore LAKH MIDI dataset with MidiCaps
- Running on Zero23📈
PicoAudio
- Running15🏆
Advanced MIDI Search
Search and explore 179k+ MIDI titles
- Running on Zero78🐠
SenseVoice
- Running227🗣️
Whisper Speaker Diarization
- Running240🚀
Faster Whisper Webui
- Running on Zero31🎤
Vocal Separation SOTA
- Running86🐠
BangDream-ShojoKageki Bert VITS2
- Running2🐠
BangDream-ShojoKageki Api
- Running15🐠
BangDream-ShojoKageki Bert VITS2
- Running13🔊
Efficient Audio Captioning
- Running on Zero175🏃
NaturalSpeech3 FACodec
- Running246🌍
tts Text To Speech
- Sleeping4🌍
Edge Tts
- Runtime error14🏆
JA TTS Arena
Vote on the top Japanese TTS models!
- Running10⚡
MIKU TTS
- Running10🎮️🎹
Genshin music generation
Genshin Impact Game Style Music Generator
- Sleeping3⚡
Advanced RVC Inference
- Sleeping🐠
Style Bert VITS2 MT
- Paused3🎙️
ZeroRVC
- Running13👁
Edge TTS w/ More Options
- Runtime error33⚡
EZ Voice Clone
- Running3⚡
Training Helper Rvc
easy training helper For RVC
- Running on Zero20🚀
Anitalker
- Running6:🎤
rvc-Blue-archives
- Sleeping74🌊
Fish Diffusion (HiFiSinger) Demo
- Running15🥰
Japanese Ero Voice Classifier
- Running29😊🎙️📖
Style Bert VITS2 Editor Demo
- Running on L4417🏆
Fish Speech 1
- Running1⚡
Rvc Demo
A demo of RVC pip
- Running102🐶
Bark Voice Cloning
- Sleeping1🐸
NeonAI Coqui AI TTS Plugin
- Running105🐸
NeonAI Coqui AI TTS Plugin
- Running149🌍
Qwen2 Audio Instruct Demo
- Running8🗣️
StyleTTS 2
Efficient, fast, and natural text to speech with StyleTTS 2!
- Runtime error12🔥
AICoverGen
- Running11🔥
Harmonic Melody MIDI Mixer
Harmonize and mix any MIDI melody
- Running7🎻
MusicGen Riff
Music Generator | Song Maker Free | Lyrics Generator
- Runtime error30🎵
Ilaria Audio Analyzer
- Running on Zero712😻
Ilaria RVC
- Running4🚀 🗿
MDX UVR
- Running on Zero104🤗
GPT SoVITS V2
- Running7🗣️
Read My Pdf Outloud
- Running6⚡
Vocal Remover
- Running on Zero777🥖
Parler-TTS
High-fidelity Text-To-Speech
- Runtime error3🥰
Japanese Ero Voice Classifier
- Running3🐠
GPT-SoVITS-ToneControl_test
- Running19📊
Umamusume Bert Vits2
- Running1📈
Animalese Py
- Running2🔶
Animalese RVC
- Build error4📊
AI Hanser
- Running on Zero156💻
Stable Audio Live Multiplayer
- Running485👁
Edge TTS Text To Speech
- Running15🐨
Youtube AI Summarizer
- Running4🚀
AICoverGen
- Running1💻
Animalese Js
- Sleeping1💬
ASR Model Comparison
- Running4🔥
AICoverGenMod
- Running1🔨
Ilaria Converter
- Running1👁
RVC UI TES
- Running8🎤
RVC Genshin Impact
- Running1🦀
Voice2VoiceChatbot
- Sleeping🌖
RealTimeVoicetoVoiceChatbot
sp-uhh/speech-enhancement-sgmse
Audio-to-Audio • Updated • 4 • 9- Running2🏃
RVC UI
An easy-to-use voice conversion framework based on VITS.
- Sleeping🏃
RVC
- Running🌍
AI Voice Assistance
- Running on Zero1🗣️
Voice Clone
- Sleeping5🌍
Optimus
- Running38👀
Doc To Dialogue
Transform a report or document into an interview/discussion
- Running48⚡
Voicee
Super fastest Voice Assistant
- Running6🐟
Fish Audio API Demo
- Running on Zero59👁
Musicgen Songstarter Demo
- Running81▶️🐻💿
Hololive Rvc Models V2
- Running24🎹
Advanced MIDI Renderer
Transform and render any MIDI
- Sleeping3🚀
Imagen POP Music Medley Diffusion Transformer
Generate POP music medley with Imagen diffusion transformer
- Running2🔥
Ultimate MIDI Classifier
Classify absolutely any MIDI by genre, song and artist
- Running on Zero4📚
Intelligent MIDI Comparator
Intelligently compare any pair of MIDIs
- Running91🌍
ChatTTS Speaker
- Running2🌖
Bridge Music Transformer
Generate a seamless bridge between two composition parts
- Running57👀
vits-simple-api
- Running11🎙️
Bert VITS Umamusume Genshin HonkaiSR
- Running on Zero35🔊⏫
Audio SR
Fixed fork of the original audio sr!
- Running on Zero171🎤🔄
Seed Voice Conversion
- Running40⚡
Mini Omni
- Running4⚡
Monophonic MIDI Melody Harmonizer
Retrieval augmented harmonization of any MIDI melody
- Running10⚡
MIDI Melody
Add a unique melody to any MIDI file
- Running3🔥
MIDI Chords Mixer
Mix chords from one MIDI to another MIDI
- Running2🏆
Morse To Audio
- Sleeping1🚀
RCV EASY GUI
- Running1⚡
Advanced RVC Inference
- Running3⚡
Lyricsgenius
Get Lyrics from Genius's Link
- Sleeping1👁
Groq Gradio Voice Assistant
- Sleeping2🐠
Hex Separator
- Running3🐠
Groq API Models
Groq API Playground
- Running on Zero16👁
GPT-SoVITS-V2-NIIMI SORA
- Paused2🎵
AI Tube Engine MusicGen
- Paused1🎵
AI Tube Engine MusicGen
- Paused1🎵
AI Tube Engine MusicGen
- Paused5🎵
AI Tube Engine MusicGen
- Build error18📚
GPT-SoVITS-V2-Gakuen Idolmaster
- Running on Zero8🌖
UTMOSv2
- Runtime error5⚡
Mini Omni
- Build error10👁
GPT-SoVITS-V2-misc_models
- Configuration error12📊
Bench.audio
LMSYS bench for audio agents
- Runtime error78🌟
Compressed Wav2Lip
- Running81👄
Gradio Lipsync Wav2lip
- Sleeping7🐨
EchoMimic
- Running23🌍
Wav2lip Gpu
- Running1🏃
Matcha TTS Japanese
Description of Matcha TTS Japanese
- Running89💩
DeepFilterNet2
- Running on Zero12🇫🇷🥖
French Parler-TTS
High-fidelity Text-To-Speech
- Running on Zero257🟣
EzAudio
- Running on Zero14🔥
Kotoba Whisper Demo
- Running1🦀
Matcha Tts Onnx Benchmarks
Benchmark load model and tts time
- Runtime error7⚡
Mini Omni
- Running on Zero2🐠
AIChat-matcha-tts-onnx-en
Give your space a voice! (Demo)
- Running on Zero13🌍
GAMA
- Running on Zero4🏆
GAMA-IT
- Sleeping1🦀
Sbv2 Py
- Running on Zero216🎶
OpenMusic
- Running73🎙️
PodcastGen
Generate a 2-speaker podcast from text input or documents!
- Running3🐠
Mistral 7B Instruct v0.3 Matcha-TTS English
Enjoy TTS Chat
- Sleeping2💨
Moshi
- Running on Zero46🟣
EzAudio ControlNet
- Running3🐟
Fish Audio API Demo
- Runtime error1🐠
Whisper En Tiny
- Running on Zero7🏃
Guided Rock Music Transformer
Controlled source augmented rock music transformer
- Running on Zero21🎷
Long-form MusicGen
Long-form Musicgen
- Running75💻
Multilingual TTS
- Running4🔥
AI岸田文雄メーカー
- Running1🔥
AI菅義偉メーカー
- Running1😻
Audio Mouth
- Running390📚
Pdf2audio
- Running on CPU Upgrade586🏆
Open ASR Leaderboard
- Running on T41.02k🎙️
Open NotebookLM
Personalised Podcasts For All - Available in 13 Languages
- Sleeping4🔥
Kotoba Whisper Bilingual Demo
- Running on T4405🗣️
MeloTTS
Fast, efficient, & multilingual text-to-speech
- Running on T4184🐤
Canary 1b
- Running1😻
Style Bert VITS2 SW
- Runtime error21👁
Llama 3.2 3b Voice
- Runtime error1📚
Pdf2audio
- Running on Zero732🤯
Whisper Turbo
- Running on Zero286🤯
Realtime Whisper Turbo
Realtime implementation of Whisper large turbo
- Running141🚀
Whisper Large V3 Turbo WebGPU
ML-powered speech recognition directly in your browser
- Running on T4260🐢
Tortoise Tts
ExpressivText-to-Speech
- Running32💻
Russian Text To Speech
- Running5📉
Yt-dlp Wav
- Running on T4284🎼
UnlimitedMusicGen
unlimited Audio generation with a few added features
- Runtime error84🎶
AudioCraft Plus v2.0.0a (MusicGen + AudioGen)
- Runtime error22🎼
MusicGen+ V1.2.7 (HuggingFace Version)
- Running on Zero61🏢
VoiceRestore
- Running on Zero3⚡
Whisperturbo
whisper3 turbo
- Running34🎙️
GPT-SoVITS-3s-cloning-free-TTS
- Running4🏺
おしゃべり石破茂メーカー(Style-Bert-VITS2)
- Sleeping1🏺
おしゃべり二階俊博メーカー
- Runtime error3🐠
Text To Meow
- Running4🔥
Rvc Ui
- Running26🌍
Reverb ASR Demo
- Running1😻
Ilaria RVC Mod
- Running on T4303🚀
Resemble Enhance
- Sleeping2💻
Openai Whisper Large V3 Turbo
- Running45💻
RVC PlayGround
- Running52🚀
Podcastfy.ai - An Open Source alternative to NotebookLM's podcast feature
- Running on Zero68🎞️🎺
Video to Music
Generate and apply matching music background to video shot
- Running174👂🎞️
Video SoundFX
Generates a sound effect that matches video shot
- Paused171👂
Image2SFX Comparison
Generates audio environment from an image
- Running on Zero184🍏
Applio
- Running on Zero1.62k🗣️
F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
- Running1💜
Heartbeat
- Running153🤗🏆
TTS Spaces Arena
Vote on the top HF TTS models!
- Running on CPU Upgrade64🧝♀️🧛♂️🧚♀️
xVASynth TTS
CPU powered, low RTF, emotional, multilingual TTS
- Running284🎶
— AI Jukebox —
Generate music powered by AI
- Running on L40S325🐠
TANGO
Co-Speech Gesture Video Generation
- Running on Zero14🥰🎤📝
Anime Whisper Demo
- Running on Zero62🏢
Ichigo Llama3.1 S Instruct
- Running6🚀
Whisper Japanese Phone Demo
Whisper model to transcript japanese audio to katakana.
- Running on Zero120📈
ClearerVoice-Studio (Speech Enhancement, Separation and Extraction)
Better AI powered platform to purify your speech signal
- Running20♫🔒
Steganography
Text | Image | Audio | Video to Spectrogram || Steganography
- Running15🔥
AICoverGenMod
- Running12🚀
UVR5 UI
- Running on Zero16🗣️
Diva Realtime Chat
- Sleeping2👁
Kotoba Whisper Diarization Demo
- Running on Zero11📚
Synthio Stable Audio Open
Stable audio open model from Synthio paper.
- Sleeping1🚀
RYO EVC
- Runtime error1😻
UVR
- Running on Zero35🌒
Moonshine ASR
Fast & efficient ASR outperforming Whisper!
- Running22🔊
seewav-gui
- Running on Zero70🎵
RWKV Music
Generate MIDI music using RWKV v4!
- Running4💻
MP3 Transcribe
Whisper Transcribe MP3 files, use a GPU to convert faster!
- Running6🗣️0️⃣
StyleTTS 2 Zero
Efficient, fast, and natural text to speech with StyleTTS 2!
- Running on Zero246😻
MaskGCT TTS Demo
MaskGCT TTS Demo
- Running on Zero62🎵
MelodyFlow
- Running on Zero574🤫
Whisper Large V3
- Sleeping6🚀
Ultimate Chords Progressions Transformer
Self-correcting multi-instrumental chords transformer
- Runtime error8🎶♫
Chords Progressions Transformer
Chords-conditioned music transformer
- Running on Zero25⚡
Fast Whisper Turbo
Ultra-fast Whisper Turbo inference ⚡
- Running on A10G291🔊
AudioLDM2 Text2Audio Text2Music Generation
- Running2🗣️👂
Hey Buddy!
In-Browser Audio Wake-Word Spotting
- Running3🎹
Streamlit Pianoroll
Streamlit pianoroll playback element
- Running9⚡
Audio-Separator
Audio-Separator by Politrees
- Running on Zero99🚀
Giant Music Transformer
Fast multi-instrumental music transformer
- Sleeping23🌖
Omni Mini (WebRTC)
- Sleeping5🎹
Fortepyan Datasets
Streamlit browser for piano music datasets.
- Sleeping4🎹
PIANO Dataset
Demo of masking tasks from the PIANO dataset
- Running on L40S132💬
Fish Agent
An end-to-end (e2e) Voice Language Model by Fish Audio.
- Running7🎵
Audio to Stems to MIDI Converter
- Running25🌍
Podcast Generation
Generate podcasts with AI avatars
- Sleeping🐠
ChatTTS OpenVoice
- Running1📚
OpenVoice
- Running on Zero7🗣️
F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
- Running317📊
Bark with Voice Cloning
- Running27📉
OuteTTS 0.1 350M Demo
- Running on Zero9🎼🎶
Midi Music Generator
- Sleeping4🎵
Audio Lyrics Extractor
- Running10🤔
Did StyleTTS 2 Generate It?
Did StyleTTS 2 generate that audio?!?
- Paused35🌍
Hertz Dev
base model for mono-channel completion
- Sleeping7⚡
Xtts
- Running on Zero225💬
ChatTTS Forge
- Running on Zero300❤️
Kokoro TTS
Now in 5 languages!
- Running6🌖
Pipertts
- Running53🎧
Nexa Omni Demo
- Running on Zero12😻
MaskGCT TTS Demo
MaskGCT TTS Demo
- Sleeping20📚
Video2music
- Running on L4794🔊
Audioldm Text To Audio Generation
- Running2🦀
So VITS SVC
- Sleeping2👀
GPT SoVITS
- Running on Zero272🗣️
Spanish F5
Spanish finetune for the original F5 model.
- Sleeping1🎤⚡🎤
Dolce SVC
- Running2🎤🦊
Dolce TTS
- Running1⚡
Lipsync
- Running5☕🐰🎤
Chino TTS
- Running2🐨
Style Bert VITS2 NO
- Running1📉
Style Bert VITS2 SU
シャルティアのAI音声合成モデルを作りました。
- Running1🔥
Style Bert VITS2 MHY
早乙女乱馬(女)のAI音声合成モデルを作りました。
- Running1🚀
Style Bert VITS2 SAR
ベアトリスのAI音声合成モデルを作りました。
- Running on L433⚡
Talk To Ultravox
Talk to Fixie.ai's Ultravox with WebRTC ⚡️
- Running2🏃
SoundOfWater
Estimate physical properties merely from pouring sound!
- Running9🐢
Llama Code Editor
Create interactive HTML web pages with your voice
- Running on CPU Upgrade28🐨
sutra-avatar-v2
- Running1🌍
Audio Transcriber
Record an audio, then use AI to transcribe and translate it.
- Running on Zero16🖌️🎶
Inpaint Music Transformer
Large and fast music transformer for pitches inpainting
- Running47🐠
OuteTTS 0.2 500M Demo
- Running13🌖
Tsukasa 司 Speech
- Running8🎵
MusicGen Continuation
- Running5🚀
Semanticodec Ultra Low Bitrate Audio Codec
Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a
- Running16📚
Audiosr Versatile Audio Super Resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR
- Running on Zero2🐠
OuteTTS 0.2 500M Demo GPU
- Running2💬
ChatTTS Forge English interface
TTS tool
- Running1📚
Style Bert VITS2 RU2
short_description: 猫屋敷まゆのAI音声合成モデルを作りました。
- Running11🥰🎤🤔
Galgame Voice Finder
- Sleeping1👁
Vad Go
- Running on Zero147👀
Indic Parler-TTS
A demo of Indic Parler-TTS
- Sleeping1🐳
Voice Activity Detection
- Running5👀
Vikhr 4o
- Running1⚡
Audio Arena
audio-arena
- Running18🏢
Wespeaker Demo
- Running4💻
Wesep Tse 2speaker Demo
Target Speaker Extraction with WeSep
- Running13🐢
Wenet Demo
- Runtime error4🏆
Open_ASR_Leaderboard
- Running30🗣️
Text-to-Speech WebGPU
WebGPU text-to-Speech powered by OuteTTS and Transformers.js
- Running12📈
SpeechScore (Speech Quality Metrics and Evaluation)
A home for scoring speech quality
- Running2🐠
Fish Speech Benchmark
Non official benchmark by Fish Speech
- Running on Zero6👅🎙️🥰
Chupa Generator
- Running on Zero5🌖
Japanese Parler-TTS Mini Demo
- Running on Zero4🏢
Japanese Parler-TTS Large Demo
- Runtime error3⚡
Make Anime Emotion Dataset
- Running6😊😱😠
Anime Speech Emotion Recognition
- Running on Zero443🔊
MMAudio — generating synchronized audio from video/text
- Running on Zero27🗣️
Voice Clone
- Running on Zero187🐠
Sound AI SFX
SText to Audio(Sound SFX) Generator
- Runtime error5👁
Talk To Moshi
Talk to Kyutai's moshi - powered by Gradio WebRTC!
- Running on T4372⚡
HierSpeech++ (Zero-shot TTS)
- Running10🌍
Talk To Gradio Docs Rag
Talk to the Gradio docs! Powered by Pydantic and WebRTC ⚡️
- Running6📊
Melody Workshop
"One-minute creation by AI Coding Autonomous Agent MOUSE-I"
- Running on Zero11📉
Text2midi
- Running on Zero106🔊
Audio Llama
generated sound from video/text and search
- Running2🐢
VM Sound Classification
- Running2🪷
Lotus
- Running102🌙
Moonshine Web
Real-time in-browser speech recognition
- Running8💻
Openai Realtime Voice
Talk with openAI's new Realtime Voice API
- Running on Zero8🏆
Fast GeCo
- Running on Zero6📉
SoloAudio
- Running on Zero2🐨
SSR Speech
- Running23🎶
Music Genre Classifier
Music Genre Classifier
- Running2🪕🎵
Guzheng Playing Tech
Guzheng Performance Technique Recognizer
- Running2🪕🎶
Chinese Instruments
Chinese Traditional Instrument Sound Retriever
- Running2🪕🎼
Pentatonic Mode
Chinese Music Pentatonic Mode Detector
- Running on T416🚀
Kotoba-Speech Demo
- Running2🐨
Audio Edit
- Paused4🔊
MMAudio
Video to Audio
- Running8🎙️
Audio Transcription
- Running5📉
Audio 8D
Make your audio to 8D
- Running11⚡
Audio Separator
Audio-Separator Demo
- Running2🎤
Real-time Whisper WebGPU (Vue)
Yet another Real-time Whisper with WebGPU, written in Vue
- Running5🦀
MIDI Identification
Identify any MIDI
- Running2🌙
Moonshine Web (Vue)
Yet another Real-time in-browser STT, re-implemented in Vue
- Running4🧸
アイリ VTuber
アイリ VTuber. LLM powered Live2D/VRM living character.
- Running8🎵🖥️
Figured Bass Calculator
figured bass calculator
- Running135🚀
Ebook2audiobook V2.0 Beta
Added improvements, 1107+ languages supported
- Running2🐸📖
Ebook2audiobook_v1.0
V1.0Convert any Ebook to AudioBook with Xtts + VoiceCloning!
- Running9🪈📖
Ebook2audiobookPiper-tts
Converts Ebooks into audiobooks with piper-tts
- Running5⚡
Ebook2AudiobookV2.0_Docker_Test
First ebook2audiobook Dockerfile test
- Running10🎵🔘
Music Vision
Audio Visualization Circle Effect Tool
- Running4📟
MS1-X Virtual Synth
Ready-to-play synth instrument!
- Running8🎮️💬
hoyoTTS
Genshin Impact & Honkai Star Rail Game Character Voice TTS
- Running9🪕
Erhu Playing Tech
Erhu Performance Technique Recognizer
- Running9🎙
Bel Canto Discriminator
Discriminator of Bel Canto and Chinese Folk Singing
- Running12🎹
Pianos
Piano Sound Quality Classifier
- Running13🎤
Chest Falsetto Discriminator
Discriminator of Chest Vocie and Falsetto
- Running on L4116🥳
CosyVoice2-0.5B
- Running on Zero3👾
Monster Piano Transformer
Ultra-fast and very well fitted solo Piano music transformer
- Running1🌖
Style Bert VITS2 IM2
ヘスティアのAI音声合成モデルを作りました。
- Sleeping1🏃
Style Bert VITS2 YHK2
フレイヤのAI音声合成モデルを作りました。
- Paused2📻🎙️
Anachrovox V0.1 Emerald (Bugged)
Hands-Free AI Voice Chat with a Retro Vibe
- Paused3📻🎙️
Anachrovox V0.1 Azure (Bugged)
Hands-Free AI Voice Chat with a Retro Vibe
- Paused2📻🎙️
Anachrovox V0.1 Amber (Bugged)
Hands-Free AI Voice Chat with a Retro Vibe
- Running67📉
MIDI-Melody-Generator
"One-minute creation by AI Coding Autonomous Agent MOUSE-I"
- Running2⚡
Audio Arena
audio-arena
- Running on Zero21📊
Audio Separator
- Running11🥇
Open Universal Arabic Asr Leaderboard
A benchmark for open-source multi-dialect Arabic ASR models
- Running on Zero248🔥
MusicGen Streaming
- Running2.44k⚡️
Whisper JAX
- Running on Zero25📝
Parler-TTS Streaming
High-fidelity Text-To-Speech
- Running on L4184👄
LatentSync
Audio Conditioned LipSync with Latent Diffusion Models
- Running on A10G239🎼
Singing Voice Conversion
- Running54🔥
Text To Speech
- Running on Zero3🔥
DeepfakeDetection
Deepfake Detection
- Running2🦀
Felguk Audio Edit
Audio edit
- Running on Zero104🎴
Kokoro TTS Zero
Accelerated Text-To-Speech on Kokoro-82M
- Running3📚🎧
📚 𝕡𝕕𝕗 𝕥𝕠 𝕊𝕡𝕖𝕖𝕔𝕙 ℂ𝕠𝕟𝕧𝕖𝕣𝕥𝕖𝕣 🎧
Accessibility PDF & pasted text to speech converter w/ gTTs
- Running on L41.19k😭
SadTalker
- Running1😎
OLLAMA TTS CLIENT
- Running7🚀
Piper TTS Spanish
- Running39🦀
Audio Visualizer
Audio Visualizer
- Running2🦀
JARVIS2
2
- Running on Zero235🚀
TangoFlux
Text to Audio (Sound SFX) Generator
- Running274🎤
Rvc Models
- Running8🎼🎶
Karaoke MIDI Search
- Running16🎵
Semantic Audio Search w/ Transformers.js
- Running on Zero3⚡
Misaki G2P
G2P