text-to-speech

Star

Here are 3,364 public repositories matching this topic...

RVC-Boss / GPT-SoVITS

Star

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

text-to-speech tts voice-cloning vits voice-clone voice-cloneai

Updated Dec 19, 2024
Python

coqui-ai / TTS

Star

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

python text-to-speech deep-learning speech pytorch tts speech-synthesis voice-conversion vocoder voice-synthesis tacotron voice-cloning speaker-encodings melgan speaker-encoder multi-speaker-tts glow-tts hifigan tts-model

Updated Aug 16, 2024
Python

babysor / MockingBird

Sponsor

Star

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

text-to-speech ai deep-learning speech pytorch tts

Updated Nov 15, 2024
Python

2noise / ChatTTS

Star

A generative speech model for daily dialogue.

python chat agent text-to-speech torch tts english chinese gpt natural-language-inference english-language chinese-language torchaudio llm chatgpt llm-agent chattts

Updated Dec 3, 2024
Python

myshell-ai / OpenVoice

Star

Instant voice cloning by MIT and MyShell.

text-to-speech tts voice-clone zero-shot-tts

Updated Dec 12, 2024
Python

leon-ai / leon

Star

🧠 Leon is your open-source personal assistant.

Updated Nov 20, 2024
TypeScript

jianchang512 / pyvideotrans

Star

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，同时支持语音识别转录、语音合成、字幕翻译。

text-to-speech speech-to-text video-transition

Updated Dec 21, 2024
Python

mozilla / TTS

Star

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

python text-to-speech deep-learning speech pytorch tts vocoder tacotron tensorflow2 tacotron2 melgan speaker-encoder dataset-analysis glow-tts multiband-melgan gantts

Updated Nov 9, 2023
Jupyter Notebook

FunAudioLLM / CosyVoice

Star

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

python text-to-speech japanese chatbot multi-lingual tts english chinese korean cantonese natural-language-generation cross-lingual fine-grained fine-tuning voice-cloning audio-generation chatgpt gpt-4o cosyvoice

Updated Dec 18, 2024
Python

espnet / espnet

Star

End-to-End Speech Processing Toolkit

text-to-speech deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated Dec 23, 2024
Python

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.