Skip to content

🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with high quality you ever have.

Notifications You must be signed in to change notification settings

lucasjinreal/Kokoros

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Kokoro Rust

img

Kokoro is a trending top 2 TTS model on huggingface. This repo provides insanely fast Kokoro infer in Rust, you can now have your built TTS engine powered by Kokoro and infer fast by only a command of koko.

kokoros is a rust crate that provides easy to use TTS ability. One can directly call koko in terminal to synthesize audio.

kokoros uses a relative small model 87M params, while results in extremly good quality voices results.

Languge support:

  • English;
  • Chinese (partly);
  • Japanese (partly);
  • German (partly);

🔥🔥🔥🔥🔥🔥🔥🔥🔥 Kokoros Rust version just got a lot attention now. If you also interested in insanely fast inference, embeded build, wasm support etc, please star this repo! We are keep updating it.

Currently help wanted! Implement OpenAI compatible API in Rust, anyone interested? Send me PR!

Updates

  • 2025.01.15: Phonemizer supported! Now Kokoros can inference E2E without anyother dependencies! Kudos to @tstm;
  • 2025.01.13: Espeak-ng tokenizer and phonemizer supported! Kudos to @mindreframer ;
  • 2025.01.12: Released Kokoros;

Build

First, fetch the voices.json data, this is need same as Kokoro official step.

python scripts/fetch_voices.py

Run:

cargo build --release

# test
cargo run

For production:

cargo build --release

./target/release/koko -h
./target/release/koko -t 'Hello, this is a TTS test'

For further development, for example, supports on embeded etc, please raise an issue to discuss your requirement.

Roadmap

Due to Kokoro actually not finalizing it's ability, this repo will keep tracking the status of Kokoro, and helpfully we can have language support incuding: English, Mandarin, Japanese, German, French etc.

Copyright

Copyright reserved by Lucas Jin under Apache License.

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy