rnnt

Here are 9 public repositories matching this topic...

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Jan 24, 2025
Python

upskyy / Transformer-Transducer

PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)

end-to-end transformer speech-recognition sequence-to-sequence rnnt transformer-transducer

Updated Feb 27, 2022
Python

stevenhillis / awesome-asr-contextualization

A curated list of awesome papers on contextualizing E2E ASR outputs

awesome speech-recognition transducers awesome-list speech-processing asr contextualization error-correction listen-attend-and-spell rnnt

Updated May 10, 2023

iamjanvijay / rnnt_decoder_cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.

cuda speech-recognition beam-search speech-to-text transducer handwriting-recognition prefix-search rnnt

Updated Dec 8, 2020
Cuda

iamjanvijay / rnnt

An implementation of RNN-Transducer loss in TF-2.0.

ctc-loss asr-decoder asr-model transducer-loss rnnt

Updated Mar 25, 2023
Python

manhph2211 / ViSTT

I'm building an end-to-end Vietnamese speech recognition system. I'll deploy it into production with the help of Flask, uWSGI, Nginx, and AWS ...

nginx flask uwsgi pytorch hydra speech-to-text sst aws-deploy conformer pytorch-lightning vietnamese-speech-recognition rnnt vietnames-asr tranducer vivos vietnamese-speech-to-text

Updated Sep 9, 2022
Python

tuanio / conformer-rnnt

Conformer RNN-Transducer

python speech-recognition conformer rnnt

Updated May 25, 2022
Python

George0828Zhang / ssnt_loss

Pure PyTorch implementation of the loss described in "Online Segment to Segment Neural Transduction" https://arxiv.org/abs/1609.08194

python pytorch neural-transducer monotonic-attention rnnt

Updated Mar 12, 2022
Python

Andersonjesusvital / Speech-Recognition-RNN

Deep learning-based subtitle generation model that processes audio datasets to generate accurate text transcriptions. Includes audio feature extraction, encoder-decoder architecture, training pipelines, and evaluation metrics for subtitle alignment.

deep-neural-networks tensorflow dnn recurrent-neural-networks lstm gru rnn speech-to-text sequence-to-sequence rnn-tensorflow gated-recurrent-units rnnt online-speech-recognition transformer-transducer

Updated Jan 24, 2025
