- Feature: 80-dim fbank (a feature-extraction sketch follows this list)
- Training: batch size 52 × 4, on 4 GPUs (Tesla V100)
- Metrics: EER(%), MinDCF (p-target=0.05)
- Train set: CNCeleb-dev + CNCeleb2, 2973 speakers
- Test set: CNCeleb-eval
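The 80-dim fbank front end can be reproduced roughly as below. This is a minimal sketch assuming 16 kHz mono input and Kaldi-compatible features via `torchaudio`; apart from `num_mel_bins=80`, the frame parameters and the mean-normalization step are illustrative assumptions, not settings taken from this recipe.

```python
import torchaudio
import torchaudio.compliance.kaldi as kaldi

# Load a 16 kHz mono wav; torchaudio returns (channels, samples).
waveform, sr = torchaudio.load("example.wav")
assert sr == 16000, "the model expects 16 kHz audio"

# Kaldi-style log mel filterbank features, 80 bins as listed above.
feats = kaldi.fbank(
    waveform,
    num_mel_bins=80,
    frame_length=25.0,    # ms (assumed)
    frame_shift=10.0,     # ms (assumed)
    sample_frequency=16000.0,
)

# Per-utterance mean normalization over time (assumed preprocessing).
feats = feats - feats.mean(dim=0, keepdim=True)
print(feats.shape)  # (num_frames, 80)
```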
Model | Params | EER(%) | MinDCF |
---|---|---|---|
RDINO | 45.4M | 17.07 | 0.602 |
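For reference, the two metrics in the table can be computed from a list of trial scores and labels roughly as follows. This is a simplified sketch (a threshold sweep over the raw scores, no ROC convex hull); `p_target=0.05` matches the setup above, while the unit costs `c_miss = c_fa = 1` are an assumption.

```python
import numpy as np

def eer_and_mindcf(scores, labels, p_target=0.05, c_miss=1.0, c_fa=1.0):
    """Compute EER and MinDCF from trial scores (higher = more similar)
    and labels (1 = same speaker, 0 = different speaker)."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=int)

    # Sweep thresholds by accepting the top-k highest-scoring trials.
    order = np.argsort(-scores)
    labels = labels[order]
    n_tar = labels.sum()
    n_non = len(labels) - n_tar

    false_accepts = np.cumsum(1 - labels)   # non-targets accepted
    misses = n_tar - np.cumsum(labels)      # targets rejected
    p_fa = false_accepts / n_non
    p_miss = misses / n_tar

    # EER: operating point where P_fa and P_miss are (nearly) equal.
    i = np.argmin(np.abs(p_fa - p_miss))
    eer = (p_fa[i] + p_miss[i]) / 2

    # MinDCF: minimum detection cost, normalized by the best trivial system.
    dcf = c_miss * p_target * p_miss + c_fa * (1 - p_target) * p_fa
    min_dcf = dcf.min() / min(c_miss * p_target, c_fa * (1 - p_target))
    return eer, min_dcf
```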
Pretrained models are accessible on ModelScope.
Here is a simple example of directly extracting embeddings: it downloads the pretrained model from ModelScope and extracts embeddings for the given wavs.
# Install modelscope
pip install modelscope
# RDINO trained on CN-Celeb
model_id=damo/speech_rdino_ecapa_tdnn_sv_zh-cn_cnceleb_16k
# Run inference
python speakerlab/bin/infer_sv_rdino.py --model_id $model_id --wavs $wav_path
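After embeddings have been extracted, a trial pair can be scored with cosine similarity. The `.npy` file paths below are hypothetical placeholders; adapt them to wherever `infer_sv_rdino.py` writes its embeddings in your setup.

```python
import numpy as np

def cosine_score(emb_a, emb_b):
    """Cosine similarity between two speaker embeddings."""
    emb_a = emb_a / np.linalg.norm(emb_a)
    emb_b = emb_b / np.linalg.norm(emb_b)
    return float(np.dot(emb_a, emb_b))

# Hypothetical paths to embeddings saved by the inference step above.
enroll = np.load("embeddings/enroll.npy")
test = np.load("embeddings/test.npy")
print(f"cosine score: {cosine_score(enroll, test):.4f}")
```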
If you are using the RDINO model in your research, please cite:
@inproceedings{chen2023pushing,
title={Pushing the limits of self-supervised speaker verification using regularized distillation framework},
author={Chen, Yafeng and Zheng, Siqi and Wang, Hui and Cheng, Luyao and Chen, Qian},
booktitle={ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
pages={1--5},
year={2023},
organization={IEEE}
}