GCC-NMF

GCC-NMF is a blind source separation algorithm that combines:

GCC spatial localization method
NMF unsupervised dictionary learning algorithm

GCC-NMF has been applied to stereo speech separation and enhancement in both offline and real-time settings, though it is a generic source separation algorithm and could be applicable to other types of signals.

This GitHub repository is home to open source demonstrations in the form of iPython notebooks:

Offline Speech Separation iPython Notebook
Offline Speech Enhancement iPython Notebook

Offline Speech Separation and Enhancement

The notebooks in this section cover the initial presentation of GCC-NMF in the following publications:

Sean UN Wood and Jean Rouat, Speech Separation with GCC-NMF, Interspeech 2016.
DOI: 10.21437/Interspeech.2016-1449
Sean UN Wood, Jean Rouat, Stéphane Dupont, Gueorgui Pironkov, Speech Separation and Enhancement with GCC-NMF, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 4, pp. 745–755, 2017.
DOI: 10.1109/TASLP.2017.2656805

Offline Speech Separation Demo

In the offline speech separation notebook, we show how GCC-NMF can be used to separate multiple concurrent speakers in an offline fashion. The NMF dictionary is first learned directly from the mixture signal, and sources are subsequently separated by attributing each atom at each time to a single source based on the dictionary atoms' estimated time delay of arrival (TDOA). Source localization is achieved with GCC-PHAT.

Offline Speech Enhancement Demo

The offline speech enhancement notebook demonstrates how GCC-NMF can can be used for offline speech enhancement, where instead of multiple speakers, we have a single speaker plus noise. In this case, individual atoms are attributed either to the speaker or to noise at each point in time base on the the atom TDOAs as above. The target speaker is again localized with GCC-PHAT.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
README_files		README_files
data		data
gccNMF		gccNMF
notebooks		notebooks
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GCC-NMF

Offline Speech Separation and Enhancement

Offline Speech Separation Demo

Offline Speech Enhancement Demo

About

Releases

Packages

Contributors 2

Languages

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier! Saves Data!

License

seanwood/gcc-nmf

Folders and files

Latest commit

History

Repository files navigation

GCC-NMF

Offline Speech Separation and Enhancement

Offline Speech Separation Demo

Offline Speech Enhancement Demo

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier! Saves Data!

Packages