Content-Length: 277068 | pFad | http://github.com/seanwood/gcc-nmf/tree/2be34a51db24416ab8c3d0dc7afbbf35f5836b3b

2A GitHub - seanwood/gcc-nmf at 2be34a51db24416ab8c3d0dc7afbbf35f5836b3b
Skip to content

Real-time GCC-NMF Blind Speech Separation and Enhancement

License

Notifications You must be signed in to change notification settings

seanwood/gcc-nmf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GCC-NMF

GCC-NMF is a blind source separation algorithm that combines:

  • GCC spatial localization method
  • NMF unsupervised dictionary learning algorithm

GCC-NMF has been applied to stereo speech separation and enhancement in both offline and real-time settings, though it is a generic source separation algorithm and could be applicable to other types of signals.

This GitHub repository is home to open source demonstrations in the form of iPython notebooks:

Offline Speech Separation and Enhancement

The notebooks in this section cover the initial presentation of GCC-NMF in the following publications:

Offline Speech Separation Demo

In the offline speech separation notebook, we show how GCC-NMF can be used to separate multiple concurrent speakers in an offline fashion. The NMF dictionary is first learned directly from the mixture signal, and sources are subsequently separated by attributing each atom at each time to a single source based on the dictionary atoms' estimated time delay of arrival (TDOA). Source localization is achieved with GCC-PHAT.

png

Offline Speech Enhancement Demo

The offline speech enhancement notebook demonstrates how GCC-NMF can can be used for offline speech enhancement, where instead of multiple speakers, we have a single speaker plus noise. In this case, individual atoms are attributed either to the speaker or to noise at each point in time base on the the atom TDOAs as above. The target speaker is again localized with GCC-PHAT.

png









ApplySandwichStrip

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier!      Saves Data!


--- a PPN by Garber Painting Akron. With Image Size Reduction included!

Fetched URL: http://github.com/seanwood/gcc-nmf/tree/2be34a51db24416ab8c3d0dc7afbbf35f5836b3b

Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy