Unsupervised learning of auditory filter banks using non-negative matrix factorisation

14 years 6 months ago

Download www.esat.kuleuven.be

Non-negative matrix factorisation (NMF) is an unsupervised learning technique that decomposes a non-negative data matrix into a product of two lower rank non-negative matrices. The non-negativity constraint results in a parts-based and often sparse representation of the data. We use NMF to factorise a matrix with spectral slices of continuous speech to automatically ﬁnd a feature set for speech recognition. The resulting decomposition yields a ﬁlter bank design with remarkable similarities to perceptually motivated designs, supporting the hypothesis that human hearing and speech production are well matched to each other. We point out that the divergence cost criterion used by NMF is linearly dependent on energy, which may inﬂuence the design. We will however argue that this does not signiﬁcantly affect the interpretation of our results. Furthermore, we compare our ﬁlter bank with several hearing models found in literature. Evaluating the ﬁlter bank for speech recognition s...

Alexander Bertrand, Kris Demuynck, Veronique Stout

Real-time Traffic

ICASSP 2008 | Non-negative Matrix Factorisation | Signal Processing | Speech Recognition | ﬁlter Bank |

claim paper

Post Info
More Details (n/a)

Added	30 May 2010
Updated	30 May 2010
Type	Conference
Year	2008
Where	ICASSP
Authors	Alexander Bertrand, Kris Demuynck, Veronique Stouten, Hugo Van Hamme

Comments (0)

Sciweavers

Unsupervised learning of auditory filter banks using non-negative matrix factorisation

ICASSP 2008 | Non-negative Matrix Factorisation | Signal Processing | Speech Recognition | ﬁlter Bank |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers