Cross-Channel Spectral Subtraction for meeting speech recognition

13 years 4 months ago

Download mirlab.org

We propose Cross-Channel Spectral Subtraction (CCSS), a source separation method for recognizing meeting speech where one microphone is prepared for each speaker. The method quickly adapts to changes in transfer functions and uses spectral subtraction to suppress the speech of other speakers. Compared with conventional source separation methods based on independent component analysis (ICA) or that use binary masks, it requires less computational costs and the resulting speech signals have less distortion. In a recognition task of computer-simulated, partially-overlapped speech, CCSS improved the word accuracy from 66.5% to 77.7%. It also signiﬁcantly improved the recognition accuracy of speech data in actual meetings.

Yu Nasu, Koichi Shinoda, Sadaoki Furui

Real-time Traffic

Cross-Channel Spectral Subtraction | ICASSP 2011 | Signal Processing | Source Separation Methods | Spectral Subtraction |

claim paper

Post Info
More Details (n/a)

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Yu Nasu, Koichi Shinoda, Sadaoki Furui

Comments (0)

Sciweavers

Cross-Channel Spectral Subtraction for meeting speech recognition

Cross-Channel Spectral Subtraction | ICASSP 2011 | Signal Processing | Source Separation Methods | Spectral Subtraction |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers