Monaural speech separation is a very challenging task. CASAbased systems utilize acoustic features to produce a time-frequency (T-F) mask. In this study, we propose a classificat...
Automatic audio classification usually considers sounds as music, speech, silence or noise, but works about the noise class are rare. Audio features are generally specific to sp...
Pierre Hanna, Nicolas Louis, Myriam Desainte-Cathe...
This paper outlines ProSynth, an approach to speech synthesis which takes a rich linguistic structure as central to the generation of natural-sounding speech. We start from the as...
Richard Ogden, Sarah Hawkins, Jill House, Mark Huc...
For extractive meeting summarization, previous studies have shown performance degradation when using speech recognition transcripts because of the relatively high speech recogniti...
In this paper, we consider the extraction of speaker identity from audio records of broadcast news without a priori acoustic information about speakers. Using an automatic speech ...
Vincent Jousse, Simon Petit-Renaud, Sylvain Meigni...