A common technique to deploy linear prediction to nonstationary signals is time segmentation and local analysis. In [1], the temporal changes of linear prediction coefficients (L...
In this paper, we propose an MMSE a priori SNR estimator for speech enhancement. This estimator has similar benefits to the well-known decision-directed approach, but does not req...
We propose a model for speech recognition that consists of multiple semi-synchronized recognizers operating on a polyphase decomposition of standard speech features. Specifically...
In this paper, we consider online (sequential) portfolio selection in a competitive algorithm framework under transaction costs. We construct a sequential algorithm for portfolio ...
In this paper we present two new methods for speech enhancement based on the previously publised ne pitch model (FPM) for voiced speech. The rst method (FPM-NE) uses the FPM to pr...
Sentence segmentation and punctuation recovery are critical components for effective spoken language translation (SLT). In this paper we describe our recent work on sentence segme...
Matthias Paulik, Sharath Rao, Ian R. Lane, Stephan...
A phase synchronizationmethod, which provides non-uniform frequency offset compensation needed for wideband OFDM [1], is coupled with low-complexity channel estimation in the time...
State-of-the-art speaker diarization systems for meetings are now at a point where overlapped speech contributes significantly to the errors made by the system. However, little i...
Kofi Boakye, B. Trueba-Hornero, Oriol Vinyals, Ger...
To make voice conversion usable in practical applications, the number of training sentences should be minimized. With traditional Gaussian mixture model (GMM) based techniques sma...