The proliferation of consumer recording devices and video-sharing websites makes it increasingly likely that multiple recordings of the same occurrence are available...
This paper investigates spoken term detection (STD) from audio recordings of course lectures obtained from an existing media repository. STD is performed from word lattices genera...
Richard Rose, Atta Norouzian, Aarthi Reddy, André...
Measuring similarity of two musical pieces is an ill-defined problem for which recent research on contextual information, assigned as free-form text (tags) in social networking s...
The problem of audio source separation from a monophonic sound mixture having known instrument types but unknown timbres is presented. An improvement to the Probabilistic Latent C...
We propose a novel semi-supervised method for building a statistical model that represents the relationship between sounds and text labels (“tags”). The proposed method, named...
Jun Takagi, Yasunori Ohishi, Akisato Kimura, Masas...