This paper presents a Bayesian method for temporally aligning a music score and an audio rendition. A critical problem in audio-toscore alignment is in dealing with the wide varie...
Akira Maezawa, Hiroshi G. Okuno, Tetsuya Ogata, Ma...
The segmentation of ultrasound images is challenging due to the difficulty of appropriate modeling of their appearance variations including speckle as well as signal dropout. We ...
Video based analysis of a persons' mood or behavior is in general performed by interpreting various features observed on the body. Facial actions, such as speaking, yawning o...
Realistic audio-visual mapping remains a very challenging problem. Having short time delay between inputs and outputs is also of great importance. In this paper, we present a new ...
Voice conversion has become more and more important in speech technology, but most of current works have to use parallel utterances of both source and target speaker as the traini...