Understanding facial expressions in image sequences is an easy task for humans. Some of us are capable of lipreading by interpreting the motion of the mouth. Automatic lipreading b...
We propose a new browsing system called "Web2Talkshow". It transforms declarative-based web content into humorous dialog-based TV-program-like content that is presented t...
This paper introduces the design and implementation of a corpus-based singing voice synthesis (SVS) system for Mandarin Chinese. The design rules of three corpora for sing...
Cheng-Yuan Lin, Tzu-Ying Lin, Jyh-Shing Roger Jang
A polyglot text-to-speech synthesis system that can read mixed-lingual text aloud must first derive the correct pronunciation. This is achieved with an accurate m...
We describe the speech-enabling approach to building auditory interfaces that treat speech as a first-class modality. The process of designing effective auditory interfaces is de...