
ISCAS 2002, IEEE

Real-time streaming for the animation of talking faces in multiuser environments

To enable face animation on the Internet with high-quality synthetic speech, Text-to-Speech (TTS) servers need to be implemented as network-based servers shared by many users. The output of a TTS server is used to animate talking heads as defined in MPEG-4. The TTS server creates two sets of data: audio, and phonemes with optional Facial Animation Parameters (FAPs) such as a smile. To animate a talking head on a client, the output of the TTS server must be streamed to that client. Real-time streaming protocols for audio data already exist; we developed a real-time transport protocol with error-recovery capability to stream Phonemes and Facial Animation Parameters (PFAP), which drive the talking head. The stream is designed for interactive services and allows low-latency communication. The typical bit rate for animating a talking face is less than 800 bit/s.
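To make the bit-rate claim concrete, here is a minimal sketch of what a phoneme-plus-FAP packet stream could look like. The field layout, sizes, and FAP id below are assumptions for illustration only; the paper defines the actual PFAP wire format, which is not reproduced here.

```python
import struct

# Hypothetical PFAP packet layout (illustrative, not the paper's format):
#   seq:     16-bit sequence number, supports loss detection for error recovery
#   ts_ms:   32-bit timestamp in milliseconds, for synchronization with audio
#   phoneme: 8-bit phoneme code
#   n_faps:  8-bit count of optional FAPs, each an (id, amplitude) byte pair,
#            e.g. an assumed id 6 for a "smile" FAP

def pack_pfap(seq, ts_ms, phoneme, faps):
    header = struct.pack("!HIBB", seq, ts_ms, phoneme, len(faps))
    body = b"".join(struct.pack("!Bb", fid, amp) for fid, amp in faps)
    return header + body

def unpack_pfap(data):
    seq, ts_ms, phoneme, n = struct.unpack("!HIBB", data[:8])
    faps = [struct.unpack("!Bb", data[8 + 2 * i:10 + 2 * i]) for i in range(n)]
    return seq, ts_ms, phoneme, faps

# A phoneme-only packet is 8 bytes; at an assumed rate of 12 phonemes/s
# the payload costs 8 * 8 * 12 = 768 bit/s, i.e. under the sub-800 bit/s
# range reported in the abstract (before transport overhead).
pkt = pack_pfap(seq=1, ts_ms=80, phoneme=42, faps=[])
bits_per_second = len(pkt) * 8 * 12  # → 768
```

Because FAPs are optional, most packets carry only the phoneme header, which is what keeps the stream in the sub-kilobit range.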
Added 15 Jul 2010
Updated 15 Jul 2010
Type Conference
Year 2002
Where ISCAS
Authors Jörn Ostermann, Jürgen Rurainsky, M. Reha Civanlar