Continuous vocal imitation with self-organized vowel spaces in Recurrent Neural Network


Abstract: A continuous vocal imitation system was developed using a computational model that explains the process of phoneme acquisition by infants. Human infants perceive speech sounds not as discrete phoneme sequences but as continuous acoustic signals. One of the critical problems in phoneme acquisition is how to segment these continuous speech sounds. The key idea for solving this problem is that articulatory mechanisms such as the vocal tract help human beings perceive speech sound units corresponding to phonemes. To segment acoustic signals together with articulatory movements, our system applies a segmentation method based on a Recurrent Neural Network with Parametric Bias (RNNPB). This method determines multiple segmentation boundaries in a temporal sequence using the prediction error of the RNNPB model, and the PB values obtained by the method can be encoded as a kind of phoneme. Our system was implemented using a physical vocal tract model called the Maeda model. Experimenta...
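
The idea of placing segmentation boundaries from a prediction error can be illustrated with a minimal Python sketch. This is not the authors' RNNPB procedure: it assumes a per-step prediction error sequence has already been computed by some model, and it simply marks a boundary wherever that error rises above a threshold. The function name `segment_by_prediction_error` and the parameters `threshold` and `min_gap` are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence


def segment_by_prediction_error(
    errors: Sequence[float], threshold: float, min_gap: int = 1
) -> List[int]:
    """Return time indices where a new segment is assumed to start.

    Illustrative stand-in only: a boundary is placed at step t when the
    error rises above `threshold` (errors[t-1] <= threshold < errors[t])
    and at least `min_gap` steps have passed since the previous boundary.
    """
    boundaries: List[int] = []
    last = -min_gap  # lets the first upward crossing always qualify
    for t in range(1, len(errors)):
        crossed = errors[t - 1] <= threshold < errors[t]
        if crossed and t - last >= min_gap:
            boundaries.append(t)
            last = t
    return boundaries


if __name__ == "__main__":
    # Hypothetical error trace; in a real system this would come from the model.
    errors = [0.1, 0.1, 0.9, 0.2, 0.1, 0.8, 0.85, 0.1]
    print(segment_by_prediction_error(errors, threshold=0.5, min_gap=2))
```

In a system like the one described, each resulting segment would then be associated with its own PB value; that step is omitted here because the abstract does not specify how it is done.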

Hisashi Kanda, Tetsuya Ogata, Toru Takahashi, Kazunori Komatani, Hiroshi G. Okuno


Continuous Vocal Imitation | ICRA 2009 | Phoneme Acquisition | Robotics | Speech Sound


» Vocal imitation using physical vocal tract model

» Segmenting acoustic signal with articulatory movement using Recurrent Neural Network for p...

Post Info

Added	23 May 2010
Updated	23 May 2010
Type	Conference
Year	2009
Where	ICRA
Authors	Hisashi Kanda, Tetsuya Ogata, Toru Takahashi, Kazunori Komatani, Hiroshi G. Okuno
