Speech Structure and Its Application to Robust Speech Processing

14 years 25 days ago

Download www.gavo.t.u-tokyo.ac.jp

Speech communication consists of three steps: production, transmission, and hearing. Every step inevitably involves acoustic distortions due to gender diﬀerences, age, microphone- and room-related factors, and so on. In spite of these variations, listeners can extract linguistic information from speech as easily as if the communications had not been aﬀected by variations at all. One may hypothesize that listeners modify their internal acoustic models whenever extralinguistic factors change. Another possibility is that the linguistic information in speech can be represented separately from the extralinguistic factors. In this study, inspired by studies of humans and animals, a novel solution to the problem of intrinsic variations is proposed. Speech structures invariant to these variations are derived as transform-invariant features and their linguistic validity is discussed. Their high robustness is demonstrated by applying the speech structures to automatic speech recognition and ...

Nobuaki Minematsu, Satoshi Asakawa, Masayuki Suzuk

Real-time Traffic

Communications | Extralinguistic Factors | Linguistic Information | NGC 2010 | Speech Structures |

claim paper

Post Info
More Details (n/a)

Added	29 Jan 2011
Updated	29 Jan 2011
Type	Journal
Year	2010
Where	NGC
Authors	Nobuaki Minematsu, Satoshi Asakawa, Masayuki Suzuki, Yu Qiao

Comments (0)

Sciweavers

Speech Structure and Its Application to Robust Speech Processing

Communications | Extralinguistic Factors | Linguistic Information | NGC 2010 | Speech Structures |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers