Ageing Voices: The Effect of Changes in Voice Parameters on ASR Performance

With ageing, human voices undergo several changes, typically characterized by increased hoarseness and changes in articulation patterns. In this study, we examined the effect of these changes on Automatic Speech Recognition (ASR) and found that the Word Error Rate (WER) on older voices is about 9% absolute higher than on adult voices. Subsequently, we compared several voice source parameters, including fundamental frequency, jitter, shimmer, harmonicity and cepstral peak prominence, of adult and older males. Several of these parameters show statistically significant differences between the two groups. However, artificially increasing the jitter and shimmer measures does not affect ASR accuracy significantly. Artificially lowering the fundamental frequency degrades ASR performance marginally, but this drop can be overcome to some extent using Vocal Tract Length Normalisation (VTLN). Overall, we observe that the changes in the voice source parameters do not have a ...
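
The "9% absolute" figure in the abstract is a difference in percentage points between two WERs, not a relative increase. The sketch below shows how WER is conventionally computed (word-level edit distance divided by the number of reference words) and how an absolute gap differs from a relative one. The function names and the example figures (20% and 29%) are hypothetical, chosen only for illustration. They are not taken from the paper, and this is not the authors' scoring tool.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (standard Levenshtein recurrence).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


def wer_gap(baseline_wer: float, other_wer: float) -> tuple[float, float]:
    """Return (absolute gap, relative gap) of other_wer over baseline_wer."""
    absolute = other_wer - baseline_wer
    relative = absolute / baseline_wer
    return absolute, relative


if __name__ == "__main__":
    # Hypothetical figures, not results from the paper.
    absolute, relative = wer_gap(0.20, 0.29)
    print(f"absolute gap: {absolute:.2%}, relative gap: {relative:.2%}")
```

With these illustrative numbers, a gap of 9 percentage points corresponds to a much larger relative increase, which is why the abstract specifies "absolute".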

Ravichander Vipperla, Steve Renals, Joe Frankel

ASR Performance | EJASMP 2010 | Fundamental Frequency | Voice Source Parameters |

Added	17 May 2011
Updated	17 May 2011
Type	Journal
Year	2010
Where	EJASMP
Authors	Ravichander Vipperla, Steve Renals, Joe Frankel
