Detecting Stress in Spoken English using Decision Trees and Support Vector Machines

14 years 2 months ago

Download crpit.com

This paper describes an approach to the detection of stress in spoken New Zealand English. After identifying the vowel segments of the speech signal, the approach extracts two different sets of features -- prosodic features and vowel quality features -- from the vowel segments. These features are then normalised and scaled to obtain speaker independent feature values that can be used to classify each vowel segment as stressed or unstressed. We used Decision Trees (C4.5) and Support Vector Machines (LIBSVM) to learn stress-detecting classifiers with various combinations of the features. The approach was evaluated on 60 adult female utterances with 703 vowels and a maximum accuracy of 84.72% was achieved. The results showed that a combination of features derived from duration and amplitude achieved the best performance but the vowel quality features also achieved quite reasonable results.

Huayang Xie, Peter Andreae, Mengjie Zhang, Paul Wa

Real-time Traffic

ACSW 2004 | ACSW 2007 | Prosodic Features | Vowel Quality Features | Vowel Segment |

claim paper

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2004
Where	ACSW
Authors	Huayang Xie, Peter Andreae, Mengjie Zhang, Paul Warren

Comments (0)

Sciweavers

Detecting Stress in Spoken English using Decision Trees and Support Vector Machines

ACSW 2004 | ACSW 2007 | Prosodic Features | Vowel Quality Features | Vowel Segment |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers