Sciweavers

ACL
2015

Vector-space calculation of semantic surprisal for predicting word pronunciation duration

8 years 7 months ago
Vector-space calculation of semantic surprisal for predicting word pronunciation duration
In order to build psycholinguistic models of processing difficulty and evaluate these models against human data, we need highly accurate language models. Here we specifically consider surprisal, a word’s predictability in context. Existing approaches have mostly used n-gram models or more sophisticated syntax-based parsing models; this largely does not account for effects specific to semantics. We build on the work by Mitchell et al. (2010) and show that the semantic prediction model suggested there can successfully predict spoken word durations in naturalistic conversational data. An interesting finding is that the training data for the semantic model also plays a strong role: the model trained on indomain data, even though a better language model for our data, is not able to predict word durations, while the out-ofdomain trained language model does predict word durations. We argue that this at first counter-intuitive result is due to the out-of-domain model better matching th...
Asad B. Sayeed, Stefan Fischer, Vera Demberg
Added 13 Apr 2016
Updated 13 Apr 2016
Type Journal
Year 2015
Where ACL
Authors Asad B. Sayeed, Stefan Fischer, Vera Demberg
Comments (0)