From flat direct models to segmental CRF models

14 years 19 days ago

Download research.microsoft.com

This paper summarizes recent work at Microsoft on the development of novel direct models. The key characteristic of our approaches is the use of long-span segment level features that relate acoustic properties directly to words. In this approach, the frame-level Markov assumption is replaced by the segment level Markov property, allowing us to extract long-span features. A key issue we address is the deﬁnition of generalizable features which allow us to model unseen words. We review two recently developed models that have this property: Flat Direct Models (FDMs), and Segmental CRFs (SCRFs). The ﬁrst operates in a log-linear framework, and uses utterance level features. The second is also a log-linear model, but deﬁnes features at the word-segment level. We present new experimental results comparing the two approaches. We ﬁnd that both show consistent improvements over a baseline system, and that the extra context available to the FDM enables slightly better performance in a re...

Geoffrey Zweig, Patrick Nguyen

Real-time Traffic

Direct Models | ICASSP 2010 | Long-span Segment Level | Segment Level | Signal Processing |

claim paper

Post Info
More Details (n/a)

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Geoffrey Zweig, Patrick Nguyen

Comments (0)

Sciweavers

From flat direct models to segmental CRF models

Direct Models | ICASSP 2010 | Long-span Segment Level | Segment Level | Signal Processing |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers