Deep Belief Networks for Real-Time Extraction of Tongue Contours from Ultrasound During Speech

14 years 6 months ago

Download www.u.arizona.edu

Ultrasound has become a useful tool for speech scientists studying mechanisms of language sound production. State-of-the-art methods for extracting tongue contours from ultrasound images of the mouth, typically based on active contour snakes [3], require considerable manual interaction by an expert linguist. In this paper we describe a novel method for fully automatic extraction of tongue contours based on a hierarchy of restricted Boltzmann machines (RBMs), i.e. deep belief networks (DBNs) [2]. Usually, DBNs are first trained generatively on sensor data, then discriminatively to predict human-provided labels of the data. In this paper we introduce the translational RBM (tRBM), which allows the DBN to make use of both human labels and raw sensor data at all stages of learning. This method achieves performance comparable to human labelers, without any temporal smoothing or human intervention required at runtime.

Ian Fasel, Jeff Berry

Real-time Traffic

Computer Vision | ICPR 2010 | Scientists Studying Mechanisms | Sensor Data | Tongue Contours |

claim paper

Post Info
More Details (n/a)

Added	02 Sep 2010
Updated	02 Sep 2010
Type	Conference
Year	2010
Where	ICPR
Authors	Ian Fasel, Jeff Berry

Comments (0)

Sciweavers

Deep Belief Networks for Real-Time Extraction of Tongue Contours from Ultrasound During Speech

Computer Vision | ICPR 2010 | Scientists Studying Mechanisms | Sensor Data | Tongue Contours |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers