Sciweavers

MLMI
2007
Springer
14 years 5 months ago
Integrating Semantics into Multimodal Interaction Patterns
A user experiment on multimodal interaction (speech, hand position and hand shapes) to study two major relationships: between the level of cognitive load experienced by users and t...
Ronnie Taib, Natalie Ruiz
MLMI
2007
Springer
14 years 5 months ago
Conditional Sequence Model for Context-Based Recognition of Gaze Aversion
Eye gaze and gesture form key conversational grounding cues that are used extensively in face-to-face interaction among people. To accurately recognize visual feedback during inter...
Louis-Philippe Morency, Trevor Darrell
MLMI
2007
Springer
14 years 5 months ago
To Separate Speech
The PASCAL Speech Separation Challenge (SSC) is based on a corpus of sentences from the Wall Street Journal task read by two speakers simultaneously and captured with two circular ...
John W. McDonough, Ken'ichi Kumatani, Tobias Gehri...
MLMI
2007
Springer
14 years 5 months ago
Microphone Array Beamforming Approach to Blind Speech Separation
In this paper, we present a microphone array beamforming approach to blind speech separation. Unlike previous beamforming approaches, our system does not require a-priori knowledge...
Ivan Himawan, Iain McCowan, Mike Lincoln
MLMI
2007
Springer
14 years 5 months ago
Automatic Decision Detection in Meeting Speech
Abstract. Decision making is an important aspect of meetings in organisational settings, and archives of meeting recordings constitute a valuable source of information about the de...
Pei-yun Hsueh, Johanna D. Moore
MLMI
2007
Springer
14 years 5 months ago
A Study of Phoneme and Grapheme Based Context-Dependent ASR Systems
In this paper we present a study of automatic speech recognition systems using context-dependent phonemes and graphemes as sub-word units based on the conventional HMM/GMM system a...
John Dines, Mathew Magimai-Doss
MLMI
2007
Springer
14 years 5 months ago
Automatic Labeling Inconsistencies Detection and Correction for Sentence Unit Segmentation in Conversational Speech
In conversational speech, irregularities in the speech such as overlaps and disruptions make it difficult to decide what is a sentence. Thus, despite very precise guidelines on how...
Sébastien Cuendet, Dilek Z. Hakkani-Tü...
MLMI
2007
Springer
14 years 5 months ago
Gaussian Process Latent Variable Models for Human Pose Estimation
We describe a method for recovering 3D human body pose from silhouettes. Our model is based on learning a latent space using the Gaussian Process Latent Variable Model (GP-LVM) [1]...
Carl Henrik Ek, Philip H. S. Torr, Neil D. Lawrenc...
MLMI
2007
Springer
14 years 5 months ago
Using Prosodic Features in Language Models for Meetings
Abstract. Prosody has been actively studied as an important knowledge source for speech recognition and understanding. In this paper, we are concerned with the question of exploiti...
Songfang Huang, Steve Renals
MLMI
2007
Springer
14 years 5 months ago
Term-Weighting for Summarization of Multi-party Spoken Dialogues
This paper explores the issue of term-weighting in the genre of spontaneous, multi-party spoken dialogues, with the intent of using such term-weights in the creation of extractive ...
Gabriel Murray, Steve Renals