Sciweavers

316 search results - page 28 / 64
» Evaluation of random-projection-based feature combination on...
Sort
View
TOCHI
1998
112views more  TOCHI 1998»
13 years 8 months ago
The Integrality of Speech in Multimodal Interfaces
A framework of complementary behavior has been proposed which maintains that direct manipulation and speech interfaces have reciprocal strengths and weaknesses. This suggests that...
Michael A. Grasso, David S. Ebert, Timothy W. Fini...
ICASSP
2011
IEEE
13 years 14 days ago
The IBM 2009 GALE Arabic speech transcription system
We describe the Arabic broadcast transcription system elded by IBM in the GALE Phase 4 machine translation evaluation. Key advances over our Phase 3.5 system include improvements ...
Brian Kingsbury, Hagen Soltau, George Saon, Stephe...
VLSISP
1998
140views more  VLSISP 1998»
13 years 8 months ago
Audio Feature Extraction and Analysis for Scene Segmentation and Classification
Understanding of the scene content of a video sequence is very important for content-based indexing and retrieval of multimedia databases. Research in this area in the past severa...
Zhu Liu, Yao Wang, Tsuhan Chen
INTERSPEECH
2010
13 years 3 months ago
On speaker adaptive training of artificial neural networks
In the paper we present two techniques improving the recognition accuracy of multilayer perceptron neural networks (MLP ANN) by means of adopting Speaker Adaptive Training. The us...
Jan Trmal, Jan Zelinka, Ludek Müller
ICPR
2008
IEEE
14 years 10 months ago
Visual features with semantic combination using Bayesian network for a more effective image retrieval
In many vision problems, instead of having fully annotated training data, it is easier to obtain just a subset of data with annotations, because it is less restrictive for the use...
Sabine Barrat, Salvatore Tabbone