Sciweavers

123 search results - page 22 / 25
» Improving Acoustic Models with Captioned Multimedia Speech
Sort
View
NIPS
2003
13 years 9 months ago
A Classification-based Cocktail-party Processor
At a cocktail party, a listener can selectively attend to a single voice and filter out other acoustical interferences. How to simulate this perceptual ability remains a great cha...
Nicoleta Roman, DeLiang L. Wang, Guy J. Brown
MM
2004
ACM
117views Multimedia» more  MM 2004»
14 years 1 months ago
Singing voice detection in popular music
We propose a novel technique for the automatic classification of vocal and non-vocal regions in an acoustic musical signal. Our technique uses a combination of harmonic content a...
Tin Lay Nwe, Arun Shenoy, Ye Wang
ICASSP
2011
IEEE
12 years 11 months ago
Integrating frame-based and segment-based dynamic time warping for unsupervised spoken term detection with spoken queries
Rapidly increasing quantities of multimedia and spoken content today demand fast and accurate retrieval approaches for convenient browsing. The spoken documents with wide variety ...
Chun-an Chan, Lin-Shan Lee
MLMI
2004
Springer
14 years 29 days ago
The 2004 ICSI-SRI-UW Meeting Recognition System
We describe the ICSI-SRI-UW team’s entry in the Spring 2004 NIST Meeting Recognition Evaluation. The system was derived from SRI’s 5xRT Conversational Telephone Speech (CTS) r...
Chuck Wooters, Nikki Mirghafori, Andreas Stolcke, ...
UAIS
2010
13 years 2 months ago
Auditory universal accessibility of data tables using naturally derived prosody specification
Abstract Text documents usually embody visually oriented meta-information in the form of complex visual structures, such as tables. The semantics involved in such objects result in...
Dimitris Spiliotopoulos, Gerasimos Xydas, Georgios...