Sciweavers

106 search results - page 12 / 22
» On the Modeling of Time Information for Automatic Genre Reco...
Sort
View
TSD
2004
Springer
14 years 1 months ago
Towards Lower Error Rates in Phoneme Recognition
We investigate techniques for acoustic modeling in automatic recognition of context-independent phoneme strings from the TIMIT database. The baseline phoneme recognizer is based on...
Petr Schwarz, Pavel Matejka, Jan Cernocký
ICASSP
2010
IEEE
13 years 7 months ago
A comparison of approaches for modeling prosodic features in speaker recognition
Prosodic information has been successfully used for speaker recognition for more than a decade. The best-performing prosodic system to date has been one based on features extracte...
Luciana Ferrer, Nicolas Scheffer, Elizabeth Shribe...
RIAO
2000
13 years 10 months ago
Speaker change detection using joint audio-visual statistics
In this paper, we present an approach for speaker change detection in broadcast video using joint audio-visual scene change statistics. Our experiments indicate that using joint a...
Giridharan Iyengar, Chalapathy Neti, Sankar Basu
ICASSP
2011
IEEE
13 years 9 days ago
Towards robust word discovery by self-similarity matrix comparison
Word discovery is the task of discovering and collecting occurrences of repeating words in the absence of prior acoustic and linguistic knowledge, or training material. The capabi...
Armando Muscariello, Guillaume Gravier, Fré...
PAMI
2010
218views more  PAMI 2010»
13 years 3 months ago
A Coupled Duration-Focused Architecture for Real-Time Music-to-Score Alignment
Abstract--The capacity for realtime synchronization and coordination is a common ability among trained musicians performing a music score that presents an interesting challenge for...
Arshia Cont