Sciweavers

1617 search results - page 62 / 324
» On the Performance and Use of Speaker Recognition Systems fo...
Sort
View
ICASSP
2011
IEEE
13 years 14 days ago
Deep Belief Networks using discriminative features for phone recognition
Deep Belief Networks (DBNs) are multi-layer generative models. They can be trained to model windows of coefficients extracted from speech and they discover multiple layers of fea...
Abdel-rahman Mohamed, Tara N. Sainath, George Dahl...
ICASSP
2010
IEEE
13 years 9 months ago
Acceleration of sequence kernel computation for real-time speaker identification
The sequence kernel has been shown to be a promising kernel function for learning from sequential data such as speech and DNA. However, it is not scalable to massive datasets due ...
Makoto Yamada, Masashi Sugiyama, Gordon Wichern, T...
IJCNN
2000
IEEE
14 years 10 days ago
People Recognition and Pose Estimation in Image Sequences
This paper presents a system which learns from examples to automatically recognize people and estimate their poses in image sequences with the potential application to daily surve...
Chikahito Nakajima, Massimiliano Pontil, Tomaso Po...
CVPR
2007
IEEE
14 years 10 months ago
A Bayesian algorithm for tracking multiple moving objects in outdoor surveillance video
Reliable tracking of multiple moving objects in video is an interesting challenge, made difficult in real-world video by various sources of noise and uncertainty. We propose a Bay...
Manjunath Narayana, Donna Haverkamp
ISMIR
2005
Springer
151views Music» more  ISMIR 2005»
14 years 2 months ago
Lyrics Recognition from a Singing Voice Based on Finite State Automaton for Music Information Retrieval
Recently, several music information retrieval (MIR) systems have been developed which retrieve musical pieces by the user’s singing voice. All of these systems use only the melo...
Toru Hosoya, Motoyuki Suzuki, Akinori Ito, Shozo M...