Sciweavers

106 search results - page 16 / 22
» On the Modeling of Time Information for Automatic Genre Reco...
Sort
View
MLMI
2007
Springer
14 years 2 months ago
Binaural Speech Separation Using Recurrent Timing Neural Networks for Joint F0-Localisation Estimation
A speech separation system is described in which sources are represented in a joint interaural time difference-fundamental frequency (ITD-F0) cue space. Traditionally, recurrent t...
Stuart N. Wrigley, Guy J. Brown
MM
2006
ACM
162views Multimedia» more  MM 2006»
14 years 2 months ago
An innovative three-dimensional user interface for exploring music collections enriched
We present a novel, innovative user interface to music repositories. Given an arbitrary collection of digital music files, our system creates a virtual landscape which allows the...
Peter Knees, Markus Schedl, Tim Pohle, Gerhard Wid...
ICASSP
2011
IEEE
13 years 9 days ago
Improving acoustic event detection using generalizable visual features and multi-modality modeling
Acoustic event detection (AED) aims to identify both timestamps and types of multiple events and has been found to be very challenging. The cues for these events often times exist...
Po-Sen Huang, Xiaodan Zhuang, Mark Hasegawa-Johnso...
ISMIR
2003
Springer
161views Music» more  ISMIR 2003»
14 years 1 months ago
Improving polyphonic and poly-instrumental music to score alignment
Music alignment links events in a score and points on the audio performance time axis. All the parts of a recording can be thus indexed according to score information. The automat...
Ferréol Soulez, Xavier Rodet, Diemo Schwarz
FGR
2008
IEEE
264views Biometrics» more  FGR 2008»
13 years 10 months ago
Large scale learning and recognition of faces in web videos
The phenomenal growth of video on the web and the increasing sparseness of meta information associated with it forces us to look for signals from the video content for search/info...
Ming Zhao 0003, Jay Yagnik, Hartwig Adam, David Ba...