Sciweavers

TSD
2010
Springer
13 years 6 months ago
Parallel Training of Neural Networks for Speech Recognition
Karel Veselý, Lukas Burget, Frantisek Gr&ea...
TSD
2010
Springer
13 years 6 months ago
Recovery of Rare Words in Lecture Speech
The vocabulary used in speech usually consists of two types of words: a limited set of common words, shared across multiple documents, and a virtually unlimited set of rare words, ...
Stefan Kombrink, Mirko Hannemann, Lukas Burget, Hy...
TSD
2010
Springer
13 years 6 months ago
Improving Automatic Image Captioning Using Text Summarization Techniques
This paper presents two different approaches to automatic captioning of geo-tagged images by summarizing multiple web-documents that contain information related to an image’s lo...
Laura Plaza, Elena Lloret, Ahmet Aker
TSD
2010
Springer
13 years 6 months ago
Evaluation of a Sentence Ranker for Text Summarization Based on Roget's Thesaurus
Abstract. Evaluation is one of the hardest tasks in automatic text summarization. It is perhaps even harder to determine how much a particular component of a summarization system c...
Alistair Kennedy, Stan Szpakowicz
TSD
2010
Springer
13 years 6 months ago
Extracting Human Spanish Nouns
In this article we present a simple method to extract Spanish nouns with the linguistic property of “human” animacy. We describe a non-supervised method based on lexical patter...
Sofía N. Galicia-Haro, Alexander F. Gelbukh
TSD
2010
Springer
13 years 6 months ago
Using Gradient Descent Optimization for Acoustics Training from Heterogeneous Data
In this paper, we study the use of heterogeneous data for training of acoustic models. In initial experiments, a significant drop of accuracy has been observed on in-domain test s...
Martin Karafiát, Igor Szöke, Jan Cerno...
TSD
2010
Springer
13 years 6 months ago
Diagnostics for Debugging Speech Recognition Systems
Modern speech recognition applications are becoming very complex program packages. To understand the error behaviour of the ASR systems, a special diagnosis - a procedure or a tool...
Milos Cernak
ICASSP
2010
IEEE
13 years 6 months ago
CBCD based on color features and landmark MDS-assisted distance estimation
Content-Based Copy Detection (CBCD) of digital videos is an important research field that aims at the identification of modified copies of an original clip, e.g., on the Intern...
Marzia Corvaglia, Fabrizio Guerrini, Riccardo Leon...
ICASSP
2010
IEEE
13 years 6 months ago
Exploring statistical properties for semantic annotation: sparse distributed and convergent assumptions for keywords
Does there exist a compact set of visual topics in form of keyword clusters capable to represent all images visual content within an acceptable error? In this paper, we answer thi...
Xianming Liu, Hongxun Yao, Rongrong Ji
ICASSP
2010
IEEE
13 years 6 months ago
Building pair-wise visual word tree for efficent image re-ranking
Bag-of-visual Words (BoW) image representation is getting popular in computer vision and multimedia communities. However, experiments show that the traditional BoW representation ...
Shiliang Zhang, Qingming Huang, Yijuan Lu, Wen Gao...