Sciweavers

1353 search results - page 181 / 271
» Text Indexing with Errors
Sort
View
DEBU
2010
108views more  DEBU 2010»
13 years 8 months ago
Weighted Set-Based String Similarity
Consider a universe of tokens, each of which is associated with a weight, and a database consisting of strings that can be represented as subsets of these tokens. Given a query st...
Marios Hadjieleftheriou, Divesh Srivastava
ENDM
2010
86views more  ENDM 2010»
13 years 8 months ago
Mathematical programming based debugging
Verifying that a piece of software has no bugs means proving that it has certain desired properties, such as an array index not taking values outside certain Abstract interpretati...
Leo Liberti, Stéphane Le Roux, Jeremy Lecon...
SIGIR
2008
ACM
13 years 8 months ago
Term clouds as surrogates for user generated speech
User generated spoken audio remains a challenge for Automatic Speech Recognition (ASR) technology and content-based audio surrogates derived from ASR-transcripts must be error rob...
Manos Tsagkias, Martha Larson, Maarten de Rijke
ICDM
2009
IEEE
175views Data Mining» more  ICDM 2009»
13 years 5 months ago
Maximum Margin Clustering with Multivariate Loss Function
This paper presents a simple but powerful extension of the maximum margin clustering (MMC) algorithm that optimizes multivariate performance measure specifically defined for clust...
Bin Zhao, James Tin-Yau Kwok, Changshui Zhang
JCDL
2006
ACM
176views Education» more  JCDL 2006»
14 years 2 months ago
A hierarchical, HMM-based automatic evaluation of OCR accuracy for a digital library of books
A number of projects are creating searchable digital libraries of printed books. These include the Million Book Project, the Google Book project and similar efforts from Yahoo an...
Shaolei Feng, R. Manmatha