Sciweavers

804 search results - page 127 / 161
» Text Segmentation Based on Similarity between Words
Sort
View
INFOSCALE
2007
ACM
13 years 11 months ago
A distributed incremental nearest neighbor algorithm
Searching for non-text data (e.g., images) is mostly done by means of metadata annotations or by extracting the text close to the data. However, supporting real content-based audi...
Fabrizio Falchi, Claudio Gennaro, Fausto Rabitti, ...
LREC
2008
112views Education» more  LREC 2008»
13 years 11 months ago
Modeling Document Dynamics: an Evolutionary Approach
News articles about the same event published over time have properties that challenge NLP and IR applications. A cluster of such texts typically exhibits instances of paraphrase a...
Jahna Otterbacher, Dragomir R. Radev
MEDINFO
2007
169views Healthcare» more  MEDINFO 2007»
13 years 11 months ago
Corpus-based Error Detection in a Multilingual Medical Thesaurus
Cross-language document retrieval systems require support by some kind of multilingual thesaurus for semantically indexing documents in different languages. The peculiarities of t...
Roosewelt L. Andrade, Edson José Pacheco, P...
ICWE
2007
Springer
14 years 4 months ago
Integrating Databases, Search Engines and Web Applications: A Model-Driven Approach
This paper addresses conceptual modeling and automatic code generation for search engine integration with data intensive Web applications. We have analyzed the similarities (and di...
Alessandro Bozzon, Tereza Iofciu, Wolfgang Nejdl, ...
ICASSP
2008
IEEE
14 years 4 months ago
Fine: Information embedding for document classification
The problem of document classification considers categorizing or grouping of various document types. Each document can be represented as a bag of words, which has no straightforw...
Kevin M. Carter, Raviv Raich, Alfred O. Hero