Sciweavers

286 search results - page 33 / 58
» Automatic document indexing in large medical collections
IPM
2006
13 years 8 months ago
Dictionary-based text categorization of chemical web pages
A new dictionary-based text categorization approach is proposed to classify the chemical web pages efficiently. Using a chemistry dictionary, the approach can extract chemistry-re...
Chunyan Liang, Li Guo, Zhaojie Xia, Feng-Guang Nie...
SIGIR
2004
ACM
14 years 2 months ago
Resource selection for domain-specific cross-lingual IR
An under-explored question in cross-language information retrieval (CLIR) is to what degree the performance of CLIR methods depends on the availability of high-quality translation...
Monica Rogati, Yiming Yang
LREC
2010
13 years 10 months ago
Corpus and Evaluation Measures for Automatic Plagiarism Detection
The simple access to texts on digital libraries and the WWW has led to an increased number of plagiarism cases in recent years, which renders manual plagiarism detection infeasibl...
Alberto Barrón-Cedeño, Martin Pottha...
KDD
2002
ACM
14 years 9 months ago
A parallel learning algorithm for text classification
Text classification is the process of classifying documents into predefined categories based on their content. Existing supervised learning algorithms to automatically classify te...
Canasai Kruengkrai, Chuleerat Jaruskulchai
SIGMOD
2007
ACM
14 years 8 months ago
The TopX DB&IR engine
This paper proposes a demo of the TopX search engine, an extensive framework for unified indexing, querying, and ranking of large collections of unstructured, semistructured, and ...
Martin Theobald, Ralf Schenkel, Gerhard Weikum