Sciweavers

186 search results - page 8 / 38
» Experiments in Term Weighting and Keyword Extraction in Docu...
Sort
View
DOCENG
2010
ACM
13 years 8 months ago
Glyph extraction from historic document images
This paper is about the reproduction of ancient texts with vectorised fonts. While for OCR only recognition rates count, a reproduction process does not necessarily require the re...
Lothar Meyer-Lerbs, Arne Schuldt, Björn Gottf...
CIKM
2005
Springer
14 years 1 months ago
Query expansion using term relationships in language models for information retrieval
Language Modeling (LM) has been successfully applied to Information Retrieval (IR). However, most of the existing LM approaches only rely on term occurrences in documents, queries...
Jing Bai, Dawei Song, Peter Bruza, Jian-Yun Nie, G...
SIGIR
2005
ACM
14 years 1 months ago
Title extraction from bodies of HTML documents and its application to web page retrieval
This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...
Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...
ICAIL
2005
ACM
14 years 1 months ago
Effective Document Clustering for Large Heterogeneous Law Firm Collections
Computational resources for research in legal environments have historically implied remote access to large databases of legal documents such as case law, statutes, law reviews an...
Jack G. Conrad, Khalid Al-Kofahi, Ying Zhao, Georg...
CORR
2006
Springer
132views Education» more  CORR 2006»
13 years 7 months ago
Navigating multilingual news collections using automatically extracted information
We are presenting a text analysis tool set that allows analysts in various fields to sieve through large collections of multilingual news items quickly and to find information that...
Ralf Steinberger, Bruno Pouliquen, Camelia Ignat