Sciweavers

415 search results - page 30 / 83
» Finding nuggets in documents: A machine learning approach
ECIR
2009
Springer
14 years 7 months ago
Combination of Documents Features Based on Simulated Click-through Data
Many different ranking algorithms based on content and context have been used in web search engines to find pages based on a user query. Furthermore, to achieve better performance ...
Ali Mohammad Zareh Bidoki, James A. Thom
ICDAR
2005
IEEE
14 years 3 months ago
Text Recognition of Low-resolution Document Images
Cheap and versatile cameras make it possible to easily and quickly capture a wide variety of documents. However, low-resolution cameras present a challenge to OCR because it is vi...
Charles E. Jacobs, Patrice Y. Simard, Paul A. Viol...
CORIA
2007
13 years 11 months ago
Apprentissage actif pour l'annotation de documents
ABSTRACT. In the framework of the LegDoc project at Xerox Research Centre Europe, we are developing components for the semantic annotation of semi-structured documents. While certa...
Loïc Lecerf, Boris Chidlovskii
AI
2011
Springer
13 years 1 months ago
Subspace Mapping of Noisy Text Documents
Abstract. Subspace mapping methods aim at projecting high-dimensional data into a subspace where a specific objective function is optimized. Such dimension reduction allows the re...
Axel J. Soto, Marc Strickert, Gustavo E. Vazquez, ...
COLING
2010
13 years 5 months ago
An Empirical Study on Web Mining of Parallel Data
This paper presents an empirical approach to mining parallel corpora. Conventional approaches use a readily available collection of comparable, nonparallel corpora to extract par...
Gum-Won Hong, Chi-Ho Li, Ming Zhou, Hae-Chang Rim