Sciweavers

2827 search results - page 101 / 566
» Marking Text Documents
Sort
View
WWW
2002
ACM
14 years 8 months ago
Using web structure for classifying and describing web pages
The structure of the web is increasingly being used to improve organization, search, and analysis of information on the web. For example, Google uses the text in citing documents ...
Eric J. Glover, Kostas Tsioutsiouliklis, Steve Law...
ICPR
2008
IEEE
14 years 9 months ago
Structural poisson mixtures for classification of documents
Considering the statistical text classification problem we approximate class-conditional probability distributions by structurally modified Poisson mixtures. By introducing the st...
Jana Novovicová, Jiri Grim, Petr Somol
CLEF
2007
Springer
14 years 2 months ago
Using Geographic Signatures as Query and Document Scopes in Geographic IR
This paper reports the participation of the University of Lisbon at the 2007 GeoCLEF task. We adopted a novel approach for GIR, focused on handling geographic features and feature ...
Nuno Cardoso, David Cruz, Marcirio Silveira Chaves...
CICLING
2006
Springer
13 years 12 months ago
A Comparative Evaluation of a New Unsupervised Sentence Boundary Detection Approach on Documents in English and Portuguese
Abstract. In this paper, we describe a new unsupervised sentence boundary detection system and present a comparative study evaluating its performance against different systems foun...
Jan Strunk, Carlos Nascimento Silla Jr., Celso A. ...
COLING
2010
13 years 3 months ago
Large Scale Parallel Document Mining for Machine Translation
A distributed system is described that reliably mines parallel text from large corpora. The approach can be regarded as cross-language near-duplicate detection, enabled by an init...
Jakob Uszkoreit, Jay Ponte, Ashok C. Popat, Moshe ...