Sciweavers

1149 search results - page 5 / 230
» Classification of Web Documents Using a Graph Model
Sort
View
NLDB
2000
Springer
13 years 11 months ago
Natural Language Analysis for Semantic Document Modeling
To ease the retrieval of documents published on the Web, the documents should be classified in a way that users find helpful and meaningful. This paper presents an approach to sema...
Terje Brasethvik, Jon Atle Gulla
ICDAR
2009
IEEE
13 years 5 months ago
Graph b-Coloring for Automatic Recognition of Documents
In order to reduce the rejection rate of our automatic reading system, we propose to pre-classify the business documents by introducing an Automatic Recognition of Documents stage...
Djamel Gaceb, Véronique Eglin, Frank Lebour...
CLEIEJ
2008
72views more  CLEIEJ 2008»
13 years 7 months ago
Measuring Contribution of HTML Features in Web Document Clustering
Documents in HTML format have many features to analyze, from the terms in special sections to the phrases that appear in the whole document. However, it is important to decide whi...
Esteban Meneses, Oldemar Rodríguez-Rojas
IJIS
2008
42views more  IJIS 2008»
13 years 7 months ago
The hybrid representation model for web document classification
Alex Markov, Mark Last, Abraham Kandel
VLDB
2000
ACM
125views Database» more  VLDB 2000»
13 years 11 months ago
Focused Crawling Using Context Graphs
Maintaining currency of search engine indices by exhaustive crawling is rapidly becoming impossible due to the increasing size and dynamic content of the web. Focused crawlers aim...
Michelangelo Diligenti, Frans Coetzee, Steve Lawre...