Sciweavers

145 search results - page 2 / 29
» Web Contents Tracking by Learning of Page Grammars
Sort
View
PKDD
2009
Springer
269views Data Mining» more  PKDD 2009»
14 years 1 months ago
Enhanced Web Page Content Visualization with Firefox
This paper aims at presenting how natural language processing and machine learning techniques can help the internet surfer to get a better overview of the pages he is reading. The ...
Lorand Dali, Delia Rusu, Dunja Mladenic
TREC
2001
13 years 8 months ago
Retrieving Web Pages Using Content, Links, URLs and Anchors
For this year's web track, we concentrated on the entry page finding task. For the content-only runs, in both the ad-hoc task and the entry page finding task, we used an infor...
Thijs Westerveld, Wessel Kraaij, Djoerd Hiemstra
ICADL
2004
Springer
137views Education» more  ICADL 2004»
14 years 11 days ago
Using Content-Based and Link-Based Analysis in Building Vertical Search Engines
This paper reports our research in the Web page filtering process in specialized search engine development. We propose a machine-learning-based approach that combines Web content a...
Michael Chau, Hsinchun Chen
WWW
2011
ACM
13 years 1 months ago
Identifying primary content from web pages and its application to web search ranking
Web pages are usually highly structured documents. In some documents, content with different functionality is laid out in blocks, some merely supporting the main discourse. In ot...
Srinivas Vadrevu, Emre Velipasaoglu
SIGIR
2005
ACM
14 years 16 days ago
Title extraction from bodies of HTML documents and its application to web page retrieval
This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...
Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...